
Arthur Schnitzler gehört zu den bedeutendsten österreichischen Autoren und war ein produktiver und gut vernetzter Briefschreiber. Seine Korrespondenz wurde jedoch
Short description of the project
The MultiHTR team is continuing the successful first project phase (June 1, 2020 to May 31, 2022) in order to expand the language portfolio in the second phase (June 1, 2022 to May 31, 2024) and make the latest advances in handwriting recognition (HTR) usable for the public and academia using artificial intelligence (AI). The overall project focuses on the (further) development of shorthand models for German, Yiddish written in the Hebrew alphabet, Ukrainian, Russian, Serbian and Ottoman. The automated transliteration and transcription models are intended to provide the public and researchers with access to previously inaccessible handwritten materials.
Project content
The MultiHTR team is continuing the results of the first successful project phase (June 1, 2020 to May 31, 2022) in order to expand the language portfolio in the second project phase (June 1, 2022 to May 31, 2024) and make the latest advances in the field of handwriting recognition (HTR) available to the population and academia. In this continuation, artificial intelligence (AI) will be used to develop advanced handwriting recognition models for languages and scripts not previously considered. The aim is to enable access to complex handwritten materials that were previously inaccessible to most users. The second phase focuses on the (further) development of shorthand models for German. In addition, a model for documents written in Hebrew Yiddish will be developed to make them accessible to descendants and the public. A further component is dedicated to the development of an HTR model for the Ukrainian language in order to make the indexing of Ukrainian-language archive holdings more efficient. At the same time, Ottoman-Turkish and Russian models are being further developed. The overarching goal of the project is to systematically advance progress in the field of handwriting recognition based on AI and to use the acquired technologies for the benefit of the population. In particular, the project focuses on the development of handwriting recognition models for German and for relevant migration languages in Germany/Baden-Württemberg. These models are to be trained by AI to automatically decode archive materials, ego documents and correspondence. In the first project phase, the project published models for Serbian and Russian. On the one hand, the automatically decoded texts serve as a basis for humanities research, in particular for micro-historical, discourse-analytical and sociolinguistic analyses. On the other hand, the population benefits directly by making complex, multilingual documents accessible without paleographic knowledge. The project is funded by the Baden-Württemberg Ministry of Science, Research and the Arts as part of the state's digital@bw digitization strategy.
achim.rabus@slavistik.uni-freiburg.de
multihtr@slavistik.uni-freiburg.de
Find out more at
www.multihtr.uni-freiburg.de
Add your DH research project to the project showcase by submitting a short project description via the web form. Enter project data, a brief description, a graphic or visualization as well as a detailed description of the project content with technical assignment, addressees, added value, project managers, funding information and duration.
Arthur Schnitzler gehört zu den bedeutendsten österreichischen Autoren und war ein produktiver und gut vernetzter Briefschreiber. Seine Korrespondenz wurde jedoch
Mit Dietrich online werden bibliographische Angaben zu ca. 5. Mio im deutschen Sprachraum von 1897- 1944 erschienenen Zeitschriftenaufsätzen und Zeitungsartikeln
Die digitale Arbeitsumgebung ediarum ist eine aus mehreren Softwarekomponenten bestehende Lösung, die es Wissenschaftler*innen erlaubt, Transkriptionen von Manuskripten und Drucken
Annotieren, Analysieren, Interpretieren und Visualisieren: In CATMA können Textwissenschaftler:innen so arbeiten, wie es ihren Fragestellungen am besten entspricht: qualitativ oder
Das Hidden Kosmos – Reconstructing Alexander von Humboldt’s »Kosmos-Lectures« widmete sich von 2014–16 der Ermittlung und Verzeichnung, Bild- und Volltext-Digitalisierung
Die drei von Text+ adressierten Datendomänen Sammlungen, lexikalische Ressourcen und Editionen gehören zu den klassischen Feldern geisteswissenschaftlicher Forschung. Das Plus-Zeichen
The digital working environment ediarum is a solution consisting of several software components that allows scholars to edit transcriptions of manuscripts and prints in TEI-compliant XML, add commentaries and a critical apparatus as well as indexes and publish them on the web and in print.
The Heinrich Heine Portal is based on the work of several generations of researchers by combining the two historical-critical complete editions of Heine, which were produced independently of each other in the Federal Republic of Germany and the German Democratic Republic, in one digital edition.
Wir verwenden Cookies und ähnliche Funktionen zur Verarbeitung von Daten. Die Zustimmung ist freiwillig und kann jederzeit widerrufen werden.