Congress of Balticists - Digital approaches to Old Baltic linguistic monuments

Digital approaches to Old Baltic linguistic monuments

Organizers:

Pietro Dini
Università di Pisa
Silvia Piccini
Istituto di Linguistica Computazionale "A. Zampolli", Pisa
Adriano Cerri
Università di Pisa

Program:

Open program

Abstracts:

Open abstracts

Description:

Over the last two decades, numerous significant projects have been undertaken to preserve, document, and study the legacy of Old Baltic linguistic monuments. Some of them focus on the text and resource repositories of a single linguistic tradition (cf. PKPDB for Old Prussian; SENIE for Old Latvian; SR, SLIEKKAS, ALQ and ALKT for Old Lithuanian), while others concentrate on specific authors (e.g. CorDon; SBCB) or specific textual genres (e.g. PosTiMe on Old Lithuanian Lutheran postils, OWNW on Old Baltic catechisms). However, the research landscape remains fragmented, characterized by disparate datasets and methodologies that lack integration. For instance, existing linguistic corpora and lexicons often employ divergent annotation practices and incompatible formats, making data interoperability inconvenient. Similarly, digital archives of ancient texts frequently adhere to project-specific schemas, limiting their accessibility for broader computational applications.

As a result, there is an increasing need nowadays for the establishment of a cohesive ecosystem that prioritizes the FAIRness of research data, metadata, and infrastructure, thus aligning with the principles of open science. Achieving this requires a collaborative effort among philologists, linguists and technologists to define unified standards, develop robust methodologies, and create frameworks that bridge the gap between traditional scholarship and cutting-edge technology.

This section aims to foster dialogue between tradition and innovation, focusing on the study of ancient Baltic texts through the application of new technologies, in the service of philological and linguistic research. In particular, contributions are invited on topics such as these:

the design and implementation of ontologies and lexicographical resources based on the semantic web to improve access and interoperability of linguistic data;
the digitization and analysis of ancient Baltic texts (manuscripts or printed) for the creation of digital archives;
the annotation of corpora, both synchronic and diachronic, for linguistic and philological research;
the application of computational tools for automatic language processing (NLP) in Baltic languages, including lesser-studied varieties.

We also invite researchers to present current or future projects focused on the valorization of ancient Baltic linguistic monuments through the use of digital and computational methodologies. Colleagues working from this perspective will have the opportunity to share their experiences, results, and new ideas.

References

ALKT = Kritische Edition altlitauischer Kleintexte vom Überlieferungsbeginn bis 1700.
ALQ = Altlitauisches Quellenverzeichnis.
CorDon = Digital Old Lithuanian: Corpus of Kristijonas Donelaitis (1714–1780).
OWNW = Old Words for a New World: Translating Christianity to Baltic Pagans.
PosTiMe = Postil Time Machine.
SENIE = Latviešu valodas seno tekstu korpuss.
SBCB = Samuelio Boguslavo Chylinskio Naujasis Testamentas. Rankraščio tyrimas, faksimilinis ir interaktyvus skaitmeninis leidimas.
SLIEKKAS = Technological and scientific basis for the linguistic annotation of Old Lithuanian Corpus.
PKPDB = Prūsų kalbos paveldo duomenų bazė.
SR = Senieji raštai / Database of Old Writings.

Accepted papers:

Everita Andronova
Challenges and some observations about part-of-speech tagging for early Latvian texts
Loïc Boizou, Mortimer Drach, Maxim Ionov, Jolanta Gelumbeckaitė, Øyvind Eide, Pascale Boisvert
PosTiMe – The Postil Time Machine
Pietro U. Dini, Silvia Piccini, Adriano Cerri
Old words for a new world: Lexicon and ontology of Christian terminology in early Baltic catechisms
Anna Helene Feulner, Henrik Hornecker
Das Altlitauische Quellenverzeichnis (ALQ): aktueller Stand und digitale Zukunft
Ilja Lemeškin
M. Mažvydo kaip knygų savininko ir skaitytojo įrašai. Dėl duomenų bazės „Spausdintų knygų glosynas“
Signe Rirdance, Everita Andronova
Towards a digital philology of Early Latvian: Observations and possibilities
Mindaugas Šinkūnas, Ona Aleknavičienė
LKI Senųjų raštų duomenų bazei – 30: darbai, galimybės, perspektyvos
Anta Trumpa
Dar kartą apie Georgo Elgerio žodyną „Dictionarium Polono-Latino-Lottauicum” (1683)