Please use this identifier to cite or link to this item:
https://scidar.kg.ac.rs/handle/123456789/23166Full metadata record
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Polomac, Vladimir | - |
| dc.contributor.author | Milosevic, Marko | - |
| dc.contributor.author | Pavlović, Ana Marija | - |
| dc.contributor.author | Lutovac Kaznovac, Tamara | - |
| dc.date.accessioned | 2026-06-26T07:21:53Z | - |
| dc.date.available | 2026-06-26T07:21:53Z | - |
| dc.date.issued | 2026 | - |
| dc.identifier.citation | Ка генеричком HTR моделу за српске средњовековне рукописе / Владимир Р. Поломац, Марко М. Милошевић, Ана Марија Б. Павловић, Тамара Н. Лутовац Казновац // Српски језик : студије српске и словенске. - Vol. 31, No. 1 (2026), p. 79–89. (ISSN 0354-9259) | en_US |
| dc.identifier.issn | 0354-9259 | en_US |
| dc.identifier.uri | https://scidar.kg.ac.rs/handle/123456789/23166 | - |
| dc.description | Рад је настао у оквиру међународног билатералног пројекта Креирање AI модела за аутоматску обраду српских средњовековних рукописа, који финансирају Министарство науке, технолошког развоја и иновација Републике Србије и Немачка служба за академску размену (DAAD). Претходна верзија рада саопштена је на међународној конференцији Јужнословенски језици у дигиталном окружењу (Филолошки факултет у Београду, 21–23. новембар 2024. године). | en_US |
| dc.description.abstract | This paper presents the process of training and evaluating a general-purpose HTR (Handwritten Text Recognition) model designed for the automatic transcription of Serbian medieval manuscripts written in various forms of Cyrillic script, utilizing the Transkribus software platform. The primary practical outcome of this research is the development of the first version of a generic HTR model for Serbian medieval manuscripts, named Miroslav 1.0, in honor of the Miroslav Gospel, the most representative manuscript of the Serbian medieval tradition. The model was trained on a large and heterogeneous dataset comprising approximately 600,000 words extracted from 12th to 18th century manuscripts of various genres, all written in distinct styles of Cyrillic script (uncial, semi-uncial and cursive). The quantitative evaluation of the model’s performance indicates a character error rate (CER) ranging between 5% and 10% on out-of-sample manuscripts, which is considered highly satisfactory for historical manuscript transcription tasks. The implementation of this model significantly accelerates the transcription process of Serbian medieval manuscripts into machine-readable formats, thereby opening new avenues for corpus-based and quantitative research into the Serbian written heritage. Particularly noteworthy is the model’s extensibility: its accuracy and robustness can be further enhanced by expanding the training dataset with additional material. In addition to future improvements based on dataset augmentation, we also plan to train the model using the eScriptorium platform. This will provide researchers with an open-access alternative to Transkribus, thereby promoting broader accessibility and sustainability of digital tools in Slavic medieval manuscript studies. | en_US |
| dc.language.iso | sr | en_US |
| dc.publisher | Српски језик: студије српске и словенске, 31/1, стр. 79–89. | en_US |
| dc.relation.ispartof | Srpski jezik : studije srpske i slovenske | en_US |
| dc.rights | CC0 1.0 Universal | * |
| dc.rights.uri | http://creativecommons.org/publicdomain/zero/1.0/ | * |
| dc.subject | Serbian Medieval Manuscripts | en_US |
| dc.subject | Artificial intelligence | en_US |
| dc.subject | machine learning | en_US |
| dc.subject | HTR (Handwritten Text Recognition) | en_US |
| dc.subject | Transkribus software platform | en_US |
| dc.title | Ка генеричком HTR моделу за српске средњовековне рукописе | en_US |
| dc.title.alternative | TOWARDS GENERIC HTR MODEL FOR SERBIAN MEDIEVAL MANUSCRIPTS | en_US |
| dc.title.alternative | Ka generičkom HTR modelu za srpske srednjovekovne rukopise | en_US |
| dc.type | article | en_US |
| dc.description.version | Published | en_US |
| dc.identifier.doi | https://doi.org/10.18485/sj.2026.31.1.4 | en_US |
| dc.type.version | PublishedVersion | en_US |
| Appears in Collections: | The Faculty of Philology and Arts, Kragujevac (FILUM) | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| sj-2026-31-1-4.pdf | 398.14 kB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License
