EXPERIENCE OF USING ARTIFICIAL INTELLIGENCE IN WORKING WITH MEDIEVAL MANUSCRIPTS
DOI:
https://doi.org/10.15330/gal.39.171-179Keywords:
paleography, artificial intelligence, computer vision, handwritten text recognition, machine learning.Abstract
The article presents the principles and approaches to the application of artificial intelligence (AI) in the work with handwritten historical documents. With the development of machine learning methods and computer vision, the use of automated systems for the analysis, structuring, and recognition of texts from scanned documents has become increasingly widespread, as evidenced by a substantial body of contemporary scholarly research in this field. The use of such mechanisms is becoming a standard practice in large-scale archival projects. However, despite the significant number of available tools, most existing solutions are primarily oriented toward the processing of documents from the modern period, while the problem of automated processing of medieval manuscripts remains insufficiently studied due to the variability of handwriting and the physical deterioration of the materials.
In this article, we analyze current approaches to the application of machine learning in the recognition of handwritten historical texts, in particular methods for the detection and segmentation of structural elements of documents. We also propose our own computer system capable of processing Latin-language documents from the Carolingian and Ottonian periods of the 9th–11th centuries. The particular complexity of working with documents from this period is due to the specifics of Carolingian minuscule, including the presence of numerous ligatures, ascenders and descenders, and characteristic medieval abbreviations, which pose serious challenges for standard OCR algorithms. At the same time, documents of the Carolingian period possess high source value for historical and palaeographic research, as they record early forms of administrative, legal, and written practices in Western Europe and reflect key stages in the formation of the medieval documentary tradition.
The system proposed in this paper is modular and consists of four interconnected machine learning models, each performing a specific role in the overall document processing pipeline. The system provides step-by-step detection of text lines and words, as well as their recognition through the combination of different models designed to identify visually and syntactically similar words. This approach improves the robustness of recognition under conditions of limited training data and ensures better adaptation to the specific characteristics of medieval handwriting.
