Corpus Infrastructure
The Mari corpus project was initiated by scholars from Ghent (Alexandra Simonenko), Helsinki (Jack Rueter), Moscow (Anna Volkova), Munich/Vienna (Jeremy Bradley), Tromsø (Trond Trosterud), Turku (Jorma Luutonen), and Yoshkar-Ola (Andrey Chemyshev).
It is an effort to create a morphologically annotated corpus of literary Mari (both Meadow Mari and Hill Mari) that can be searched in many ways: by lexeme, by morphological pattern, and by syntactic pattern. It will contain tens of millions of words of Meadow Mari and several million words of Hill Mari (exact figures to follow), with texts ranging from the early 20th century to the present day.
Working demos of our efforts can be found here (Tromsø) and here (Vienna); the first proper release (including tutorials on using our corpus infrastructure) is planned for later in 2018.
Participating and supporting institutions:
The Mari Web Project is primarily based at the Department of Finno-Ugric Studies at the University of Vienna. The Mari-English Dictionary was funded by the Austrian Science Fund (FWF): P22786-G20. The second stage of the project is being funded by the Kone Foundation (The Mari Web Project: Phase 2). Some of our work is carried out at the Institute of Finno-Ugric and Uralic Studies at the Ludwig Maximilian University of Munich.
Last update: 8 April 2018