Topics Structuring online text database for Mongolic languages

2021/09/27

The IRC project “Structuring online text database for Mongolic languages,” started in 2020, is proceeding with building of an online text database based on linguistic corpora of Mongolic languages which the project manager has published. As an outcome, we launched “Online Text of Mongolic Languages” in November 2020.

This resource records text materials with grammatical analyses on Shinekhen Buryat, Khamnigan Mongolian and Khorchin Mongolian, with sound data. You can search items in original text, grammatical gloss, Japanese translation, and English translation of the languages and/or the text you selected on the website (AND searching available). If you want to search some morphemes in original texts, you can use the software keyboard for phonemes.

As for Shinekhen Buryat, we published ten folktales such as “A Swan and a Hunter” and “Old Man and Old Woman with a Spotted Cow.” In addition, in order to collect the sentences, we used the questionnaire Translation of daily conversation in common for the three languages, which the research group for the Mongolic languages of Inner Mongolia University prepared in the 1980s. The corpora also enable you to make a comparison of the phrases.

The corpora disclose only materials published by the project manager now. However, it is also designed to record any text materials with analyses on other Mongolic languages collected by other researchers, and expected of a wide range of use.

(Written by Yasuhiro YAMAKOSHI and Mayumi ADACHI)

2020 End-of-year report
Structuring online text database for Mongolic languages

Pick up

2025/03/12

Building an online digital resource based on the Swahili dictionary published from ILCAA

2025/03/23

Construction of Miandianguan Zazi database

2026/04/13

Digitally Archiving Kampo Arai’s Diary and Sketches in India (1916-18)

Announcements

By year