본문 바로가기
  • Home

A Comparative Study of the Performance of Russian Morphological Analyzers - MyStem, Pymorphy, and TreeTagger -

  • Journal of Humanities
  • 2026, (100), pp.123~154
  • Publisher : Institute for Humanities
  • Research Area : Humanities > Other Humanities
  • Received : January 5, 2026
  • Accepted : January 30, 2026
  • Published : February 28, 2026

Kim, Bo Ra 1

1경상국립대학교

Accredited

ABSTRACT

This study compares the practical performance of three Russian morphological analyzers-MyStem, Pymorphy, and TreeTagger—through experiments focusing on the analysis of neologisms and homonymous words. The results of the neologism analysis show that MyStem generates lemmas for almost all neologisms and tends to place them within existing Russian inflectional paradigms, presenting a wide range of possible grammatical features simultaneously. Pymorphy, which combines rule-based processing with probabilistic models, produces relatively consistent outputs by selecting a single most probable analysis; however, it occasionally misclassifies neologisms as proper nouns. TreeTagger provides stable part-of-speech and basic grammatical information, but often fails to generate lemmas for out-of-vocabulary words. In the homonym analysis, both MyStem and Pymorphy exhibit repeated misanalyses in sentences containing homonymous forms due to their lack of context-based disambiguation, whereas TreeTagger achieves the highest accuracy at the part-of-speech level by exploiting contextual information. Nevertheless, TreeTagger also shows errors in finer-grained grammatical categories and in the determination of verbal aspect. These findings indicate that the three morphological analyzers exhibit different strengths and limitations in handling neologisms and resolving ambiguity, and that the choice of an appropriate analyzer should depend on the research goals and the characteristics of the data.

Citation status

* References for papers published after 2024 are currently being built.

This paper was written with support from the National Research Foundation of Korea.