Part of Speech Tagging of Marathi Text Using Trigram Method
This work addresses a domain-specific problem for Marathi language processing, but it is incremental as it applies an existing method to new data.
The authors tackled part-of-speech tagging for Marathi, a morphologically rich language, using a statistical trigram method, achieving an evaluation result but without concrete numbers provided.
In this paper we present a Marathi part of speech tagger. It is a morphologically rich language. It is spoken by the native people of Maharashtra. The general approach used for development of tagger is statistical using trigram Method. The main concept of trigram is to explore the most likely POS for a token based on given information of previous two tags by calculating probabilities to determine which is the best sequence of a tag. In this paper we show the development of the tagger. Moreover we have also shown the evaluation done.