CL, Jul 15, 2013

Part of Speech Tagging of Marathi Text Using Trigram Method

arXiv:1307.4299v1, 29 citations

Originality: Synthesis-oriented

AI Analysis

This work addresses a domain-specific problem in Marathi language processing, but it is incremental: it applies an existing method to new data.

The authors tackle part-of-speech tagging for Marathi, a morphologically rich language, using a statistical trigram method. The abstract mentions an evaluation but reports no concrete numbers.

In this paper we present a part-of-speech (POS) tagger for Marathi, a morphologically rich language spoken by the native people of Maharashtra. The tagger follows a statistical approach based on the trigram method. The main idea of the trigram method is to find the most likely POS tag for a token given the previous two tags, computing probabilities to determine the best tag sequence. We describe the development of the tagger and report its evaluation.
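The trigram idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give their tag set, corpus, smoothing, or decoding details. The add-one smoothing, the Viterbi search, the function names `train` and `viterbi`, and the toy English corpus below are all assumptions made for illustration only.

```python
from collections import defaultdict
import math

START = "<s>"  # padding tag for the two positions before the sentence

def train(tagged_sentences):
    """Count tag trigrams, their two-tag contexts, and word/tag emissions."""
    tri = defaultdict(int)        # (tag_{i-2}, tag_{i-1}, tag_i) -> count
    bi = defaultdict(int)         # (tag_{i-2}, tag_{i-1}) -> count
    emit = defaultdict(int)       # (tag, word) -> count
    tag_count = defaultdict(int)  # tag -> count
    vocab = set()
    for sent in tagged_sentences:
        prev2, prev1 = START, START
        for word, tag in sent:
            tri[(prev2, prev1, tag)] += 1
            bi[(prev2, prev1)] += 1
            emit[(tag, word)] += 1
            tag_count[tag] += 1
            vocab.add(word)
            prev2, prev1 = prev1, tag
    return tri, bi, emit, tag_count, vocab

def viterbi(words, model):
    """Return the highest-scoring tag sequence under the trigram model."""
    if not words:
        return []
    tri, bi, emit, tag_count, vocab = model
    tags = sorted(tag_count)
    n_tags, n_words = len(tags), len(vocab) + 1  # +1 slot for unseen words

    def log_trans(p2, p1, t):
        # Add-one smoothed P(t | p2, p1); the smoothing is an assumption.
        return math.log((tri.get((p2, p1, t), 0) + 1) /
                        (bi.get((p2, p1), 0) + n_tags))

    def log_emit(t, w):
        # Add-one smoothed P(w | t).
        return math.log((emit.get((t, w), 0) + 1) / (tag_count[t] + n_words))

    # best[(tag_{i-1}, tag_i)] = best log-probability of a prefix ending there
    best = {(START, START): 0.0}
    backpointers = []
    for w in words:
        new_best, bp = {}, {}
        for (p2, p1), score in best.items():
            for t in tags:
                s = score + log_trans(p2, p1, t) + log_emit(t, w)
                if (p1, t) not in new_best or s > new_best[(p1, t)]:
                    new_best[(p1, t)] = s
                    bp[(p1, t)] = p2  # remember tag_{i-2} for the traceback
        best = new_best
        backpointers.append(bp)

    # Follow backpointers from the best final state to recover the sequence.
    state = max(best, key=best.get)
    out = [None] * len(words)
    for i in range(len(words) - 1, -1, -1):
        out[i] = state[1]
        state = (backpointers[i][state], state[0])
    return out

# Toy corpus, invented for this sketch (not data from the paper).
corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("runs", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("a", "DET"), ("dog", "NOUN"), ("sleeps", "VERB")],
]
model = train(corpus)
print(viterbi(["the", "cat", "runs"], model))  # ['DET', 'NOUN', 'VERB']
```

Each candidate tag is scored by the product of a transition probability conditioned on the previous two tags and an emission probability for the word; the dynamic program keeps the best score per pair of adjacent tags, so the best full sequence is found without enumerating every possible tagging.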
