A big data approach towards sarcasm detection in Russian
This work addresses the problem of automated text processing for Russian language users, but it is incremental as it builds on existing inflection methods.
The authors developed deterministic algorithms for Russian inflection and text synthesis, implemented in a public web-service, and tested performance against the OpenCorpora corpus to estimate morphological variability in Russian.
We present a set of deterministic algorithms for Russian inflection and automated text synthesis. These algorithms are implemented in a publicly available web-service www.passare.ru. This service provides functions for inflection of single words, word matching and synthesis of grammatically correct Russian text. Selected code and datasets are available at https://github.com/passare-ru/PassareFunctions/ Performance of the inflectional functions has been tested against the annotated corpus of Russian language OpenCorpora, compared with that of other solutions, and used for estimating the morphological variability and complexity of different parts of speech in Russian.