Semantic clustering of Russian web search results: possibilities and problems
This paper addresses semantic clustering for Russian web search, but the contribution appears incremental: it applies existing distributional semantics methods to a specific language and dataset.
The paper tackles word sense induction from lexical co-occurrence graphs in order to cluster Russian web search results by query meaning. It compares clustering methods and source corpora, but reports no concrete results or numbers.
The paper deals with word sense induction from lexical co-occurrence graphs. We construct such graphs from large Russian corpora and then use them to cluster Mail.ru Search results according to the meanings of the query. We compare different clustering methods and different source corpora, and describe models for applying distributional semantics to big linguistic data.
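To make the pipeline above concrete, here is a minimal sketch of word sense induction from a co-occurrence graph, followed by clustering of result snippets. It is an illustration under stated assumptions, not the method used in the paper: the function names, the toy corpus, and the use of connected components as the graph clustering step are all hypothetical, and the text is assumed to be already tokenized and lemmatized.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_graph(sentences, min_count=1):
    """Link two distinct words if they share a sentence at least min_count times."""
    counts = defaultdict(int)
    for tokens in sentences:
        for a, b in combinations(sorted(set(tokens)), 2):
            counts[(a, b)] += 1
    graph = defaultdict(set)
    for (a, b), n in counts.items():
        if n >= min_count:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def induce_senses(graph, query):
    """Treat each connected component of the query's neighbourhood
    (with the query node itself removed) as one induced sense.
    Assumption: a real system would likely use a stronger graph
    clustering algorithm and edge weighting here."""
    neighbours = set(graph.get(query, ()))
    senses, seen = [], set()
    for start in sorted(neighbours):
        if start in seen:
            continue
        component, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend((graph[node] & neighbours) - component)
        seen |= component
        senses.append(component)
    return senses


def cluster_results(snippets, senses):
    """Label each snippet with the index of the sense sharing the most
    words with it, or -1 when no sense overlaps."""
    labels = []
    for tokens in snippets:
        overlaps = [len(set(tokens) & sense) for sense in senses]
        best = max(range(len(senses)), key=overlaps.__getitem__, default=-1)
        labels.append(best if best >= 0 and overlaps[best] > 0 else -1)
    return labels


# Toy corpus for the ambiguous query "лук" (onion / bow); purely illustrative.
corpus = [
    ["лук", "суп", "морковь"],
    ["лук", "морковь", "салат"],
    ["лук", "стрела", "тетива"],
    ["лук", "стрела", "охота"],
]
snippets = [
    ["рецепт", "суп", "лук"],
    ["спортивный", "лук", "тетива"],
    ["погода"],
]

senses = induce_senses(build_cooccurrence_graph(corpus), "лук")
print(cluster_results(snippets, senses))  # [0, 1, -1]
```

In this toy example the neighbourhood of "лук" splits into a cooking component and an archery component, so the first two snippets fall into different clusters and the third is left unassigned. Comparing methods or source corpora, as the abstract describes, would amount to swapping the clustering step or the input sentences while keeping the rest of the sketch fixed.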