CL, IR | Sep 4, 2014

Semantic clustering of Russian web search results: possibilities and problems

arXiv:1409.1612v2 | 2 citations
Originality: Synthesis-oriented
AI Analysis

This paper addresses semantic clustering of Russian web search results, but the contribution appears incremental: it applies existing distributional semantics methods to a specific language and dataset.

The paper tackled the problem of word sense induction from lexical co-occurrence graphs to cluster Russian web search results by query meanings, comparing methods and corpora, but no concrete results or numbers were reported.

The paper deals with word sense induction from lexical co-occurrence graphs. We construct such graphs on large Russian corpora and then apply this data to cluster Mail.ru Search results according to meanings of the query. We compare different methods of performing such clustering and different source corpora. Models of applying distributional semantics to big linguistic data are described.
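The pipeline in the abstract (build a co-occurrence graph, induce senses of the query word, group search results by sense) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names, the toy transliterated corpus, and above all the induction step, which simply takes connected components among the query's neighbours. The abstract does not say which graph clustering methods the authors compare, so this should not be read as their algorithm.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_graph(sentences, min_count=1):
    """Link two words if they share a sentence at least `min_count` times."""
    counts = defaultdict(int)
    for tokens in sentences:
        for a, b in combinations(sorted(set(tokens)), 2):
            counts[(a, b)] += 1
    graph = defaultdict(set)
    for (a, b), count in counts.items():
        if count >= min_count:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def induce_senses(graph, query):
    """Hypothetical induction step: each connected component of the
    query's neighbourhood (with the query node removed) is one sense."""
    neighbours = set(graph.get(query, ()))
    senses, seen = [], set()
    for start in sorted(neighbours):
        if start in seen:
            continue
        component, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            # Follow only edges that stay inside the query's neighbourhood.
            stack.extend((graph[node] & neighbours) - component)
        seen |= component
        senses.append(component)
    return senses


def assign_to_sense(snippet_tokens, senses):
    """Return the index of the sense sharing most words with the snippet,
    or None when the snippet overlaps with no sense."""
    if not senses:
        return None
    overlaps = [len(set(snippet_tokens) & sense) for sense in senses]
    best = max(range(len(senses)), key=overlaps.__getitem__)
    return best if overlaps[best] > 0 else None


# Toy corpus (assumed): "luk" is ambiguous between "onion" and "bow".
corpus = [
    ["luk", "sup", "rezat"],
    ["luk", "rezat", "salat"],
    ["luk", "strela", "tetiva"],
    ["luk", "strela", "mishen"],
]
graph = build_cooccurrence_graph(corpus)
senses = induce_senses(graph, "luk")
snippets = [["recept", "sup", "luk"], ["kupit", "luk", "strela"]]
clusters = [assign_to_sense(s, senses) for s in snippets]
```

On this toy input the two snippets land in different sense clusters. On real corpora, connected components would usually be too coarse, since a single spurious edge can merge two senses; that is why a dedicated graph clustering method and an edge weight threshold would normally replace the component step.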

