IR, May 14, 2014
Which one is better: presentation-based or content-based math search?
Minh-Quoc Nghiem, Giovanni Yoko Kristianto, Goran Topic et al.
Mathematical content is a valuable information source, and retrieving it has become an important problem. This paper compares two search strategies for math expressions: presentation-based and content-based. The presentation-based search uses a state-of-the-art math search system, while the content-based search uses semantic enrichment to convert math expressions into their content forms and then searches over those content-based expressions. By taking the meaning of math expressions into account, the content-based approach improves search quality over presentation-based systems.
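The difference between the two strategies can be illustrated with a toy sketch. It is not the system described in the abstract: the index, the hand-assigned content tuples, and the function names `presentation_search` and `content_search` are all hypothetical, chosen only to show why matching on meaning can retrieve expressions that look different on the page.

```python
# Toy index. Each entry pairs a presentation string with a hand-assigned
# content form. Both are illustrative assumptions, not output of any real
# semantic-enrichment system.
FORMULAS = [
    {"id": 1, "presentation": "f'(x)", "content": ("diff", "f", "x")},
    {"id": 2, "presentation": "df/dx", "content": ("diff", "f", "x")},
    {"id": 3, "presentation": "f(x)", "content": ("apply", "f", "x")},
]


def presentation_search(query):
    """Return ids of entries whose surface form matches the query exactly."""
    return [e["id"] for e in FORMULAS if e["presentation"] == query]


def content_search(query_content):
    """Return ids of entries whose content form matches the query's meaning."""
    return [e["id"] for e in FORMULAS if e["content"] == query_content]


# The query "df/dx" matches only itself by appearance ...
by_presentation = presentation_search("df/dx")
# ... but its content form also matches the prime notation f'(x).
by_content = content_search(("diff", "f", "x"))
```

In this sketch the content-based lookup returns both derivative notations, while the presentation-based lookup returns only the literal match. A real system would have to derive the content form automatically, which is the semantic-enrichment step the abstract refers to.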
DL, May 31, 2013
A hybrid approach for semantic enrichment of MathML mathematical expressions
Minh-Quoc Nghiem, Giovanni Yoko Kristianto, Goran Topic et al.
In this paper, we present a new approach to the problem of semantic enrichment of mathematical expressions. Our approach combines statistical machine translation with disambiguation that makes use of the text surrounding the mathematical expressions. We first use a Support Vector Machine classifier to disambiguate mathematical terms using both their presentation form and surrounding text. We then use the disambiguation result to enhance the semantic enrichment of a statistical-machine-translation-based system. Experimental results show that our system achieves improvements over prior systems.