CLAPJan 28, 2019

Diseño de un espacio semántico sobre la base de la Wikipedia. Una propuesta de análisis de la semántica latente para el idioma español

arXiv:1902.02173v1
Originality Synthesis-oriented
AI Analysis

This work addresses the need for semantic analysis tools in Spanish, but it is incremental as it applies an existing method to a new language.

The paper tackled the problem of creating a semantic space for the Spanish language using Latent Semantic Analysis (LSA), resulting in a document-text matrix with dimensions 1.3 x 10^6 and 5.9 x 10^6 that was decomposed into singular values for semantic analysis.

Latent Semantic Analysis (LSA) was initially conceived by the cognitive psychology at the 90s decade. Since its emergence, the LSA has been used to model cognitive processes, pointing out academic texts, compare literature works and analyse political speeches, among other applications. Taking as starting point multivariate method for dimensionality reduction, this paper propose a semantic space for Spanish language. Out results include a document text matrix with dimensions 1.3 x10^6 and 5.9x10^6, which later is decomposed into singular values. Those singular values are used to semantically words or text.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes