CLROSep 28, 2015

A Preliminary Study on the Learning Informativeness of Data Subsets

arXiv:1510.04104v1
Originality Synthesis-oriented
AI Analysis

This work addresses computational complexity issues in robotic systems for timely human-robot interaction, but it is incremental as it builds on existing methods for data subset analysis.

The study tackled the problem of reducing training data size while retaining symbolic learning potential by analyzing the learning informativeness of data subsets, and demonstrated the concept on human-written texts.

Estimating the internal state of a robotic system is complex: this is performed from multiple heterogeneous sensor inputs and knowledge sources. Discretization of such inputs is done to capture saliences, represented as symbolic information, which often presents structure and recurrence. As these sequences are used to reason over complex scenarios, a more compact representation would aid exactness of technical cognitive reasoning capabilities, which are today constrained by computational complexity issues and fallback to representational heuristics or human intervention. Such problems need to be addressed to ensure timely and meaningful human-robot interaction. Our work is towards understanding the variability of learning informativeness when training on subsets of a given input dataset. This is in view of reducing the training size while retaining the majority of the symbolic learning potential. We prove the concept on human-written texts, and conjecture this work will reduce training data size of sequential instructions, while preserving semantic relations, when gathering information from large remote sources.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes