LO CL FL LGOct 15, 2020

Learning Languages with Decidable Hypotheses

Julian Berger, Maximilian Böther, Vanja Doskoč, Jonathan Gadea Harder, Nicolas Klodt, Timo Kötzing, Winfried Lötzsch, Jannik Peters, Leon Schiller, Lars Seifert, Armin Wells, Simon Wietheger

arXiv:2011.09866v11.2

Originality Incremental advance

AI Analysis

This work addresses foundational issues in computational learning theory for researchers, but it is incremental as it builds on existing frameworks with a new hypothesis type.

The paper tackles the problem of language learning in the limit by using C-indices (programs for characteristic functions) to name decidable languages, establishing a hierarchy of learning power based on restrictions of C-indices and showing that all settings are weaker than learning with W-indices, even for computable languages.

In language learning in the limit, the most common type of hypothesis is to give an enumerator for a language. This so-called $W$-index allows for naming arbitrary computably enumerable languages, with the drawback that even the membership problem is undecidable. In this paper we use a different system which allows for naming arbitrary decidable languages, namely programs for characteristic functions (called $C$-indices). These indices have the drawback that it is now not decidable whether a given hypothesis is even a legal $C$-index. In this first analysis of learning with $C$-indices, we give a structured account of the learning power of various restrictions employing $C$-indices, also when compared with $W$-indices. We establish a hierarchy of learning power depending on whether $C$-indices are required (a) on all outputs; (b) only on outputs relevant for the class to be learned and (c) only in the limit as final, correct hypotheses. Furthermore, all these settings are weaker than learning with $W$-indices (even when restricted to classes of computable languages). We analyze all these questions also in relation to the mode of data presentation. Finally, we also ask about the relation of semantic versus syntactic convergence and derive the map of pairwise relations for these two kinds of convergence coupled with various forms of data presentation.

View on arXiv PDF

Similar