LG PRAug 24, 2012

Identification of Probabilities of Languages

arXiv:1208.5003v3

Originality Synthesis-oriented

AI Analysis

This work addresses foundational issues in computational learning theory for language modeling, but it appears incremental as it builds on existing frameworks like computable measures and typicality.

The paper tackles the problem of inferring probability distributions from infinite sequences of language data, considering both algorithms with and without round-off errors, and provides effective procedures for identification under various computability and dependency assumptions, using tools like the Strong Law of Large Numbers and Kolmogorov complexity.

We consider the problem of inferring the probability distribution associated with a language, given data consisting of an infinite sequence of elements of the languge. We do this under two assumptions on the algorithms concerned: (i) like a real-life algorothm it has round-off errors, and (ii) it has no round-off errors. Assuming (i) we (a) consider a probability mass function of the elements of the language if the data are drawn independent identically distributed (i.i.d.), provided the probability mass function is computable and has a finite expectation. We give an effective procedure to almost surely identify in the limit the target probability mass function using the Strong Law of Large Numbers. Second (b) we treat the case of possibly incomputable probabilistic mass functions in the above setting. In this case we can only pointswize converge to the target probability mass function almost surely. Third (c) we consider the case where the data are dependent assuming they are typical for at least one computable measure and the language is finite. There is an effective procedure to identify by infinite recurrence a nonempty subset of the computable measures according to which the data is typical. Here we use the theory of Kolmogorov complexity. Assuming (ii) we obtain the weaker result for (a) that the target distribution is identified by infinite recurrence almost surely; (b) stays the same as under assumption (i). We consider the associated predictions.

View on arXiv PDF

Similar