LGFeb 11

Predicting integers from continuous parameters

arXiv:2602.10751v1
AI Analysis

This addresses the challenge of integer prediction in machine learning, offering a method to avoid misrepresenting discrete labels as continuous, which is incremental but practical for applications like social media analytics.

The paper tackles the problem of predicting integer labels by modeling them directly with discrete distributions whose parameters are predicted from features, focusing on neural network outputs. They found that the Bitwise distribution and a discrete Laplace analogue performed best across tasks like tabular learning and image generation.

We study the problem of predicting numeric labels that are constrained to the integers or to a subrange of the integers. For example, the number of up-votes on social media posts, or the number of bicycles available at a public rental station. While it is possible to model these as continuous values, and to apply traditional regression, this approach changes the underlying distribution on the labels from discrete to continuous. Discrete distributions have certain benefits, which leads us to the question whether such integer labels can be modeled directly by a discrete distribution, whose parameters are predicted from the features of a given instance. Moreover, we focus on the use case of output distributions of neural networks, which adds the requirement that the parameters of the distribution be continuous so that backpropagation and gradient descent may be used to learn the weights of the network. We investigate several options for such distributions, some existing and some novel, and test them on a range of tasks, including tabular learning, sequential prediction and image generation. We find that overall the best performance comes from two distributions: Bitwise, which represents the target integer in bits and places a Bernoulli distribution on each, and a discrete analogue of the Laplace distribution, which uses a distribution with exponentially decaying tails around a continuous mean.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes