Ternary and Binary Quantization for Improved Classification
This addresses data complexity reduction for classification tasks, but it is incremental as it builds on existing quantization and random projection methods.
The paper tackles the problem of classification accuracy degradation from data quantization by showing that quantizing sparse features to ternary or binary codes can yield comparable or superior accuracy, validated through extensive experiments.
Dimension reduction and data quantization are two important methods for reducing data complexity. In the paper, we study the methodology of first reducing data dimension by random projection and then quantizing the projections to ternary or binary codes, which has been widely applied in classification. Usually, the quantization will seriously degrade the accuracy of classification due to high quantization errors. Interestingly, however, we observe that the quantization could provide comparable and often superior accuracy, as the data to be quantized are sparse features generated with common filters. Furthermore, this quantization property could be maintained in the random projections of sparse features, if both the features and random projection matrices are sufficiently sparse. By conducting extensive experiments, we validate and analyze this intriguing property.