Machine Learning approaches to do size based reasoning on Retail Shelf objects to classify product variants
This addresses a specific challenge in retail inventory management by enabling accurate product variant classification, though it is incremental as it builds on existing object detection and classification methods.
The paper tackles the problem of classifying product variants that look identical but differ in size on retail shelves by using size-based reasoning from bounding boxes and brand predictions, achieving practical differentiation where computer vision alone fails.
There has been a surge in the number of Machine Learning methods to analyze products kept on retail shelves images. Deep learning based computer vision methods can be used to detect products on retail shelves and then classify them. However, there are different sized variants of products which look exactly the same visually and the method to differentiate them is to look at their relative sizes with other products on shelves. This makes the process of deciphering the sized based variants from each other using computer vision algorithms alone impractical. In this work, we propose methods to ascertain the size variant of the product as a downstream task to an object detector which extracts products from shelf and a classifier which determines product brand. Product variant determination is the task which assigns a product variant to products of a brand based on the size of bounding boxes and brands predicted by classifier. While gradient boosting based methods work well for products whose facings are clear and distinct, a noise accommodating Neural Network method is proposed for cases where the products are stacked irregularly.