CV LGOct 12, 2021

Can machines learn to see without visual databases?

Alessandro Betti, Marco Gori, Stefano Melacci, Marcello Pelillo, Fabio Roli

arXiv:2110.05973v22.61 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of reducing dependency on massive datasets in computer vision, potentially offering an alternative to deep learning methods, though it appears incremental as it builds on existing ideas of human-like supervision.

The paper tackles the problem of enabling machines to learn visual skills without relying on large visual databases, proposing a human-like learning approach using vocal interactions and pointing aids.

This paper sustains the position that the time has come for thinking of learning machines that conquer visual skills in a truly human-like context, where a few human-like object supervisions are given by vocal interactions and pointing aids only. This likely requires new foundations on computational processes of vision with the final purpose of involving machines in tasks of visual description by living in their own visual environment under simple man-machine linguistic interactions. The challenge consists of developing machines that learn to see without needing to handle visual databases. This might open the doors to a truly orthogonal competitive track concerning deep learning technologies for vision which does not rely on the accumulation of huge visual databases.

View on arXiv PDF

Similar