When ChatGPT for Computer Vision Will Come? From 2D to 3D
This is an incremental perspective piece on the gap in AI for 3D vision, relevant to researchers and practitioners in computer vision and AI.
The paper discusses the absence of a ChatGPT-like model for computer vision, particularly in 3D, and provides an outlook on the development of AI-generated content in 3D from a data perspective.
ChatGPT and its improved variant GPT4 have revolutionized the NLP field with a single model solving almost all text related tasks. However, such a model for computer vision does not exist, especially for 3D vision. This article first provides a brief view on the progress of deep learning in text, image and 3D fields from the model perspective. Moreover, this work further discusses how AIGC evolves from the data perspective. On top of that, this work presents an outlook on the development of AIGC in 3D from the data perspective.