Can ChatGPT-like Generative Models Guarantee Factual Accuracy? On the Mistakes of New Generation Search Engines
This paper addresses the reliability of AI-driven search engines for users, but its contribution is incremental: it critiques existing issues without proposing new solutions.
The paper questions the factual accuracy of ChatGPT-like models in new generation search engines, highlighting numerous mistakes in public demonstrations and calling for improvements in transparency and correctness.
Although large conversational AI models such as OpenAI's ChatGPT have demonstrated great potential, we question whether such models can guarantee factual accuracy. Recently, technology companies such as Microsoft and Google have announced new services which aim to combine search engines with conversational AI. However, we have found numerous mistakes in the public demonstrations that suggest we should not easily trust the factual claims of the AI models. Rather than criticizing specific models or companies, we hope to call on researchers and developers to improve AI models' transparency and factual correctness.