The perpetual motion machine of AI-generated data and the distraction of ChatGPT-as-scientist
It addresses the hype around AI's role in science, but is incremental as it only offers a discussion without new findings.
The paper discusses the potential of AI, particularly LLMs like ChatGPT, to solve scientific problems and generate data for training AI in data-scarce domains, but it presents no concrete results or numbers.
Since ChatGPT works so well, are we on the cusp of solving science with AI? Is not AlphaFold2 suggestive that the potential of LLMs in biology and the sciences more broadly is limitless? Can we use AI itself to bridge the lack of data in the sciences in order to then train an AI? Herein we present a discussion of these topics.