LG · CV · NC · Jun 1, 2023

MindBigData 2023 MNIST-8B The 8 billion datapoints Multimodal Dataset of Brain Signals

arXiv:2306.00455v1 · 1 citation · h-index: 1
Originality: Synthesis-oriented
AI Analysis

This dataset addresses the need of neuroscience and AI researchers for large-scale brain signal data, though it is an incremental expansion of prior work.

The authors introduced MindBigData 2023 MNIST-8B, the largest open dataset of brain signals for machine learning, containing 8 billion datapoints from EEG recordings of a single subject viewing and listening to MNIST digits.

MindBigData 2023 MNIST-8B is, to date (June 1st, 2023), the largest open dataset of brain signals created for machine learning. It is based on EEG signals from a single subject, captured using a custom 128-channel device, and replicates the full 70,000 digits of the MNIST dataset by Yann LeCun et al. The brain signals were captured while the subject watched the pixels of the original digits one by one on a screen and listened at the same time to the spoken number, 0 to 9, given by the true label. The data, collection procedures, hardware, and software created are described in detail; additional background information and other related datasets can be found in our previous paper, MindBigData 2022: A Large Dataset of Brain Signals.

