LGAICLCVMMJul 13, 2024

IoT-LM: Large Multisensory Language Models for the Internet of Things

arXiv:2407.09801v127 citationsh-index: 88Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of scalable and efficient inference for IoT applications, such as human wellbeing and smart cities, by providing a tailored model and dataset, though it is incremental as it builds on existing large language models with new adapters.

The authors tackled the challenge of processing diverse IoT sensor data by introducing IoT-LM, a large multisensory language model, which achieved substantial improvements on 8 supervised IoT classification tasks and enabled new interactive capabilities like question-answering and reasoning.

The Internet of Things (IoT) network integrating billions of smart physical devices embedded with sensors, software, and communication technologies is a critical and rapidly expanding component of our modern world. The IoT ecosystem provides a rich source of real-world modalities such as motion, thermal, geolocation, imaging, depth, sensors, and audio to recognize the states of humans and physical objects. Machine learning presents a rich opportunity to automatically process IoT data at scale, enabling efficient inference for understanding human wellbeing, controlling physical devices, and interconnecting smart cities. To realize this potential, we introduce IoT-LM, an open-source large multisensory language model tailored for the IoT ecosystem. IoT-LM is enabled by two technical contributions: the first is MultiIoT, the most expansive unified IoT dataset to date, encompassing over 1.15 million samples from 12 modalities and 8 tasks prepared for multisensory pre-training and instruction-tuning. The second is a new multisensory multitask adapter layer to condition pre-trained large language models on multisensory IoT data. Not only does IoT-LM yield substantial improvements on 8 supervised IoT classification tasks, but it also demonstrates new interactive question-answering, reasoning, and dialog capabilities conditioned on IoT sensors. We release IoT-LM's data sources and new multisensory language modeling framework.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes