Shijia Pan

AI
h-index7
5papers
92citations
Novelty35%
AI Score44

5 Papers

27.1SYApr 10
GEM: Gear-based Environment-Integrated Mobility for Adaptive Indoor Human Sensing

Shubham Rohal, Dong Yoon Lee, Phuc Nguyen et al.

Infrastructure-based sensing systems, like Wi-Fi, thermal, vibration-based approaches, provide continuous and unobtrusive indoor human monitoring services. They are often deployed statically for long-term continuous monitoring, which often leads to inefficient sensing/inflexible deployment due to human mobility or high maintenance/data volume for dense deployments. In contrast, autonomous and human carried mobile devices can better adapt to human mobility. However, their physical presence (e.g., drones or robots) may induce observer effects, while their operation often imposes additional burdens, such as wearing (e.g., wearables) and frequent charging. We present GEM, a hybrid scheme that introduces the mobility to infrastructure-based sensing. GEM integrates a matrix of gears into everyday surfaces (e.g., floors, walls) to turn them into "public transportation" for moving infrastructure sensors around. We design and fabricate a 3 x 3 gear matrix prototype that can effectively move sensors from one location to another. We further validate the scalability of the design through simulation of up to 64 x 64 gear matrix with concurrent sensors.

AIFeb 16, 2025
PlanGenLLMs: A Modern Survey of LLM Planning Capabilities

Hui Wei, Zihao Zhang, Shenghua He et al.

LLMs have immense potential for generating plans, transforming an initial world state into a desired goal state. A large body of research has explored the use of LLMs for various planning tasks, from web navigation to travel planning and database querying. However, many of these systems are tailored to specific problems, making it challenging to compare them or determine the best approach for new tasks. There is also a lack of clear and consistent evaluation criteria. Our survey aims to offer a comprehensive overview of current LLM planners to fill this gap. It builds on foundational work by Kartam and Wilkins (1990) and examines six key performance criteria: completeness, executability, optimality, representation, generalization, and efficiency. For each, we provide a thorough analysis of representative works and highlight their strengths and weaknesses. Our paper also identifies crucial future directions, making it a valuable resource for both practitioners and newcomers interested in leveraging LLM planning to support agentic workflows.

LGJun 13, 2025
A Survey of Foundation Models for IoT: Taxonomy and Criteria-Based Analysis

Hui Wei, Dong Yoon Lee, Shubham Rohal et al.

Foundation models have gained growing interest in the IoT domain due to their reduced reliance on labeled data and strong generalizability across tasks, which address key limitations of traditional machine learning approaches. However, most existing foundation model based methods are developed for specific IoT tasks, making it difficult to compare approaches across IoT domains and limiting guidance for applying them to new tasks. This survey aims to bridge this gap by providing a comprehensive overview of current methodologies and organizing them around four shared performance objectives by different domains: efficiency, context-awareness, safety, and security & privacy. For each objective, we review representative works, summarize commonly-used techniques and evaluation metrics. This objective-centric organization enables meaningful cross-domain comparisons and offers practical insights for selecting and designing foundation model based solutions for new IoT tasks. We conclude with key directions for future research to guide both practitioners and researchers in advancing the use of foundation models in IoT applications.

SDAug 28, 2025
RARR : Robust Real-World Activity Recognition with Vibration by Scavenging Near-Surface Audio Online

Dong Yoon Lee, Alyssa Weakley, Hui Wei et al.

One in four people dementia live alone, leading family members to take on caregiving roles from a distance. Many researchers have developed remote monitoring solutions to lessen caregiving needs; however, limitations remain including privacy preserving solutions, activity recognition, and model generalizability to new users and environments. Structural vibration sensor systems are unobtrusive solutions that have been proven to accurately monitor human information, such as identification and activity recognition, in controlled settings by sensing surface vibrations generated by activities. However, when deploying in an end user's home, current solutions require a substantial amount of labeled data for accurate activity recognition. Our scalable solution adapts synthesized data from near-surface acoustic audio to pretrain a model and allows fine tuning with very limited data in order to create a robust framework for daily routine tracking.

CVApr 6, 2020
LaNet: Real-time Lane Identification by Learning Road SurfaceCharacteristics from Accelerometer Data

Madhumitha Harishankar, Jun Han, Sai Vineeth Kalluru Srinivas et al.

The resolution of GPS measurements, especially in urban areas, is insufficient for identifying a vehicle's lane. In this work, we develop a deep LSTM neural network model LaNet that determines the lane vehicles are on by periodically classifying accelerometer samples collected by vehicles as they drive in real time. Our key finding is that even adjacent patches of road surfaces contain characteristics that are sufficiently unique to differentiate between lanes, i.e., roads inherently exhibit differing bumps, cracks, potholes, and surface unevenness. Cars can capture this road surface information as they drive using inexpensive, easy-to-install accelerometers that increasingly come fitted in cars and can be accessed via the CAN-bus. We collect an aggregate of 60 km driving data and synthesize more based on this that capture factors such as variable driving speed, vehicle suspensions, and accelerometer noise. Our formulated LSTM-based deep learning model, LaNet, learns lane-specific sequences of road surface events (bumps, cracks etc.) and yields 100% lane classification accuracy with 200 meters of driving data, achieving over 90% with just 100 m (correspondingly to roughly one minute of driving). We design the LaNet model to be practical for use in real-time lane classification and show with extensive experiments that LaNet yields high classification accuracy even on smooth roads, on large multi-lane roads, and on drives with frequent lane changes. Since different road surfaces have different inherent characteristics or entropy, we excavate our neural network model and discover a mechanism to easily characterize the achievable classification accuracies in a road over various driving distances by training the model just once. We present LaNet as a low-cost, easily deployable and highly accurate way to achieve fine-grained lane identification.