ROOct 11, 2023
CoPAL: Corrective Planning of Robot Actions with Large Language ModelsFrank Joublin, Antonello Ceravola, Pavel Smirnov et al.
In the pursuit of fully autonomous robotic systems capable of taking over tasks traditionally performed by humans, the complexity of open-world environments poses a considerable challenge. Addressing this imperative, this study contributes to the field of Large Language Models (LLMs) applied to task and motion planning for robots. We propose a system architecture that orchestrates a seamless interplay between multiple cognitive levels, encompassing reasoning, planning, and motion generation. At its core lies a novel replanning strategy that handles physically grounded, logical, and semantic errors in the generated plans. We demonstrate the efficacy of the proposed feedback architecture, particularly its impact on executability, correctness, and time complexity via empirical evaluation in the context of a simulation and two intricate real-world scenarios: blocks world, barman and pizza preparation.
CVJan 2
RePose: A Real-Time 3D Human Pose Estimation and Biomechanical Analysis Framework for RehabilitationJunxiao Xue, Pavel Smirnov, Ziao Li et al.
We propose a real-time 3D human pose estimation and motion analysis method termed RePose for rehabilitation training. It is capable of real-time monitoring and evaluation of patients'motion during rehabilitation, providing immediate feedback and guidance to assist patients in executing rehabilitation exercises correctly. Firstly, we introduce a unified pipeline for end-to-end real-time human pose estimation and motion analysis using RGB video input from multiple cameras which can be applied to the field of rehabilitation training. The pipeline can help to monitor and correct patients'actions, thus aiding them in regaining muscle strength and motor functions. Secondly, we propose a fast tracking method for medical rehabilitation scenarios with multiple-person interference, which requires less than 1ms for tracking for a single frame. Additionally, we modify SmoothNet for real-time posture estimation, effectively reducing pose estimation errors and restoring the patient's true motion state, making it visually smoother. Finally, we use Unity platform for real-time monitoring and evaluation of patients' motion during rehabilitation, and to display the muscle stress conditions to assist patients with their rehabilitation training.
ROApr 11, 2024
Generating consistent PDDL domains with Large Language ModelsPavel Smirnov, Frank Joublin, Antonello Ceravola et al.
Large Language Models (LLMs) are capable of transforming natural language domain descriptions into plausibly looking PDDL markup. However, ensuring that actions are consistent within domains still remains a challenging task. In this paper we present a novel concept to significantly improve the quality of LLM-generated PDDL models by performing automated consistency checking during the generation process. Although the proposed consistency checking strategies still can't guarantee absolute correctness of generated models, they can serve as valuable source of feedback reducing the amount of correction efforts expected from a human in the loop. We demonstrate the capabilities of our error detection approach on a number of classical and custom planning domains (logistics, gripper, tyreworld, household, pizza).
AIMay 9, 2025
A Grounded Memory System For Smart Personal AssistantsFelix Ocker, Jörg Deigmöller, Pavel Smirnov et al.
A wide variety of agentic AI applications - ranging from cognitive assistants for dementia patients to robotics - demand a robust memory system grounded in reality. In this paper, we propose such a memory system consisting of three components. First, we combine Vision Language Models for image captioning and entity disambiguation with Large Language Models for consistent information extraction during perception. Second, the extracted information is represented in a memory consisting of a knowledge graph enhanced by vector embeddings to efficiently manage relational information. Third, we combine semantic search and graph query generation for question answering via Retrieval Augmented Generation. We illustrate the system's working and potential using a real-world example.
SEApr 16, 2021
Smart Home Crawler: Towards a framework for semi-automatic IoT sensor integrationMartin Strohbach, Luis Adan Saavedra, Pavel Smirnov et al.
Sensor deployments in Smart Homes have long reached commercial relevance for applications such as home automation, home safety or energy consumption awareness and reduction. Nevertheless, due to the heterogeneity of sensor devices and gateways, data integration is still a costly and timeconsuming process. In this paper we propose the Smart Home Crawler Framework that (1) provides a common semantic abstraction from the underlying sensor and gateway technologies, and (2) accelerates the integration of new devices by applying machine learning techniques for linking discovered devices to a semantic data model. We present a first prototype that was demonstrated at ICT 2018. The prototype was built as a domainspecific crawling component for IoTCrawler, a secure and privacy-preserving search engine for the Internet of Things.