Reza Langari

h-index12

9papers

71citations

Novelty49%

AI Score43

Ranked #78,079 of 201,018 authors (top 39%)#2,513 in RO (top 33%)

9 Papers

CVMar 27, 2023

Real-Time Semantic Segmentation using Hyperspectral Images for Mapping Unstructured and Unknown Environments

Anthony Medellin, Anant Bhamri, Reza Langari et al.

Autonomous navigation in unstructured off-road environments is greatly improved by semantic scene understanding. Conventional image processing algorithms are difficult to implement and lack robustness due to a lack of structure and high variability across off-road environments. The use of neural networks and machine learning can overcome the previous challenges but they require large labeled data sets for training. In our work we propose the use of hyperspectral images for real-time pixel-wise semantic classification and segmentation, without the need of any prior training data. The resulting segmented image is processed to extract, filter, and approximate objects as polygons, using a polygon approximation algorithm. The resulting polygons are then used to generate a semantic map of the environment. Using our framework. we show the capability to add new semantic classes in run-time for classification. The proposed methodology is also shown to operate in real-time and produce outputs at a frequency of 1Hz, using high resolution hyperspectral images.

CVMar 15, 2024Code

SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images

Pardis Taghavi, Reza Langari, Gaurav Pandey

This research paper presents an innovative multi-task learning framework that allows concurrent depth estimation and semantic segmentation using a single camera. The proposed approach is based on a shared encoder-decoder architecture, which integrates various techniques to improve the accuracy of the depth estimation and semantic segmentation task without compromising computational efficiency. Additionally, the paper incorporates an adversarial training component, employing a Wasserstein GAN framework with a critic network, to refine model's predictions. The framework is thoroughly evaluated on two datasets - the outdoor Cityscapes dataset and the indoor NYU Depth V2 dataset - and it outperforms existing state-of-the-art methods in both segmentation and depth estimation tasks. We also conducted ablation studies to analyze the contributions of different components, including pre-training strategies, the inclusion of critics, the use of logarithmic depth scaling, and advanced image augmentations, to provide a better understanding of the proposed framework. The accompanying source code is accessible at \url{https://github.com/PardisTaghavi/SwinMTL}.

CVApr 4

Training a Student Expert via Semi-Supervised Foundation Model Distillation

Pardis Taghavi, Tian Liu, Renjie Li et al.

Foundation models deliver strong perception but are often too computationally heavy to deploy, and adapting them typically requires costly annotations. We introduce a semi-supervised knowledge distillation (SSKD) framework that compresses pre-trained vision foundation models (VFMs) into compact experts using limited labeled and abundant unlabeled data, and instantiate it for instance segmentation where per-pixel labels are particularly expensive. The framework unfolds in three stages: (1) domain adaptation of the VFM(s) via self-training with contrastive calibration, (2) knowledge transfer through a unified multi-objective loss, and (3) student refinement to mitigate residual pseudo-label bias. Central to our approach is an instance-aware pixel-wise contrastive loss that fuses mask and class scores to extract informative negatives and enforce clear inter-instance margins. By maintaining this contrastive signal across both adaptation and distillation, we align teacher and student embeddings and more effectively leverage unlabeled images. On Cityscapes and ADE20K, our $\approx 11\times$ smaller student improves over its zero-shot VFM teacher(s) by +11.9 and +8.6 AP, surpasses adapted teacher(s) by +3.4 and +1.5 AP, and outperforms state-of-the-art SSKD methods on benchmarks.

ROMar 9

NaviDriveVLM: Decoupling High-Level Reasoning and Motion Planning for Autonomous Driving

Ximeng Tao, Pardis Taghavi, Dimitar Filev et al.

Vision-language models (VLMs) have emerged as a promising direction for end-to-end autonomous driving (AD) by jointly modeling visual observations, driving context, and language-based reasoning. However, existing VLM-based systems face a trade-off between high-level reasoning and motion planning: large models offer strong semantic understanding but are costly to adapt for precise control, whereas small VLM models can be fine-tuned efficiently but often exhibit weaker reasoning. We propose NaviDriveVLM, a decoupled framework that separates reasoning from action generation using a large-scale Navigator and a lightweight trainable Driver. This design preserves reasoning ability, reduces training cost, and provides an explicit interpretable intermediate representation for downstream planning. Experiments on the nuScenes benchmark show that NaviDriveVLM outperforms large VLM baselines in end-to-end motion planning.

CVMay 28, 2025

CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation

Pardis Taghavi, Tian Liu, Renjie Li et al.

Instance segmentation demands costly per-pixel annotations and computationally expensive models. We introduce CAST, a semi-supervised knowledge distillation (SSKD) framework that compresses pre-trained vision foundation models (VFM) into compact experts using limited labeled and abundant unlabeled data. CAST unfolds in three stages: (1) domain adaptation of the VFM(s) via self-training with contrastive calibration, (2) knowledge transfer through a unified multi-objective loss, and (3) student refinement to mitigate residual pseudo-label bias. Central to CAST is an \emph{instance-aware pixel-wise contrastive loss} that fuses mask and class scores to extract informative negatives and enforce clear inter-instance margins. By maintaining this contrastive signal across both adaptation and distillation, we align teacher and student embeddings and fully leverage unlabeled images. On Cityscapes and ADE20K, our ~11x smaller student improves over its zero-shot VFM teacher(s) by +8.5 and +7.1 AP, surpasses adapted teacher(s) by +3.4 and +1.5 AP, and further outperforms state-of-the-art SSKD methods on both benchmarks.

RODec 6, 2017

A Computational Approach for Human-like Motion Generation in Upper Limb Exoskeletons Supporting Scapulohumeral Rhythms

Rana Soltani-Zarrin, Amin Zeiaee, Reza Langari et al.

This paper proposes a computational approach for generation of reference path for upper-limb exoskeletons considering the scapulohumeral rhythms of the shoulder. The proposed method can be used in upper-limb exoskeletons with 3 Degrees of Freedom (DoF) in shoulder and 1 DoF in elbow, which are capable of supporting shoulder girdle. The developed computational method is based on Central Nervous System (CNS) governing rules. Existing computational reference generation methods are based on the assumption of fixed shoulder center during motions. This assumption can be considered valid for reaching movements with limited range of motion (RoM). However, most upper limb motions such as Activities of Daily Living (ADL) include large scale inward and outward reaching motions, during which the center of shoulder joint moves significantly. The proposed method generates the reference motion based on a simple model of human arm and a transformation can be used to map the developed motion for other exoskeleton with different kinematics. Comparison of the model outputs with experimental results of healthy subjects performing ADL, show that the proposed model is able to reproduce human-like motions.

RODec 6, 2017

A Systematic Approach For Kinematic Design Of Upper Limb Rehabilitation Exoskeletons

Rana Soltani-Zarrin, Amin Zeiaee, Reza Langari et al.

Kinematic structure of an exoskeleton is the most fundamental block of its design and is determinant of many functional capabilities of it. Although numerous upper limb rehabilitation devices have been designed in the recent years, there is not a framework that can systematically guide the kinematic design procedure. Additionally, diversity of currently available devices and the many minute details incorporated to address certain design requirements hinders pinpointing the core kinematics of the available devices to compare them against each other. This makes the review of literature for identifying drawbacks of the state of the art systems a challenging and puzzling task. In fact, lack of a unifying framework makes designing rehabilitation devices an intuitive process and prone to biases from currently available designs. This research work proposes a systematic approach for kinematic design of upper limb rehabilitation exoskeletons based on conceptual design techniques. Having defined a solution neutral problem statement based on the characteristics of an ideal device, the main functionality of the system is divided into smaller functional units via the Functional Decomposition Method. Various directions for concept generation are explored and finally, it has been shown that a vast majority of the current exoskeleton designs fit within the proposed design framework and the defined functionalities.

RODec 6, 2017

Cleverarm: A Novel Exoskeleton For Rehabilitation Of Upper Limb Impairments

Rana Soltani-Zarrin, Amin Zeiaee, Andrew Eib et al.

CLEVERarm (Compact, Low-weight, Ergonomic, Virtual and Augmented Reality Enhanced Rehabilitation arm) is a novel exoskeleton with eight degrees of freedom supporting the motion of shoulder girdle, glenohumeral joint, elbow and wrist. Of the eight degrees of freedom of the exoskeleton, six are active and the two degrees of freedom supporting the motion of wrist are passive. This paper briefly outlines the design of CLEVERarm and its control architectures.

RONov 27, 2017

Challenges and Opportunities in Exoskeleton-based Rehabilitation

Rana Soltani-Zarrin, Amin Zeiaee, Reza Langari et al.

Robotic systems are increasingly used in rehabilitation to provide high intensity training for patients with motor impairment. The results of controlled trials involving human subjects confirm the effectiveness of robot-enhanced methods and prove them to be marginally superior over standard manual therapy in some cases. Although very promising, this line of research is still in its infancy and further studies are required to fully understand the potential benefits of using robotic devices such as exoskeletons. Exoskeletons have been widely studied due to their capability in providing more control over paretic limb as well as the complexities involved in their design and control. This paper briefly discusses the main challenges in development of rehabilitation exoskeletons and elaborates more on how some of these issues are addressed in the design of CLEVERarm, a recently developed upper limb rehabilitation exoskeleton. The paper is concluded with several remarks on the current challenges in wide-spread use of exoskeletons in medical facilities, and a vision for the future of these technologies in rehabilitation medicine.