30.8SEJun 2
Multi-Modal Assessment of Road Roughness Using Smartphone Applications, Acceleration, and Passenger RatingsNovel Certada, Amirhesam Aghanouri, Joseba Gorospe et al.
This paper investigates a multi-modal and human-centric framework for low-cost road roughness assessment. The evaluation was based on three complementary data sources: smartphone-based International Roughness Index (IRI) estimates from two independent smartphone-based applications; in-vehicle GNSS-IMU Receiver (Global Navigation Satellite System Receiver with Inertial Measurement Unit) measurements, and passenger Present Serviceability Ratings (PSR). Data were collected over 1700 km across Austria, Hungary, and Romania under real traffic conditions. Inter-application agreement was evaluated using correlation analysis, Intraclass Correlation Coefficient (ICC), and Bland-Altman methods. While the two smartphone applications show strong correlation, systematic bias limits their interchangeability. A significant inverse relationship between IRI and PSR confirms perceptual sensitivity to roughness, and positive correlations between IRI and vertical acceleration validate the physical linkage between pavement irregularities and vehicle dynamics. The results demonstrate the challenges of integrating consumer-grade sensing and perception-based evaluation for road roughness monitoring as an alternative to high-cost specialized survey equipment.
CVJul 31, 2023
On Transferability of Driver Observation Models from Simulated to Real Environments in Autonomous CarsWalter Morales-Alvarez, Novel Certad, Alina Roitberg et al.
For driver observation frameworks, clean datasets collected in controlled simulated environments often serve as the initial training ground. Yet, when deployed under real driving conditions, such simulator-trained models quickly face the problem of distributional shifts brought about by changing illumination, car model, variations in subject appearances, sensor discrepancies, and other environmental alterations. This paper investigates the viability of transferring video-based driver observation models from simulation to real-world scenarios in autonomous vehicles, given the frequent use of simulation data in this domain due to safety issues. To achieve this, we record a dataset featuring actual autonomous driving conditions and involving seven participants engaged in highly distracting secondary activities. To enable direct SIM to REAL transfer, our dataset was designed in accordance with an existing large-scale simulator dataset used as the training source. We utilize the Inflated 3D ConvNet (I3D) model, a popular choice for driver observation, with Gradient-weighted Class Activation Mapping (Grad-CAM) for detailed analysis of model decision-making. Though the simulator-based model clearly surpasses the random baseline, its recognition quality diminishes, with average accuracy dropping from 85.7% to 46.6%. We also observe strong variations across different behavior classes. This underscores the challenges of model transferability, facilitating our research of more robust driver observation systems capable of dealing with real driving conditions.
30.5SYMay 7
A LiDAR-Driven Fallback Longitudinal Controller for Safer Following in Sudden Braking ScenariosMohamed Sabry, Enrico Del Re, Walter Morales-Alvarez et al.
Adaptive Cruise Control has seen significant advancements, with Collaborative Adaptive Cruise Control leveraging Vehicle-to-Vehicle communication to enhance coordination and stability. However, the reliance on stable communication channels limits its reliability. Research on reducing information dependencies in Adaptive Cruise Control systems has remained limited, despite its critical role in mitigating collision risks during sudden braking scenarios. This study proposes a novel fallback longitudinal controller that relies solely on LiDAR-based distance measurements and the velocity of a follower vehicle. The controller is designed to be time-independent, ensuring operation in the presence of sensor delays or synchronization issues. Simulation results demonstrate that the proposed controller enables vehicle-following from standstill and prevents collisions during emergency braking, even under minimal onboard information.
9.1NEMay 18
Reinterpreting Safety Thresholds as Neuron Spiking ThresholdsEnrico Del Re, Mohamed Sabry, Cristina Olaverri-Monreal
Surrogate Safety Measures (SSMs) are extensively utilised in the evaluation of traffic risk in automated driving contexts. However, the majority of SSM-based evaluations employ fixed thresholds that fail to capture the human response to sustained borderline conditions or the reaction to brief, high-risk peaks. The present work proposes a biologically inspired reinterpretation of SSM thresholds. This is modelled as spiking thresholds of leaky integrate-and-fire (LIF) neurons, with multiple SSM inputs combined into a spiking neural network (SNN). The SNN is trained to emit spikes that are aligned with human braking onsets. The training data was recorded in a controlled car-following experiment using the 3D-CoAutoSim platform with CARLA/Unreal and a 6-DOF motion platform, where induced critical events were generated. The results demonstrate that the learned spiking activity qualitatively aligns with braking behaviour across scenarios and captures reactions that are not consistently explained by threshold crossings alone. Analysis across participants further indicates that learned input thresholds remain relatively consistent, while learned decay factors encode different temporal sensitivities for the SSMs. The findings of this study indicate that spiking dynamics may serve as a mechanism to facilitate the convergence of objective SSMs with subjective human safety perception.
LGApr 14, 2023
Uncertainty-Aware Vehicle Energy Efficiency Prediction using an Ensemble of Neural NetworksJihed Khiari, Cristina Olaverri-Monreal
The transportation sector accounts for about 25% of global greenhouse gas emissions. Therefore, an improvement of energy efficiency in the traffic sector is crucial to reducing the carbon footprint. Efficiency is typically measured in terms of energy use per traveled distance, e.g. liters of fuel per kilometer. Leading factors that impact the energy efficiency are the type of vehicle, environment, driver behavior, and weather conditions. These varying factors introduce uncertainty in estimating the vehicles' energy efficiency. We propose in this paper an ensemble learning approach based on deep neural networks (ENN) that is designed to reduce the predictive uncertainty and to output measures of such uncertainty. We evaluated it using the publicly available Vehicle Energy Dataset (VED) and compared it with several baselines per vehicle and energy type. The results showed a high predictive performance and they allowed to output a measure of predictive uncertainty.
LGApr 27, 2022
Uncertainty-Aware Prediction of Battery Energy Consumption for Hybrid Electric VehiclesJihed Khiari, Cristina Olaverri-Monreal
The usability of vehicles is highly dependent on their energy consumption. In particular, one of the main factors hindering the mass adoption of electric (EV), hybrid (HEV), and plug-in hybrid (PHEV) vehicles is range anxiety, which occurs when a driver is uncertain about the availability of energy for a given trip. To tackle this problem, we propose a machine learning approach for modeling the battery energy consumption. By reducing predictive uncertainty, this method can help increase trust in the vehicle's performance and thus boost its usability. Most related work focuses on physical and/or chemical models of the battery that affect the energy consumption. We propose a data-driven approach which relies on real-world datasets including battery related attributes. Our approach showed an improvement in terms of predictive uncertainty as well as in accuracy compared to traditional methods.
6.9ROMay 7
A Cost-Effective and Climate-Resilient Air Pressure System for Rain Effect Reduction on Automated Vehicle CamerasMohamed Sabry, Joseba Gorospe, Cristina Olaverri-Monreal
Recent advances in automated vehicles have focused on improving perception performance under adverse weather conditions; however, research on physical hardware solutions remains limited, despite their importance for perception critical applications such as vehicle platooning. Existing approaches, such as hydrophilic or hydrophobic lenses and sprays, provide only partial mitigation, while industrial protection systems imply high cost and they do not enable scalability for automotive deployment. To address these limitations, this paper presents a cost-effective hardware solution for rainy conditions, designed to be compatible with multiple cameras simultaneously. Beyond its technical contribution, the proposed solution supports sustainability goals in transportation systems. By enabling compatibility with existing camera-based sensing platforms, the system extends the operational reliability of automated vehicles without requiring additional high-cost sensors or hardware replacements. This approach reduces resource consumption, supports modular upgrades, and promotes more cost-efficient deployment of automated vehicle technologies, particularly in challenging weather conditions where system failures would otherwise lead to inefficiencies and increased emissions. The proposed system was able to increase pedestrian detection accuracy of a Deep Learning model from 8.3% to 41.6%.
1.1CVMay 18
Optimising CSRNet with parameter-free attention mechanisms for crowd counting in public transportAida Rostamza, Enrico Del Re, Joshua Cherian Varughese et al.
Occupancy estimation and crowd counting are critical tasks in designing smart and efficient public transport vehicles. Given that public transport loading can vary from sparse to crowded, classical models for occupancy estimation must be adapted to suit this purpose. Attention mechanisms have shown remarkable capability in enhancing the representational power of deep neural networks for crowd counting in congested scenes with occlusion, complex backgrounds, and perspective distortion. However, conventional approaches, often implemented as parameterized sub-networks within convolutional layers, inevitably increase model size and computational cost, limiting deployment on resource-constrained edge devices. This paper investigates the effectiveness of state-of-the-art parameter-free attention mechanisms for crowd counting and density map estimation in highly congested scenes. We evaluate channel-wise (PFCA), spatial-wise (SA), and 3-D (SimAM) modules and compare their performance with parameterized attention modules constrained to introduce no more than 1% additional parameters. Furthermore, we present a novel combination of attention mechanisms that combines the strengths of PFCA and SA (PFCASA) customized for analyzing video streams onboard public transport systems. Using CSRNet as the backbone, experiments on the ShanghaiTech dataset demonstrate that parameter-free attention mechanisms achieve comparable or superior accuracy without introducing additional model parameters. A detailed performance analysis further reveals that PFCASA outperforms other attention modules in scenes with fewer than 40 individuals, while PFCA shows greater effectiveness as crowd density increases, underscoring their potential applicability for integration into smart public transport modalities.
10.6ROMay 18
Assessing Localization Technologies for Pedestrian Collision AvoidanceJoshua Varughese, Joseba Gorospe, Novel Certad et al.
Robust pedestrian safety is crucial to the next-generation of intelligent transportation systems. Such systems rely on active pedestrian localization and predictive collision alerts. Pedestrian localization can be supported by Ultra-Wideband technology and Bluetooth 6.0, which offer high-precision ranging and low-latency communication, making them promising candidates for vehicular collision warning systems. This paper assesses the localization accuracy of these technologies for pedestrian alerting and benchmarks their performance against Global Navigation Satellite Systems. Experimental evaluations performed in this paper focused on key performance metrics, including localization accuracy and robustness to environmental conditions. Preliminary results suggest that Ultra-Wideband and Bluetooth 6.0 can serve as viable alternatives or complements to Global Navigation Satellite Systems in certain scenarios, improving situational awareness and enabling timely pedestrian alerts.
13.8ROMay 18
On Improving Multimodal Pedestrian Trajectory Prediction with CVAE: A Study on Benchmark and Robot DataYuzhou Liu, Cristina Olaverri-Monreal
Accurate pedestrian trajectory prediction is crucial for autonomous systems operating in complex environments, such as modular buses and delivery robots in suburban or semi-structured areas. Social Spatio-Temporal Graph Convolutional Neural Networks (Social-STGCNN) have shown strong performance by modeling social interactions; however, producing diverse and well-calibrated future trajectories remains challenging. In this work, we build on a Social-STGCNN backbone and introduce a Conditional Variational Autoencoder (CVAE)-based probabilistic formulation to explicitly model multimodal future trajectories. We evaluate the method on the ETH and UCY pedestrian trajectory datasets as well as on a real-world pedestrian dataset collected by a mobile robot. Results show moderate gains on public benchmarks, but more consistent endpoint accuracy and improved trajectory diversity across different crowd configurations. Evaluation on robot-collected data further demonstrates the approach's effectiveness beyond curated benchmarks and supports its applicability in practical deployments.
5.0HCMay 18
In-Vehicle Human-Machine Interface to Support Drivers in Conditionally Automated PlatooningAnna-Lena Hager, Mohamed Sabry, Walter Morales-Alvarez et al.
Vehicle platooning enables close-gap driving and offers potential benefits for traffic efficiency and safety. In conditionally automated platooning, drivers remain responsible for supervising the system and intervening when necessary, making effective Human-Machine Interfaces (HMIs) critical for maintaining situational awareness and stable driver-automation coordination. This paper investigates whether an in-vehicle HMI providing continuous system-state and inter-vehicle distance information improves supervisory behavior, safety, and platoon stability. We conducted a simulation-based experiment integrated with a 6-degree-of-freedom motion system to enhance scenario realism. Dependent variables included collision occurrence, response latency following platoon disconnection, and the number of manual interventions during intact platooning. Results showed significantly fewer manual interventions when the HMI was active, with intervention rates about 80% higher without it. No significant effects were found for collision occurrence or response latency, indicating that additional information improves supervisory stability during platooning but does not substantially affect emergency reactions or collision rates.
CVOct 17, 2024Code
Inadequate contrast ratio of road markings as an indicator for ADAS failureNovel Certad, Cristina Olaverri-Monreal, Friedrich Wiesinger et al.
Road markings were reported as critical road safety features, equally needed for both human drivers and for machine vision technologies utilised by advanced driver assistance systems (ADAS) and in driving automation. Visibility of road markings is achieved because of their colour contrasting with the roadway surface. During recent testing of an open-source camera-based ADAS under several visibility conditions (day, night, rain, glare), significant failures in trajectory planning were recorded and quantified. Consistently, better ADAS reliability under poor visibility conditions was achieved with Type II road markings (i.e. structured markings, facilitating moisture drainage) as compared to Type I road marking (i.e. flat lines). To further understand these failures, analysis of contrast ratio of road markings, which the tested ADAS was detecting for traffic lane recognition, was performed. The highest contrast ratio (greater than 0.5, calculated per Michelson equation) was measured at night in the absence of confounding factors, with statistically significant difference of 0.1 in favour of Type II road markings over Type I. Under daylight conditions, contrast ratio was reduced, with slightly higher values measured with Type I. The presence of rain or wet roads caused the deterioration of the contrast ratio, with Type II road markings exhibiting significantly higher contrast ratio than Type I, even though the values were low (less than 0.1). These findings matched the output of the ADAS related to traffic lane detection and underlined the importance of road marking visibility. Inadequate lane recognition by ADAS was associated with very low contrast ratio of road markings indeed. Importantly, specific minimum contrast ratio value could not be found, which was due to the complexity of ADAS algorithms...
CVSep 23, 2025
LiDAR Point Cloud Image-based Generation Using Denoising Diffusion Probabilistic ModelsAmirhesam Aghanouri, Cristina Olaverri-Monreal
Autonomous vehicles (AVs) are expected to revolutionize transportation by improving efficiency and safety. Their success relies on 3D vision systems that effectively sense the environment and detect traffic agents. Among sensors AVs use to create a comprehensive view of surroundings, LiDAR provides high-resolution depth data enabling accurate object detection, safe navigation, and collision avoidance. However, collecting real-world LiDAR data is time-consuming and often affected by noise and sparsity due to adverse weather or sensor limitations. This work applies a denoising diffusion probabilistic model (DDPM), enhanced with novel noise scheduling and time-step embedding techniques to generate high-quality synthetic data for augmentation, thereby improving performance across a range of computer vision tasks, particularly in AV perception. These modifications impact the denoising process and the model's temporal awareness, allowing it to produce more realistic point clouds based on the projection. The proposed method was extensively evaluated under various configurations using the IAMCV and KITTI-360 datasets, with four performance metrics compared against state-of-the-art (SOTA) methods. The results demonstrate the model's superior performance over most existing baselines and its effectiveness in mitigating the effects of noisy and sparse LiDAR data, producing diverse point clouds with rich spatial relationships and structural detail.
CVSep 22, 2025
Tensor-Based Self-Calibration of Cameras via the TrifocalCalib MethodGregory Schroeder, Mohamed Sabry, Cristina Olaverri-Monreal
Estimating camera intrinsic parameters without prior scene knowledge is a fundamental challenge in computer vision. This capability is particularly important for applications such as autonomous driving and vehicle platooning, where precalibrated setups are impractical and real-time adaptability is necessary. To advance the state-of-the-art, we present a set of equations based on the calibrated trifocal tensor, enabling projective camera self-calibration from minimal image data. Our method, termed TrifocalCalib, significantly improves accuracy and robustness compared to both recent learning-based and classical approaches. Unlike many existing techniques, our approach requires no calibration target, imposes no constraints on camera motion, and simultaneously estimates both focal length and principal point. Evaluations in both procedurally generated synthetic environments and structured dataset-based scenarios demonstrate the effectiveness of our approach. To support reproducibility, we make the code publicly available.
CVMay 1, 2025
ClearLines - Camera Calibration from Straight LinesGregory Schroeder, Mohamed Sabry, Cristina Olaverri-Monreal
The problem of calibration from straight lines is fundamental in geometric computer vision, with well-established theoretical foundations. However, its practical applicability remains limited, particularly in real-world outdoor scenarios. These environments pose significant challenges due to diverse and cluttered scenes, interrupted reprojections of straight 3D lines, and varying lighting conditions, making the task notoriously difficult. Furthermore, the field lacks a dedicated dataset encouraging the development of respective detection algorithms. In this study, we present a small dataset named "ClearLines", and by detailing its creation process, provide practical insights that can serve as a guide for developing and refining straight 3D line detection algorithms.
CVApr 11, 2025
Shadow Erosion and Nighttime Adaptability for Camera-Based Automated Driving ApplicationsMohamed Sabry, Gregory Schroeder, Joshua Varughese et al.
Enhancement of images from RGB cameras is of particular interest due to its wide range of ever-increasing applications such as medical imaging, satellite imaging, automated driving, etc. In autonomous driving, various techniques are used to enhance image quality under challenging lighting conditions. These include artificial augmentation to improve visibility in poor nighttime conditions, illumination-invariant imaging to reduce the impact of lighting variations, and shadow mitigation to ensure consistent image clarity in bright daylight. This paper proposes a pipeline for Shadow Erosion and Nighttime Adaptability in images for automated driving applications while preserving color and texture details. The Shadow Erosion and Nighttime Adaptability pipeline is compared to the widely used CLAHE technique and evaluated based on illumination uniformity and visual perception quality metrics. The results also demonstrate a significant improvement over CLAHE, enhancing a YOLO-based drivable area segmentation algorithm.
HCJun 14, 2021
Real-World Evaluation of the Impact of Automated Driving System Technology on Driver Gaze Behavior, Reaction Time and TrustWalter Morales-Alvarez, Mohamed Marouf, Hadj. Hamma Tadjine et al.
Recent developments in advanced driving assistance systems (ADAS) that rely on some level of autonomy have led the automobile industry and research community to investigate the impact they might have on driving performance. However, most of the research performed so far is based on simulated environments. In this study, we investigated the behavior of drivers in a vehicle with automated driving system (ADS) capabilities in a real-life driving scenario. We analyzed their response to a take over request (TOR) at two different driving speeds while being engaged in non-driving-related tasks (NDRT). Results from the performed experiments showed that driver reaction time to a TOR, gaze behavior and self-reported trust in automation were affected by the type of NDRT being concurrently performed and driver reaction time and gaze behavior additionally depended on the driving or vehicle speed at the time of TOR.
LGSep 24, 2020
Boosting Algorithms for Delivery Time Prediction in Transportation LogisticsJihed Khiari, Cristina Olaverri-Monreal
Travel time is a crucial measure in transportation. Accurate travel time prediction is also fundamental for operation and advanced information systems. A variety of solutions exist for short-term travel time predictions such as solutions that utilize real-time GPS data and optimization methods to track the path of a vehicle. However, reliable long-term predictions remain challenging. We show in this paper the applicability and usefulness of travel time i.e. delivery time prediction for postal services. We investigate several methods such as linear regression models and tree based ensembles such as random forest, bagging, and boosting, that allow to predict delivery time by conducting extensive experiments and considering many usability scenarios. Results reveal that travel time prediction can help mitigate high delays in postal services. We show that some boosting algorithms, such as light gradient boosting and catboost, have a higher performance in terms of accuracy and runtime efficiency than other baselines such as linear regression models, bagging regressor and random forest.
HCJun 16, 2020
Response of Vulnerable Road Users to Visual Information from Autonomous Vehicles in Shared SpacesWalter Morales Alvarez, Miguel Ángel de Miguel, Fernando García et al.
Completely unmanned autonomous vehicles have been anticipated for a while. Initially, these are expected to drive only under certain conditions on some roads, and advanced functionality is required to cope with the ever-increasing challenges of safety. To enhance the public's perception of road safety and trust in new vehicular technologies, we investigate in this paper the effect of several interaction paradigms with vulnerable road users by developing and applying algorithms for the automatic analysis of pedestrian body language. We assess behavioral patterns and determine the impact of the coexistence of AVs and other road users on general road safety in a shared space for VRUs and vehicles. Results showed that the implementation of visual communication cues for interacting with VRUs is not necessarily required for a shared space in which informal traffic rules apply.
ROJun 16, 2020
Mobile Delivery Robots: Mixed Reality-Based Simulation Relying on ROS and Unity 3DYuzhou Liu, Georg Novotny, Nikita Smirnov et al.
In the context of Intelligent Transportation Systems and the delivery of goods, new technology approaches need to be developed in order to cope with certain challenges that last mile delivery entails, such as navigation in an urban environment. Autonomous delivery robots can help overcome these challenges. We propose a method for performing mixed reality (MR) simulation with ROS-based robots using Unity, which synchronizes the real and virtual environment, and simultaneously uses the sensor information of the real robots to locate themselves and project them into the virtual environment, so that they can use their virtual doppelganger to perceive the virtual world. Using this method, real and virtual robots can perceive each other and the environment in which the other party is located, thereby enabling the exchange of information between virtual and real objects. Through this approach a more realistic and reliable simulation can be obtained. Results of the demonstrated use-cases verified the feasibility and efficiency as well as the stability of implementing MR using Unity for ROS-based robots.
HCJun 4, 2020
Vehicle Automation Field Test: Impact on Driver Behavior and TrustWalter Morales Alvarez, Nikita Smirnov, Elmar Matthes et al.
With the growing technological advances in autonomous driving, the transport industry and research community seek to determine the impact that autonomous vehicles (AV) will have on consumers, as well as identify the different factors that will influence their use. Most of the research performed so far relies on laboratory-controlled conditions using driving simulators, as they offer a safe environment for testing advanced driving assistance systems (ADAS). In this study we analyze the behavior of drivers that are placed in control of an automated vehicle in a real life driving environment. The vehicle is equipped with advanced autonomy, making driver control of the vehicle unnecessary in many scenarios, although a driver take over is possible and sometimes required. In doing so, we aim to determine the impact of such a system on the driver and their driving performance. To this end road users' behavior from naturalistic driving data is analyzed focusing on awareness and diagnosis of the road situation. Results showed that the road features determined the level of visual attention and trust in the automation. They also showed that the activities performed during the automation affected the reaction time to take over the control of the vehicle.
ROJun 4, 2020
Autonomous Driving: Framework for Pedestrian Intention Estimationin a Real World ScenarioWalter Morales Alvarez, Francisco Miguel Moreno, Oscar Sipele et al.
Rapid advancements in driver-assistance technology will lead to the integration of fully autonomous vehicles on our roads that will interact with other road users. To address the problem that driverless vehicles make interaction through eye contact impossible, we describe a framework for estimating the crossing intentions of pedestrians in order to reduce the uncertainty that the lack of eye contact between road users creates. The framework was deployed in a real vehicle and tested with three experimental cases that showed a variety of communication messages to pedestrians in a shared space scenario. Results from the performed field tests showed the feasibility of the presented approach.