RONov 18, 2022Code
Rationale-aware Autonomous Driving Policy utilizing Safety Force Field implemented on CARLA SimulatorHo Suk, Taewoo Kim, Hyungbin Park et al.
Despite the rapid improvement of autonomous driving technology in recent years, automotive manufacturers must resolve liability issues to commercialize autonomous passenger car of SAE J3016 Level 3 or higher. To cope with the product liability law, manufacturers develop autonomous driving systems in compliance with international standards for safety such as ISO 26262 and ISO 21448. Concerning the safety of the intended functionality (SOTIF) requirement in ISO 26262, the driving policy recommends providing an explicit rational basis for maneuver decisions. In this case, mathematical models such as Safety Force Field (SFF) and Responsibility-Sensitive Safety (RSS) which have interpretability on decision, may be suitable. In this work, we implement SFF from scratch to substitute the undisclosed NVIDIA's source code and integrate it with CARLA open-source simulator. Using SFF and CARLA, we present a predictor for claimed sets of vehicles, and based on the predictor, propose an integrated driving policy that consistently operates regardless of safety conditions it encounters while passing through dynamic traffic. The policy does not have a separate plan for each condition, but using safety potential, it aims human-like driving blended in with traffic flow.
CVJun 24, 2025Code
MedErr-CT: A Visual Question Answering Benchmark for Identifying and Correcting Errors in CT ReportsSunggu Kyung, Hyungbin Park, Jinyoung Seo et al.
Computed Tomography (CT) plays a crucial role in clinical diagnosis, but the growing demand for CT examinations has raised concerns about diagnostic errors. While Multimodal Large Language Models (MLLMs) demonstrate promising comprehension of medical knowledge, their tendency to produce inaccurate information highlights the need for rigorous validation. However, existing medical visual question answering (VQA) benchmarks primarily focus on simple visual recognition tasks, lacking clinical relevance and failing to assess expert-level knowledge. We introduce MedErr-CT, a novel benchmark for evaluating medical MLLMs' ability to identify and correct errors in CT reports through a VQA framework. The benchmark includes six error categories - four vision-centric errors (Omission, Insertion, Direction, Size) and two lexical error types (Unit, Typo) - and is organized into three task levels: classification, detection, and correction. Using this benchmark, we quantitatively assess the performance of state-of-the-art 3D medical MLLMs, revealing substantial variation in their capabilities across different error types. Our benchmark contributes to the development of more reliable and clinically applicable MLLMs, ultimately helping reduce diagnostic errors and improve accuracy in clinical practice. The code and datasets are available at https://github.com/babbu3682/MedErr-CT.
PRApr 9
Two-grid Penalty Approximation Scheme for Doubly Reflected BSDEsWonjae Lee, Hyungbin Park
We study penalization coupled with time discretization for decoupled Markovian doubly reflected BSDEs with obstacles \(p_b(t,X_t)\le Y_t\le p_w(t,X_t)\). The DRBSDE is approximated by a penalized BSDE with parameter \(λ\) and discretized by an implicit Euler scheme with step \(Ît\). A key difficulty is that the forward approximation used to evaluate the obstacles generates an error term that is amplified by \(λ\). In the single-obstacle case this amplification can be removed by the shift \(Y-p_b(t,X)\), but no analogous transformation eliminates both obstacles simultaneously; this motivates simulating the forward SDE on a finer grid \(\tilde{Ît}\) and projecting onto the backward grid (two-grid scheme). Under structural assumptions motivated by financial barriers we sharpen penalization rates and obtain a uniform \(O(λ^{-1})\) bound for the value process. We derive an explicit error bound in \((Ît,\tilde{Ît},λ)\) and tuning rules; for \(Z\)-independent drivers, \(λ\asymp Ît^{-1/2}\) with \(\tilde{Ît}=O(Ît/λ^2)\) yields the target \(O(Ît^{1/2})\) rate. Nonsmooth barriers/payoffs are handled via a multivariate Itô--Tanaka and local-time-on-surfaces argument. We also provide numerical experiments for a one-dimensional game put under the Black--Scholes model. The observed grid-refinement errors are consistent with the predicted \(O(n^{-1/2})\) behavior, while the penalty sweep indicates that the tested regime remains pre-asymptotic with respect to the penalty parameter.
IVJun 29, 2025
MedRegion-CT: Region-Focused Multimodal LLM for Comprehensive 3D CT Report GenerationSunggu Kyung, Jinyoung Seo, Hyunseok Lim et al.
The recent release of RadGenome-Chest CT has significantly advanced CT-based report generation. However, existing methods primarily focus on global features, making it challenging to capture region-specific details, which may cause certain abnormalities to go unnoticed. To address this, we propose MedRegion-CT, a region-focused Multi-Modal Large Language Model (MLLM) framework, featuring three key innovations. First, we introduce Region Representative ($R^2$) Token Pooling, which utilizes a 2D-wise pretrained vision model to efficiently extract 3D CT features. This approach generates global tokens representing overall slice features and region tokens highlighting target areas, enabling the MLLM to process comprehensive information effectively. Second, a universal segmentation model generates pseudo-masks, which are then processed by a mask encoder to extract region-centric features. This allows the MLLM to focus on clinically relevant regions, using six predefined region masks. Third, we leverage segmentation results to extract patient-specific attributions, including organ size, diameter, and locations. These are converted into text prompts, enriching the MLLM's understanding of patient-specific contexts. To ensure rigorous evaluation, we conducted benchmark experiments on report generation using the RadGenome-Chest CT. MedRegion-CT achieved state-of-the-art performance, outperforming existing methods in natural language generation quality and clinical relevance while maintaining interpretability. The code for our framework is publicly available.
LGJul 23, 2025
Fourier Neural Operators for Non-Markovian Processes:Approximation Theorems and ExperimentsWonjae Lee, Taeyoung Kim, Hyungbin Park
This paper introduces an operator-based neural network, the mirror-padded Fourier neural operator (MFNO), designed to learn the dynamics of stochastic systems. MFNO extends the standard Fourier neural operator (FNO) by incorporating mirror padding, enabling it to handle non-periodic inputs. We rigorously prove that MFNOs can approximate solutions of path-dependent stochastic differential equations and Lipschitz transformations of fractional Brownian motions to an arbitrary degree of accuracy. Our theoretical analysis builds on Wong--Zakai type theorems and various approximation techniques. Empirically, the MFNO exhibits strong resolution generalization--a property rarely seen in standard architectures such as LSTMs, TCNs, and DeepONet. Furthermore, our model achieves performance that is comparable or superior to these baselines while offering significantly faster sample path generation than classical numerical schemes.
SYJan 12, 2020
Self-Driving like a Human driver instead of a Robocar: Personalized comfortable driving experience for autonomous vehiclesIl Bae, Jaeyoung Moon, Junekyo Jhung et al.
This paper issues an integrated control system of self-driving autonomous vehicles based on the personal driving preference to provide personalized comfortable driving experience to autonomous vehicle users. We propose an Occupant's Preference Metric (OPM) which is defining a preferred lateral and longitudinal acceleration region with maximum allowable jerk for users. Moreover, we propose a vehicle controller based on control parameters enabling integrated lateral and longitudinal control via preference-aware maneuvering of autonomous vehicles. The proposed system not only provides the criteria for the occupant's driving preference, but also provides a personalized autonomous self-driving style like a human driver instead of a Robocar. The simulation and experimental results demonstrated that the proposed system can maneuver the self-driving vehicle like a human driver by tracking the specified criterion of admissible acceleration and jerk.