Yanan He

AI
h-index3
5papers
2citations
Novelty49%
AI Score53

5 Papers

60.7AIMay 22Code
HyperGuide: Hyperbolic Guidance for Efficient Multi-Step Reasoning in Large Language Models

Yuyu Liu, Haotian Xu, Yanan He et al.

Multi-step reasoning remains a central challenge for large language models: single-pass generation is efficient but lacks accuracy; tree-search methods explore multiple paths but are computation-heavy. We address this gap by distilling reasoning progress into a hyperbolic geometric signal that guides step-by-step generation. Our approach is motivated by a structural observation: in combinatorial reasoning trees, solution-bearing states are few while dead ends are exponentially numerous. The hyperbolic space matches this asymmetry, with compact volume near the origin and exponentially expanding capacity toward the boundary, so that distance-to-origin naturally encodes solution proximity while angular separation distinguishes branches requiring different next operations. We train a lightweight head to project LLM hidden states into this space, then fine-tune a low-rank adapter interactively on its own reasoning attempts to act on the injected signal. Across multiple benchmarks, the geometric signal yields consistent gains, with larger improvements on deeper reasoning chains. Our code is publicly available at https://github.com/yuyuliu11037/HyperGuide.

66.1CVMay 10Code
GSMap: 2D Gaussians for Online HD Mapping

Zhenxuan Zeng, Sheng Yang, Lingxuan Wang et al.

Accurate High-Definition (HD) map construction is critical for autonomous driving, yet existing methods face a fundamental trade-off: vectorization-based approaches preserve topology but struggle with geometric fidelity, while rasterization-based approaches enable precise geometric supervision but produce unstructured outputs. To bridge this gap, we propose GSMap, a novel framework that unifies both paradigms via a learnable 2D Gaussian representation. Each map element is modeled as an ordered sequence of 2D Gaussians, whose centers correspond to the vertices of the vectorized polyline/polygon. This formulation enables simultaneous optimization through: (1) Differentiable rasterization that enforces pixel-level geometric constraints, and (2) Topology-aware vectorization that maintains structural regularity. Experiments on both nuScenes and Argoverse2 demonstrate that our Gaussian-based representation effectively unifies geometric and topological learning, achieving significant performance improvements and demonstrating strong compatibility with existing HD mapping architectures. Code will be available at https://github.com/peakpang/GSMap

78.3CLMay 18
Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

Leyao Wang, Yanan He, Peng Chen et al.

Deep research agents increasingly automate complex information-seeking tasks, producing evidence-grounded reports via multi-step reasoning, tool use, and synthesis. Their growing role demands scalable, reliable evaluation, positioning LLM-as-judge as a supervision paradigm for assessing factual accuracy, evidence use, and reasoning quality. Yet the reliability of these judges for deep research agents remains poorly understood, posing a critical meta-evaluation problem: before deploying LLM judges to supervise research agents, we must first evaluate the judges themselves. Existing meta-evaluations fall short in two ways: (1) reliance on coarse, subjective human-preference agreement; (2) focus on instruction-following or verifiable tasks, leaving open-ended agent executions unexplored. To address these gaps, we introduce REFLECT (REliable Fine-grained LLM judge Evaluation via Controlled inTervention), a meta-evaluation benchmark targeting fine-grained failure detection in agentic environments. REFLECT defines a detailed taxonomy of process- and outcome-level failure modes, instantiated by performing controlled and localized interventions on quality-screened agent execution traces. This yields verifiable, comprehensive, and fine-grained instances for validating the judge models. Our experiments show that current LLM judges remain unreliable: even the best-performing models achieve overall accuracies below 55% across reasoning, tool-use, and report-quality failures, with especially poor performance on evidence verification. Together, our taxonomy and findings expose systematic judge limitations, reveal tradeoffs in cost and reliability, and offer actionable guidance for building more reliable evaluation pipelines for deep research agents.

LGFeb 4
MTS-JEPA: Multi-Resolution Joint-Embedding Predictive Architecture for Time-Series Anomaly Prediction

Yanan He, Yunshi Wen, Xin Wang et al.

Multivariate time series underpin modern critical infrastructure, making the prediction of anomalies a vital necessity for proactive risk mitigation. While Joint-Embedding Predictive Architectures (JEPA) offer a promising framework for modeling the latent evolution of these systems, their application is hindered by representation collapse and an inability to capture precursor signals across varying temporal scales. To address these limitations, we propose MTS-JEPA, a specialized architecture that integrates a multi-resolution predictive objective with a soft codebook bottleneck. This design explicitly decouples transient shocks from long-term trends, and utilizes the codebook to capture discrete regime transitions. Notably, we find this constraint also acts as an intrinsic regularizer to ensure optimization stability. Empirical evaluations on standard benchmarks confirm that our approach effectively prevents degenerate solutions and achieves state-of-the-art performance under the early-warning protocol.

0.1NAApr 8
A time-nonlocal multiphysics finite element method with Crank-Nicolson scheme for poroelasticity model with secondary consolidation

Zhihao Ge, Yanan He

The paper studies a time-nonlocal multiphysics finite element method with Crank-Nicolson scheme for poroelasticity model with secondary consolidation. For the case where the physical parameters $λ,λ^*$ and $c_0$ are all finite positive constants, by introducing two auxiliary variables-the fluid content $η$ and the generalized pressure $ξ$ -- the original strongly coupled poroelasticity model is reformulated into a generalized Stokes equation with time integral terms and a diffusion equation. The reformulated model not only reveals the underlying multiphysics processes in the original model, but also exhibits time-nonlocal characteristics. A time-nonlocal multiphysics finite element method is designed for the reformulated model: the spatial discretization employs high order Taylor-Hood mixed finite element method, and the temporal discretization adopts the Crank-Nicolson scheme. The time integral terms are approximated using the composite trapezoidal rule, and the integral terms $J_ξ^n$ and $J_η^n$ are introduced for real-time updates, which not only avoids repeated calculations and improves efficiency, but also maintains second-order temporal accuracy. The existence and uniqueness of weak solutions for the reformulated model are proved via energy estimate methods, the stability of the fully discrete time-nonlocal multiphysics finite element method is established, and optimal-order error estimates are derived using projection operator techniques. Finally, numerical example verified the theoretical results and compared the long-time convergence of the Crank-Nicolson scheme and the backward Euler scheme.