Boyan Li, Chong Chen, Zhujun Xue et al.
This addresses the need for more reliable and verifiable Text-to-SQL systems for database users, representing a paradigm shift rather than an incremental improvement.
Database systems, query processing, data management
Boyan Li, Chong Chen, Zhujun Xue et al.
This addresses the need for more reliable and verifiable Text-to-SQL systems for database users, representing a paradigm shift rather than an incremental improvement.
Bowen Cao, Weibin Liao, Yushi Sun et al.
For enterprise Text-to-SQL applications, this work addresses semantic ambiguity and scalability issues in large databases by shifting from static schema to dynamic data exploration.
Yuxuan Zhu, Tengjun Jin, Yoojin Choi et al.
This work addresses the critical challenge of improving Text-to-SQL accuracy for database research and data analytics applications, representing a significant advancement rather than an incremental improvement.
Lei Ma, Jinyang Liu, Tieying Zhang et al.
This addresses the challenge of detecting system failures and security risks in logs for industries like cloud computing, offering a novel approach that improves accuracy and efficiency over prior methods.
Yuwei Xu, Shulun Zhang, Yingli Zhou et al.
For researchers in social simulation, TopoSim addresses the inefficiency and unrealistic dynamics of existing LLM-based frameworks by leveraging network topology.
Duyi Pan, Tianao Lou, Xin Li et al.
This addresses recall and precision limitations in graph-based RAG for black-box knowledge graphs, offering a plug-and-play solution for knowledge-intensive tasks.
Yanlin Qi, Xinhang Chen, Huiqiang Jiang et al. · harvard, microsoft-research
This work provides a significant improvement in the efficiency and scalability of long-context LLM inference for developers and researchers working with large language models.
Yunxiang Su, Tianjing Zeng, Zhongjun Ding et al.
For researchers and practitioners integrating LLMs into relational data processing, this work provides a standardized taxonomy and benchmark to systematically evaluate and compare LROs, addressing the lack of unified definitions and evaluation.
Zecheng Zhang, Han Zheng, Yue Xu
This work addresses the need for fine-grained, interpretable quality assessment and efficient routing in production LLM gateways, offering a practical solution for organizations managing multiple LLM providers.
Kangkang Qi, Dongyang Xie, Wenbo Li et al.
This work addresses performance and usability issues for data analysts and engineers working with LLM-based semantic queries, representing a novel method rather than an incremental improvement.
Ritwik Yadav, Supun Abeysinghe, Min Yang et al.
For data engineers managing large-scale ETL pipelines, Enzyme reduces operational overhead by automating incremental view maintenance, addressing a long-standing bottleneck in industrial database systems.
Boyan Li, Ou Ocean Kun Hei, Yue Yu et al.
For text-to-SQL systems, DPC provides a training-free method to improve selection accuracy without execution oracles, outperforming existing approaches.
Shizheng Hou, Wenqi Pei, Nuo Chen et al.
For NL2SQL researchers and practitioners, this framework provides a standardized, modular evaluation to identify bottlenecks and guide future improvements.
Kaiwen Liu, Qin Zhang
This addresses challenges in statistical estimation for noisy data in streaming and distributed systems, with incremental improvements over noiseless settings.
Zihao Zheng, Zhihao Mao, Sicheng Tian et al.
This work addresses inference efficiency for robot control systems using VLA models, offering a hybrid approach that is incremental but provides concrete speed improvements.
Yufei Li, Yisen Gao, Jiaxuan Xiong et al.
This benchmark addresses the critical need for more robust and intelligent data management systems for organizations dealing with heterogeneous, evolving, and imperfect real-world data.
Xinyi Zhang, Tiantian Chen, Zhentao Han et al.
This addresses the challenge of optimizing DBMS performance for users and administrators by automating configuration tuning, representing a novel method rather than an incremental improvement.
Yin Lin, Tianjing Zeng, Zhongjun Ding et al.
For users needing to query databases beyond standard SQL, SEMA-SQL bridges the gap between text-to-SQL and semantic operator systems by automating query generation, optimization, and execution.
David Torres Ramos, Vihan Lakshman, Chen Luo et al.
This addresses a critical challenge in data-intensive domains like computational genomics, where skewed distributions dominate, offering a principled solution with theoretical guarantees.
Yixi Zhou, Fan Zhang, Zhiqiao Guo et al.
This addresses the overlooked dimension of structural evaluation for LLM-based program generation systems, which is incremental but important for reliability.