Back to Explore
cs.CLComputer Science

Computation & Language

NLP, text generation, language models

90CVMay 28, 2025Code
Thinking with Generated Images

Ethan Chern, Zhulin Hu, Steffi Chern et al.

This approach enables AI models to engage in visual imagination and iterative refinement, benefiting domains like biochemistry, architecture, forensics, and sports, though it is a new paradigm rather than incremental.

88SEJul 31, 2025Code
SWE-Exp: Experience-Driven Software Issue Resolution

Silin Chen, Shaoxin Lin, Xiaodong Gu et al.

This addresses the inefficiency of redundant exploration in automated software engineering for developers, representing a new paradigm rather than an incremental improvement.

86CLFeb 19, 2024Code
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

Fengqing Jiang, Zhangchen Xu, Luyao Niu et al. · uw

This addresses a critical security problem for users and developers of LLMs by exposing a novel attack vector that exploits multimodal interpretation gaps, representing a significant rather than incremental advance in understanding LLM vulnerabilities.

86CLFeb 15, 2024Code
Generative Representational Instruction Tuning

Niklas Muennighoff, Hongjin Su, Liang Wang et al. · microsoft-research

This addresses the inefficiency of using separate models for retrieval and generation in applications like RAG, speeding it up by over 60% for long documents.

85CLDec 8, 2023Code
Seamless: Multilingual Expressive and Streaming Speech Translation

Seamless Communication, Loïc Barrault, Yu-An Chung et al. · meta-ai, stanford

This work addresses the problem of making machine-mediated communication more seamless and human-like for users of multilingual speech translation systems, though it builds incrementally on previous models like SeamlessM4T.

85SDFeb 24, 2025Code
AAD-LLM: Neural Attention-Driven Auditory Scene Understanding

Xilin Jiang, Sukru Samet Dindar, Vishal Choudhari et al.

This work addresses the limitation of auditory AI in aligning with human perception for applications like hearing aids or communication systems, representing a novel paradigm rather than an incremental improvement.

84LGMay 1, 2024Code
Self-Play Preference Optimization for Language Model Alignment

Yue Wu, Zhiqing Sun, Huizhuo Yuan et al. · cmu

This addresses the challenge of accurately capturing human preferences for language model alignment, offering a novel approach that outperforms existing methods without relying on external supervision from stronger models.

84CVFeb 13, 2025Code
Pixel-Level Reasoning Segmentation via Multi-turn Conversations

Dexian Cai, Xiaocui Yang, Yongkang Liu et al.

This work addresses the problem of fine-grained segmentation for dynamic user intent in multi-turn conversations, which is significant for developers of visual perception systems and conversational AI.