Back to Explore
cs.LGComputer Science

Machine Learning

Statistical learning, deep learning, optimization

90LGMay 22, 2025Code
Shape it Up! Restoring LLM Safety during Finetuning

ShengYun Peng, Pin-Yu Chen, Jianfeng Chi et al. · gatech

This addresses safety vulnerabilities in LLM customization for users and developers, offering a novel mitigation approach that is not incremental but builds on a new paradigm.

88SEJul 31, 2025Code
SWE-Exp: Experience-Driven Software Issue Resolution

Silin Chen, Shaoxin Lin, Xiaodong Gu et al.

This addresses the inefficiency of redundant exploration in automated software engineering for developers, representing a new paradigm rather than an incremental improvement.

86LGFeb 14, 2025Code
KernelBench: Can LLMs Write Efficient GPU Kernels?

Anne Ouyang, Simon Guo, Simran Arora et al.

This work addresses the problem of efficient GPU kernel generation for machine learning architectures, which is significant for ML engineers and researchers.

86CLFeb 15, 2024Code
Generative Representational Instruction Tuning

Niklas Muennighoff, Hongjin Su, Liang Wang et al. · microsoft-research

This addresses the inefficiency of using separate models for retrieval and generation in applications like RAG, speeding it up by over 60% for long documents.

85LGFeb 29, 2024Code
Watermark Stealing in Large Language Models

Nikola Jovanović, Robin Staab, Martin Vechev

This exposes a critical security flaw in AI-generated content detection, challenging the deployment readiness of current LLM watermarking methods.

84LGMay 1, 2024Code
Self-Play Preference Optimization for Language Model Alignment

Yue Wu, Zhiqing Sun, Huizhuo Yuan et al. · cmu

This addresses the challenge of accurately capturing human preferences for language model alignment, offering a novel approach that outperforms existing methods without relying on external supervision from stronger models.