SEAIPLMay 7, 2025

PR2: Peephole Raw Pointer Rewriting with LLMs for Translating C to Safer Rust

arXiv:2505.04852v26 citationsh-index: 7
Originality Incremental advance
AI Analysis

This addresses memory safety issues in translated Rust code for developers using transpilation tools, though it is an incremental improvement over existing methods.

The paper tackles the problem of unsafe raw pointers in Rust code generated by C-to-Rust transpilation tools, proposing a peephole raw pointer rewriting technique that successfully eliminates 13.22% of local raw pointers across 28 real-world C projects.

There has been a growing interest in translating C code to Rust due to Rust's robust memory and thread safety guarantees. Tools such as C2RUST enable syntax-guided transpilation from C to semantically equivalent Rust code. However, the resulting Rust programs often rely heavily on unsafe constructs--particularly raw pointers--which undermines Rust's safety guarantees. This paper aims to improve the memory safety of Rust programs generated by C2RUST by eliminating raw pointers. Specifically, we propose a peephole raw pointer rewriting technique that lifts raw pointers in individual functions to appropriate Rust data structures. Technically, PR2 employs decision-tree-based prompting to guide the pointer lifting process. Additionally, it leverages code change analysis to guide the repair of errors introduced during rewriting, effectively addressing errors encountered during compilation and test case execution. We implement PR2 as a prototype and evaluate it using gpt-4o-mini on 28 real-world C projects. The results show that PR2 successfully eliminates 13.22% of local raw pointers across these projects, significantly enhancing the safety of the translated Rust code. On average, PR2 completes the transformation of a project in 5.44 hours, at an average cost of $1.46.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes