SEMar 30

PCREQ: Automated Inference of Compatible Requirements for Python Third-party Library Upgrades

arXiv:2508.0202336.21 citationsh-index: 7
Predicted impact top 63% in SE · last 90 daysOriginality Highly original
AI Analysis

This addresses compatibility issues in Python dependency upgrades for developers, representing a novel solution rather than incremental improvement.

The paper tackles the problem of Python third-party library upgrades causing compatibility issues by proposing PCREQ, the first approach to automatically infer compatible requirements through combined version and code compatibility analysis. Results show PCREQ achieves a 94.03% inference success rate, outperforming existing tools by 18-22% and processing each scenario in 60.79 seconds on average.

Python third-party libraries (TPLs) are essential in modern software development, but upgrades often cause compatibility issues, leading to system failures. These issues fall into two categories: version compatibility issues (VCIs) and code compatibility issues (CCIs). Existing tools mainly detect dependency conflicts but overlook code-level incompatibilities, with no solution fully automating the inference of compatible versions for both VCIs and CCIs. To fill this gap, we propose PCREQ, the first approach to automatically infer compatible requirements by combining version and code compatibility analysis. PCREQ integrates six modules: knowledge acquisition, version compatibility assessment, invoked APIs and modules extraction, code compatibility assessment, version change, and missing TPL completion. PCREQ collects candidate versions, checks for conflicts, identifies API usage, evaluates code compatibility, and iteratively adjusts versions to generate a compatible requirements.txt with a detailed repair report. To evaluate PCREQ, we construct REQBench, a real-world benchmark with 2,095 upgrade scenarios derived from 34 real-world scientific/ML Python projects. Results show PCREQ achieves a 94.03% inference success rate, outperforming PyEGo (37.02%), ReadPyE (37.16%), and LLM-based approaches (GPT-4o, DeepSeek V3/R1) by 18--22%. PCREQ processes each scenario from REQBench in 60.79 s on average, demonstrating practical efficiency. PCREQ reduces manual effort in troubleshooting upgrades, advancing Python dependency maintenance automation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes