CY SEApr 24

What Should Frontier AI Developers Disclose About Internal Deployments?

Jacob Charnock, Raja Mehta Moreno, Justin Miller, William L. Anderson

arXiv:2604.2306571.0h-index: 6

AI Analysis

It addresses a gap in guidance for AI developers and regulators on what specific information to disclose about internal model deployments, which is important for safety and governance.

The paper identifies key information that frontier AI developers should disclose about internally deployed models across four categories: capabilities, usage, safety mitigations, and governance, providing a framework to improve transparency and oversight.

Frontier AI developers are increasingly deploying highly capable models internally to automate AI R&D, but these deployments currently face limited external oversight. It is essential, therefore, that developers provide evidence that internally deployed models are safe. While recent work has highlighted the risks of internal deployments and proposed broad approaches to transparency and governance, there remains little guidance on the specific information developers should disclose about them. We address this gap by identifying key information that companies should disclose about internally deployed models across four categories: capabilities, usage, safety mitigations, and governance. For each category, we analyse the key benefits and limitations of disclosure and consider how disclosure-related risks can be mitigated. Our framework could be used by developers to inform both public transparency documents, such as model system cards, and private periodic reports required under emerging frontier AI regulation.

View on arXiv PDF

Similar