Generation of Verified Assembly Code Using Dafny and Reinforcement Learning
As recently demonstrated by the AlphaDev algorithm, reinforcement learning can be used to generate assembly code that is more performant than a human implementation. However, there are no guarantees for the correctness of the generated code. Programs for simple tasks such as sorting short sequences of integers can be exhaustively tested, but this is not possible for more complex tasks. We describe how Dafny can be used to reason about the state of a machine during the execution of assembly code. This allows Dafny to verify that a given assembly program adheres to input-output constraints. The verification can be done automatically, without the need of human intervention. Therefore, Dafny can be integrated into the reinforcement learning process, and guide the agent to find efficient and verified solutions for complex tasks.
Sun 14 JanDisplayed time zone: London change
17:30 - 18:15 | |||
17:30 18mTalk | Generation of Verified Assembly Code Using Dafny and Reinforcement Learning Dafny | ||
17:48 18mTalk | CLOVER: Closed-Loop Verifiable Code Generation Dafny Chuyue Sun Stanford University, Ying Sheng Stanford University, Oded Padon VMware Research, Clark Barrett Stanford University | ||
18:06 9mDay closing | Day closing Dafny |