Generating a corpus of Hazel programs from ill-typed OCaml programs

Patrick Ferris, and Anil Madhavapeddy. In proceedings of the 2025 Workshop on Type-Driven Development (TyDe). .Patrick FerrisAnil Madhavapeddy

Generating a corpus of Hazel programs from ill-typed OCaml programs

Abstract

When developing a new programming language, having a large corpus of both correct and incorrect programs allows language designers to test and explore the capabilities of their new language. However, bootstrapping such a corpus of incorrect programs is time consuming and arduous. We therefore explore how to reuse code from more mature languages to generate a corpus of ill-typed code for newer ones.

We have developed a compiler to Hazel, an emerging language with typed holes, from the more mature OCaml ecosystem. We find it practical to generate a comprehensive corpus of ill-typed programs for Hazel development, and discuss future larger scale efforts towards bridging ecosystems.