Macro- and Micro-benchmarking in OCaml

This is an idea proposed in 2012 as a Cambridge Computer Science Part II project, and has been completed by Sebastian Funk.

Benchmarking involves the measurement of statistics such as run-time, memory allocations, garbage collections in a running program in order to analyze its performance and behaviour. To scientifically evaluate and understand the performance of a program, there is often a cycle of:

making performance observations about the program
finding a potential hypothesis, i.e. a cause for this performance behaviour
making predictions on experiments based on this hypothesis
comparing the predictions against the actual benchmark results to evaluate the hypothesis.

To be able to do all this, there is a need for an effective and robust framework to continuously make these observations that is not biased by the choice of hypothesis or the observation made. In general, any sort of improvement relies on robust and precise measurements.

Benchmarking can be split into two perspectives: micro-benchmarking, measuring a single (small) function repeatedly to collect statistics for a regression, and macro-benchmarking, measuring the performance of a complete program or library, often in a single-run. This project aims to improve the benchmarking infrastructure in OCaml, both at micro- and macro-benchmarking.

The project aims to add event tracing into OCaml, via instrumentation to the Core Bench library using Camlp4. The event-tracing tool is then a way for macro-benchmarking together with the multivariate regression for micro-benchmarking to analyze the performance of commonly used libraries to exhibit and explain abnormalities and performance differences in implementations. On a meta-level this study will give an insight into which predictors are useful for a multivariate regression in which circumstances to provide interesting results and how event-tracing can be used efficiently and compactly in large libraries.

Real World OCaml: Functional Programming for the Masses

Links

The dissertation is available on request to students from Anil Madhavapeddy but isn't online anywhere. The source code (a CamlP4 event tracer) has been superceded by modern event tracing.

Sebastian Funk went on to work at Jane Street on OCaml after his project, and one 2019 talk on his subsequent work can be seen below.

# 1st Jan 2012

ideas benchmarking idea-done idea-medium janestreet ocaml systems

Anil Madhavapeddy, Professor of Planetary Computing

Macro- and Micro-benchmarking in OCaml

Links

Related News

Real World OCaml: Functional Programming for the Masses / Oct 2022

OCaml Labs / Jan 2012

Macro- and Micro-benchmarking in OCaml

Related Reading

Links

Related News

Real World OCaml: Functional Programming for the Masses / Oct 2022

OCaml Labs / Jan 2012