LLMpedia: the first transparent, open encyclopedia generated by LLMs

DaCapo (benchmark)

Generated by GPT-5-mini
Note: This article was automatically generated by a large language model (LLM) from purely parametric knowledge (no retrieval). It may contain inaccuracies or hallucinations. This encyclopedia is part of a research project currently under review.
Article Genealogy
Parent: Amazon Corretto (Hop 4)
Expansion Funnel: Raw 70 → Dedup 0 → NER 0 → Enqueued 0
1. Extracted: 70
2. After dedup: 0 (None)
3. After NER: 0
4. Enqueued: 0
DaCapo (benchmark)
Name: DaCapo (benchmark)
Title: DaCapo (benchmark)
Developer: DaCapo Project
Released: 2006
Latest release: 2009
Programming language: Java
Operating system: Cross-platform
Genre: Benchmark
License: Open-source

DaCapo (benchmark) is a Java benchmarking suite designed to evaluate the performance of Java Virtual Machines, garbage collectors, just-in-time compilers, and runtime systems. It provides realistic, open-source workloads derived from widely used Java applications and libraries to exercise compiler backends, runtime services, and memory management features across diverse platforms. The suite has been used by academic researchers, engineering teams, and vendors to compare implementations from projects such as OpenJDK, Oracle Corporation, IBM, Red Hat, and ARM Holdings.

Overview

DaCapo originated as a collection of long-running, real-world Java applications intended to complement smaller-scale suites such as SPECjvm98 and SPECjvm2008. It targets measurement of throughput, latency, and warm-up behavior for systems including HotSpot, GraalVM, Eclipse OpenJ9 (formerly IBM J9), and research JVMs developed at institutions such as University of California, Berkeley, Imperial College London, Princeton University, and MIT. The suite emphasizes reproducibility and portability across platforms from Intel Corporation x86 servers to ARM-based mobile processors used in devices by Samsung Electronics and Qualcomm.

History and development

The project was initiated by academics and industry collaborators to address limitations observed in prior suites sponsored by organizations like SPEC and studies published through venues such as ACM SIGPLAN, USENIX, and IEEE. Early contributors included researchers affiliated with Brandeis University, Rochester Institute of Technology, and companies including Sun Microsystems and HP. Major milestones included the 2006 initial release, a subsequent 2009 revision that refined workloads and harnesses, and later forks and derivatives used in studies at conferences like PLDI, OOPSLA, and ASPLOS.

Benchmark suite components

The suite bundles multiple benchmarks drawn from open-source projects and canonical applications. Representative components originate from projects such as Eclipse (software), Lucene, Jython, SPECweb99-inspired workloads, Apache Tomcat, H2 (database), and Xalan. The package includes a harness for configuring and running the benchmarks, input datasets derived from archives like Project Gutenberg and Common Crawl, and utility scripts compatible with build systems such as Apache Ant, Maven, and Gradle. Researchers integrate DaCapo workloads with tools including JVM TI, Linux perf, DTrace, and profilers such as Intel VTune and Valgrind.

Methodology and workload characteristics

DaCapo workloads are designed to be long-running and to exhibit warm-up phases that reveal interactions among just-in-time compilation, inlining, and garbage collection. Methodological guidance references measurement techniques discussed at ACM and IEEE venues and leverages statistical methods from works by authors at Stanford University and CMU. Workload characteristics include varied allocation rates, object lifetimes, and thread contention patterns, making the suite relevant for evaluating collectors like G1 Garbage Collector, CMS (Concurrent Mark Sweep), and research collectors developed at Microsoft Research and Oracle Labs. Input sets and configurations mimic realistic scenarios used by organizations such as Google, Facebook, Twitter, and Amazon (company).
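The warm-up methodology described above can be sketched in plain Java: run a workload repeatedly, discard the initial iterations that are dominated by class loading and just-in-time compilation, and report only the steady-state mean. This is not code from the DaCapo harness itself; the workload below is a hypothetical stand-in, and the iteration and warm-up counts are illustrative assumptions.

```java
import java.util.ArrayList;
import java.util.List;

public class WarmupTimer {

    // Hypothetical stand-in workload; a real measurement would invoke a
    // DaCapo benchmark iteration here instead.
    static long workload(int size) {
        long sum = 0;
        for (int i = 0; i < size; i++) {
            sum += i * 31L;
        }
        return sum;
    }

    // Time `iters` iterations of the workload and average only those after
    // the first `warmup` iterations, which are discarded as JIT warm-up.
    static double steadyStateMillis(int iters, int warmup, int size) {
        List<Double> steady = new ArrayList<>();
        for (int i = 0; i < iters; i++) {
            long t0 = System.nanoTime();
            workload(size);
            long t1 = System.nanoTime();
            if (i >= warmup) {
                steady.add((t1 - t0) / 1e6);
            }
        }
        return steady.stream().mapToDouble(Double::doubleValue).average().orElse(0.0);
    }

    public static void main(String[] args) {
        double avg = steadyStateMillis(10, 3, 1_000_000);
        System.out.printf("steady-state mean: %.3f ms%n", avg);
    }
}
```

The DaCapo harness automates this pattern with a configurable iteration count, which is why published results typically distinguish first-iteration (cold) from converged (warm) timings.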

Performance evaluation and usage

DaCapo has been used in comparative studies to measure throughput, pause times, and compilation overhead across JVM implementations including HotSpot and GraalVM. Papers and vendor reports use DaCapo for performance-regression testing, compiler tuning, and garbage collector research at institutions like University of Cambridge and companies such as IBM and Red Hat. The suite integrates with continuous integration systems such as GitHub Actions, GitLab CI, and Jenkins to detect performance regressions and to evaluate patch-level changes in projects like OpenJDK and Eclipse OpenJ9.
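A regression check of the kind a CI pipeline might apply to benchmark timings can be reduced to a simple threshold comparison. The sketch below is a minimal illustration, not part of DaCapo or any named CI system; the 5% tolerance is an assumed example value.

```java
public class RegressionCheck {

    // Flag a regression when the current mean exceeds the baseline mean by
    // more than `tolerance` (e.g. 0.05 for a 5% slowdown budget).
    static boolean isRegression(double baselineMs, double currentMs, double tolerance) {
        return currentMs > baselineMs * (1.0 + tolerance);
    }

    public static void main(String[] args) {
        // A 108 ms result against a 100 ms baseline breaches a 5% budget.
        System.out.println(isRegression(100.0, 108.0, 0.05)); // prints true
        // A 103 ms result stays within it.
        System.out.println(isRegression(100.0, 103.0, 0.05)); // prints false
    }
}
```

In practice such checks compare statistically aggregated runs (means with confidence intervals) rather than single timings, since JIT and GC noise can easily exceed a few percent.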

Limitations and criticism

Critics note that, despite its realism relative to microbenchmarks, the suite has a limited set of applications compared with the broader Java ecosystem represented by projects at Apache Software Foundation and commercial services at Netflix. Some studies observed that DaCapo workloads emphasize long-running server-side behaviors and may underrepresent short-lived mobile or embedded use cases central to firms like Apple Inc. and Google LLC. Other limitations include maintenance stagnation after formal releases, leading to forks and successor suites used by research groups at ETH Zurich and University of Illinois Urbana-Champaign.

Category:Benchmark suites Category:Java platform