Meet BugSwarm.

A dataset of (soon thousands of) real software bugs and bugfixes. Use BugSwarm to accelerate your research.


Explore BugSwarm Request Full Access

Designed for researchers.

The BugSwarm dataset and infrastructure were designed from the ground up to facilitate controlled experimentation at scale while minimizing barriers to usage.

Unprecedented Scale

BugSwarm is the largest dataset of its kind, with thousands of neatly packaged reproducible bugfixes and the ability to grow continuously.

Extensible Toolset

Extensible artifacts allow contributors to easily add features. A modular mining pipeline fosters development of new mining algorithms.

Robust Ecosystem

The command line client, REST API, usage examples, artifact processing framework, and tutorials minimize barriers to usage for researchers.

Interested in BugSwarm updates?

Join the mailing list to stay informed as BugSwarm grows. Strictly useful resources — no spam. And you can unsubscribe at any time.

Learn more about BugSwarm →