Designed for researchers.
The BugSwarm dataset and infrastructure were designed from the ground up to facilitate controlled experimentation at scale while minimizing barriers to usage.
BugSwarm is the largest dataset of its kind, with thousands of neatly packaged, reproducible bug fixes and the ability to grow continuously.
Extensible artifacts allow contributors to easily add features. A modular mining pipeline fosters development of new mining algorithms.
The command-line client, REST API, usage examples, artifact-processing framework, and tutorials make it easy for researchers to get started.
Interested in BugSwarm updates?
Join the mailing list to stay informed as BugSwarm grows. We send strictly useful resources and no spam, and you can unsubscribe at any time.