Designed for researchers.
The BugSwarm dataset and infrastructure were designed from the ground up to facilitate controlled experimentation at scale while minimizing barriers to usage.
BugSwarm is the largest dataset of its kind, with thousands of neatly packaged reproducible bugfixes and the ability to grow continuously.
Extensible artifacts allow contributors to easily add features. A modular mining pipeline fosters development of new mining algorithms.
The command line client, REST API, usage examples, artifact processing framework, and tutorials minimize barriers to usage for researchers.