Python Framework to analyse Git repositories
A stream-processing tool for filtering terabytes of GitHub Archive data on consumer hardware. Outputs to Parquet/JSONL with zero storage overhead.
Runtime analysis for Python programs
Code evolution analysis for Git repositories
Github mining tool for MSR research
Multi-language and extensible library for mining Git repositories