mirdata: Software for Reproducible Usage of Datasets
By M. Fuentes
This library provides tools for working with common MIR datasets, including tools for:
- downloading datasets to a common location and format
- validating that the files for a dataset are all present
- loading annotation files to a common format, consistent with the format required by mir_eval
- parsing track level metadata for detailed evaluations.
Obtain mirdata: on GitHub
Related paper
Rachel M. Bittner, Magdalena Fuentes, David Rubinstein, Andreas Jansson, Keunwoo Choi, and Thor Kell. “mirdata: Software for Reproducible Usage of Datasets, “ in International Society for Music Information Retrieval (ISMIR) Conference, 2019.