Improving Perceptual Quality of Drum Transcription with the Expanded Groove MIDI Dataset

Online Supplement

Related Material

arXiv paper
Blog Post
E-GMD Dataset Download
Source Code
Colab Notebook

Contents

Dataset Examples
Listening Test Examples

Dataset Examples

These examples illustrate the diversity of sequence styles and drum kit styles available in the dataset. The full dataset contains 1,059 unique sequences synthesized using 43 drum kits for a total of 444 hours of examples, all with ground truth MIDI labels.
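
As a rough sanity check on these figures, the sketch below works through the arithmetic. It assumes that every one of the 1,059 sequences was rendered on every one of the 43 kits, which the text implies but does not state outright, and it treats the 444-hour total as a rounded value, so the average duration is only approximate. The variable names are illustrative and do not come from the dataset tooling.

```python
# Figures quoted in the dataset description above.
unique_sequences = 1059
drum_kits = 43
total_hours = 444  # rounded total, so results derived from it are approximate

# Assumption: each sequence is synthesized once per kit.
total_renderings = unique_sequences * drum_kits  # 45537

# Approximate average length of one rendered example, in seconds.
avg_seconds = total_hours * 3600 / total_renderings

print(f"{total_renderings} renderings, about {avg_seconds:.1f} s each on average")
```

Under that assumption the dataset holds 45,537 audio renderings, each averaging a little over half a minute.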

drummer7/session3/64_funk_112_beat_4-4

Kit Name Audio
Acoustic Kit
Raw Dnb
808

drummer4/session1/3_jazz-klezmer_152_beat_4-4

Kit Name Audio
Acoustic Kit
Alternative
909

drummer6/session1/1_rock_70_beat_6-8

Kit Name Audio
Acoustic Kit
Speed Metal
Cassette

Listening Test Examples

These are a few example questions from the Listening Test, which compared our model (OaF-Drums) with other transcription models. Raters heard an original drum performance and were asked to choose which of two synthesized transcriptions better captured its contents. Transcriptions from the different models were synthesized using a standard SoundFont.

Notice how the addition of velocity (how hard the drum was hit) adds significantly to the perceptual similarity of OaF-Drums to the original.
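
To make the role of velocity concrete, here is a minimal sketch of what is lost when a transcription carries only onsets and drum classes. The `DrumHit` type, the one-bar groove, and the constant velocity of 100 are all hypothetical choices made for illustration; they are not taken from OaF-Drums or from the baseline models. The note numbers follow the General MIDI percussion map (36 kick, 38 snare, 42 closed hi-hat).

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class DrumHit:
    onset_sec: float  # when the drum was struck
    pitch: int        # General MIDI percussion note number
    velocity: int     # MIDI velocity, 1-127 (how hard the drum was hit)


# Hypothetical groove: loud kick and snare, much softer hi-hats.
performance = [
    DrumHit(0.00, 36, 110),
    DrumHit(0.25, 42, 45),
    DrumHit(0.50, 38, 120),
    DrumHit(0.75, 42, 30),
]


def drop_velocity(hits, constant=100):
    """Keep onsets and pitches, but set every velocity to one assumed value."""
    return [replace(hit, velocity=constant) for hit in hits]


def velocity_range(hits):
    """Spread between the hardest and softest hit."""
    velocities = [hit.velocity for hit in hits]
    return max(velocities) - min(velocities)


flattened = drop_velocity(performance)
original_range = velocity_range(performance)
flattened_range = velocity_range(flattened)
```

In this toy case the original spans 90 velocity steps between its softest and hardest hits, while the flattened version spans none, so a synthesizer would play every hit at the same loudness.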

Raters saw only pairwise comparisons; for simplicity, each example below lists the outputs of all models together.

Example 1

Original
OaF-Drums
DT-Ensemble
ADTLib

Example 2

Original
OaF-Drums
DT-Ensemble
ADTLib

Example 3

Original
OaF-Drums
DT-Ensemble
ADTLib