Of the original 202 tracks, 191 had agreement between 3 tempo trackers
on at least one metrical level. The tracker that "fails" to meet the
others in these cases is the Ellis tracker.
Once these tracks are removed, I have run our tempo tracker. With the
0.04 tolerance window as explained in the previous email, we miss a
total of 18 of these. Increasing the allowance window results in the
following:
allow%: 0.04 0.05 0.06 0.07 0.08 0.09 0.10
wrong: 18 15 13 8 6 6 5
My reasoning for this is the following. The three tempo trackers
contain 2 components: An initial tempo classifier, and a beat phase
calculation. In both Davies and Ellis, initial tempo is found using a
rough estimate. Dynamic programming is then used to find the best fit
of beats within the audio given this rough tempo calculation. A tempo
trace is then created from the inter-beat intervals (IBI). In ours
however, as I was asked to not program any further and just use the
tempo calculation, we are left with the rough tempo calculation from
the initial tempo estimate; no beat phase estimate is performed.
To test this hypothesis I took a look at the 18 tracks that were "incorrect".
1) Antidote - Off Dub -> this is a weird one. But its dubstep - and
all dubstep lies in the range of 140 (sometimes (but rarely) as slow
as 127, but no faster than 150). I opened this up in logic, and lined
up a metronome beat to it, and it was 140. Of the 4 trackers tested,
ours was the only one to get it right. All others found either 80 or
160.
2) Avry Tare - Ghosts of Books -> Klapuri gets it right at around 140
BPM. None of the other trackers get it. Ours guessed 120. The intro is
soft pads with slow attacks. My guess is that because the tempo
tracker is based on found onsets, 0 onsets = pre-emphasis curve only
with a peak at ~120. Will need to adjust code and look further into
this one.
3) Basic Channel - Octagon -> Strange that our tracker gets this one
wrong. Its a classic 139 4/4 minimal techno beat from the mid-90s.
There is some odd rhythm stuff that kicks in about 1/3 way into the
track, that is much slower, but since onset detection isn't piecing
the instruments apart, this should have no effect.
4) Bibio - All the Flowers -> this is a simple (and beautiful)
acoustic guitar/bass/vocal piece that has been artificially
constructed in a sequencer. Using logic as above, I hear the track
more as 6/8 which would play at 76 BPM. None of the trackers agree
with me. Klapuri's does identify a 101 BPM 4/4 track however, that
when overlaid on the music works harmonically, however doesn't quite
fit the vocals as cleanly as the 6/8 metronome. Ours found a similar
tempo as Davies and Ellis here.
5) Big Bud - Freedom -> drum and bass track from mid-late 90s. Our
tracker guessed 132 BPM. All 3 others had some level of 163 BPM
(correct) at some metrical level.
6) Boards of Canada - Bocuma -> slow ambient piece. All trackers other
than ours predict correctly 90 BPM. Ours guesses 115.
7) Commix - Japanese Electronics (Instra:Mental remix) -> modern drum
and bass piece set to 170 BPM. Ellis, Davies, and Klapuri have 170 BPM
on at least one metrical level. Ours does not. This track has a long
fade out of ambience and bass at the end. I'm assuming its a similar
problem to problem track #2, so when I adjust the tracker observation
vector, I will test this again.
8) Digital and Morphy - Shanti -> 172 BPM drum and bass track. Ours
guesses 187 BPM.
9) El-P - Patriotism -> Hip hop track from early 2000's. Three
trackers guess 85, ours guesses 88 BPM. I suspect that this is an
example of the hypothesized problem of "rough tempo" before beats are
accurately laid in. The problem is one of resolution, akin to trying
to identify an exact frequency from a 44.1kHz wavfile using a sliding
DFT with 512 points. While the audio has 44100/2 possible frequencies,
only 512 bins exist in the DFT.
10) Gerwin - Mistakes -> 140 BPM post-dubstep. The 3 others get ~140
BPM. Ours gets 112 BPM.
11) Getz & Nuage - Is The Way -> 170 BPM drum and bass track. 3 others
get it. Ours again guesses 112 BPM. There's something going on with
the 112 BPM predictions.
12) Low Limit - Turf Day -> Davies and Klapuri get 93 BPM for the 4/4
beat. Ours comes up with a similar answer to Ellis: ~116 BPM. There is
a synth zap sound that potentially causes a 5 feel. 5/4 *93 = 116 BPM.
13) Martsman - Marksman -> Klapuri is the only one that gets this one
right. Its 170 BPM (or some metrical level therein). Davies = 127 BPM;
Ellis + Ours = 114 BPM
14) Sanderson - Beautiful Click (SND remix) -> straightforward 172 BPM
drum and bass. Ours guesses 123 BPM. Not sure why this happened. Will
have a look at observation vector.
15) Stunna - Optima - Half-tempo drum and bass track. Probably
sequenced at 174 (with little drums hinting at the genre and speed),
but everything else about the track says 87 BPM. Ours gets 92. I think
that this is another example of rough-tempo calculation gone bad
(e.g., #9).
16) Teebee - Black Rain -> 175 BPM Drum and bass track. Somehow our
tracker is getting values in the 1000's. This definitely needs looking
into. The 3 others get it correct.
17) Triosk - Intensives Leben -> 150 Jazz track played to computerized
low-bit drums. 3 other algorithms get it correct. Ours says ~120 BPM.
18) Unknown - 0006 -> arpeggio at 120 BPM. Klapuri+Davies+Ours = 120
BPM, Davies = 95 BPM.
So, there are 17 errors as opposed to 18.
Next I'll check out why these errors are occuring by analyzing the
input observation vectors and output tempi of the tracks above. Will
report on that next.