What’s that sound? Audio event detection

The Challenge

Data

The database consists of audio recordings, each ten seconds long. Each audio file contains between two and four acoustic events that do not overlap. In addition, various background noises are present in the recordings. The possible acoustic events (classes) include, among others: Shatter, Bark, Doorbell, Shout, Cough, Camera, Church_bell, Scratching (performance technique), Fireworks, Burping and eructation, Meow and Cheering.

For each audio file, there is an annotation file that contains the exact start and end times of the events as well as their respective classes. In total, the dataset comprises about 36 hours of audio material, with 28 hours designated for training and 8 hours for testing purposes.
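To get a feel for the size of the dataset, the stated durations can be converted into approximate clip counts. The snippet below is only a back-of-the-envelope sketch: it assumes every recording is exactly ten seconds long and treats the "about 36 hours" figure as exact, so the real file counts may differ. The function name `approx_clip_count` is made up for this illustration.

```python
CLIP_SECONDS = 10  # each recording is ten seconds long


def approx_clip_count(hours: float, clip_seconds: int = CLIP_SECONDS) -> int:
    """Approximate number of fixed-length clips contained in `hours` of audio."""
    return round(hours * 3600 / clip_seconds)


# Durations taken from the dataset description (approximate figures).
total_clips = approx_clip_count(36)
train_clips = approx_clip_count(28)
test_clips = approx_clip_count(8)

print(total_clips, train_clips, test_clips)
```

Under these assumptions, the training portion makes up roughly 78% of the clips and the test portion roughly 22%.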

Task

The task is to detect acoustic events, such as a cat meowing or a bell ringing, and to precisely determine their respective start and end times.
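One common way to produce start and end times is to classify short frames of the audio and then merge consecutive frames with the same class into events. The sketch below illustrates only that merging step. It is not the method used by the organizers or by any team; the `Event` class, the `frames_to_events` function, the 0.1-second frame hop, and the use of `None` for background frames are all assumptions made for this example.

```python
from dataclasses import dataclass
from itertools import groupby
from typing import List, Optional, Sequence


@dataclass
class Event:
    label: str    # event class, e.g. "Meow"
    start: float  # start time in seconds
    end: float    # end time in seconds


def frames_to_events(
    frame_labels: Sequence[Optional[str]], hop_seconds: float
) -> List[Event]:
    """Merge runs of identical frame labels into events.

    `None` marks a background frame (assumption for this sketch).
    """
    events: List[Event] = []
    index = 0  # index of the first frame in the current run
    for label, run in groupby(frame_labels):
        length = len(list(run))
        if label is not None:
            events.append(
                Event(label, index * hop_seconds, (index + length) * hop_seconds)
            )
        index += length
    return events


# Hypothetical frame-wise predictions for one ten-second clip (100 frames).
frames = [None] * 10 + ["Meow"] * 20 + [None] * 30 + ["Bark"] * 15 + [None] * 25
example_events = frames_to_events(frames, hop_seconds=0.1)
for event in example_events:
    print(f"{event.label}: {event.start:.1f}s to {event.end:.1f}s")
```

Because the events in this dataset do not overlap, a single label per frame is enough here; overlapping events would need one activity track per class instead.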

Results

Student Track

Rank | Score  | Team Name                | Member Name(s)
🥇 1 | 0.6890 | Error040                 | Lennart Heinbokel
🥈 2 | 0.5978 | L’audio_locaï            | Andreas Baude, Jonas Klaff, Timo Urban
🥉 3 | 0.4570 | import teamName          | Akira Janssen
4    | 0.4159 | when_life_gives_you_data | Max Gaber, Jannes Adam, Alexander Jochim
5    | 0.3923 | Atropos                  | Hannes Raith