The dataset consists of ten-second audio recordings. Each file contains between two and four non-overlapping acoustic events, and various background noises are also present. The possible acoustic events (classes) include, among others: Shatter, Bark, Doorbell, Shout, Cough, Camera, Church_bell, Scratching (performance technique), Fireworks, Burping and eructation, Meow, and Cheering.
For each audio file, an annotation file records the exact start and end times of the events as well as their respective classes. In total, the dataset comprises about 36 hours of audio material: 28 hours for training and 8 hours for testing.
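The exact layout of the annotation files is not specified here; a minimal parsing sketch, assuming the common tab-separated layout of one event per line with onset, offset, and label columns (an assumption, not the confirmed format):

```python
import csv

def load_annotations(path):
    """Parse an annotation file into (onset, offset, label) events.

    Assumes a tab-separated layout: onset<TAB>offset<TAB>label per line.
    The actual files in the dataset may use a different format.
    """
    events = []
    with open(path, newline="") as f:
        for row in csv.reader(f, delimiter="\t"):
            if len(row) < 3:
                continue  # skip malformed or empty lines
            onset, offset, label = float(row[0]), float(row[1]), row[2]
            events.append((onset, offset, label))
    return events
```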

The task is to detect acoustic events — such as a cat meowing or a bell ringing — and to precisely determine their respective start and end times.
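The scoring metric behind the leaderboard is not stated above. One common way to evaluate such sound event detection systems is an event-based F1 score, where a predicted event counts as correct if its onset falls within a small collar of a same-class reference event. A hedged sketch of that idea (the collar value and matching rule are illustrative assumptions, not the challenge's actual metric):

```python
def event_f1(references, predictions, collar=0.2):
    """Event-based F1 over (onset, offset, label) tuples.

    A prediction matches an as-yet-unmatched reference of the same class
    if their onsets differ by at most `collar` seconds. This is one
    common criterion; the challenge's real metric may differ.
    """
    matched = set()
    tp = 0
    for p_on, p_off, p_label in predictions:
        for i, (r_on, r_off, r_label) in enumerate(references):
            if i in matched:
                continue
            if r_label == p_label and abs(p_on - r_on) <= collar:
                matched.add(i)
                tp += 1
                break
    if tp == 0:
        return 0.0
    precision = tp / len(predictions)
    recall = tp / len(references)
    return 2 * precision * recall / (precision + recall)
```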
| Rank | Score | Team Name | Member Name(s) |
|---|---|---|---|
| 🥇1 | 0.6890 | Error040 | Lennart Heinbokel |
| 🥈2 | 0.5978 | L’audio_locaï | Andreas Baude, Jonas Klaff, Timo Urban |
| 🥉3 | 0.4570 | import teamName | Akira Janssen |
| 4 | 0.4159 | when_life_gives_you_data | Max Gaber, Jannes Adam, Alexander Jochim |
| 5 | 0.3923 | Atropos | Hannes Raith |