This is a demonstration of sound event synthesis (SES) using event labels based on the conditional WaveNet [1]. As the dataset, we used 10 different sound events (manual coffee grinder, cup clinking, alarm clock ringing, whistle, maracas, drum, electric shaver, trash box banging, tearing paper, bell ringing) contained in the RWCP-SSD (Real World Computing Partnership-Sound Scene Database) [2].
You can download a zip file of original and synthesized sounds from here.
・Manual coffee grinder
・Cup
・Clock
・Whistle
・Maracas
・Drum
・Shaver
・Trash box
・Tearing paper
・Bell