PEVD-Based Source Separation Using Informed Spherical Microphone Arrays

This website will demonstrate the source separation algorithm based on the PEVD of spherical microphone array beamformed signals. This work is developed by Vincent W. Neo, Christine Evers and Patrick A. Naylor. This is a joint work between the Speech and Audio Processing Lab at Imperial College London and the School of Electronics and Computer Science at University of Southampton in the U.K.

About This Work

Anechoic speech signals sampled at 16 kHz are taken from the TIMIT corpus [1]. The room impulses responses (RIRs) from the 2 sources to 32 microphones on a rigid sphere are simulated using the SMIRgen tool [2]. The T60 is varied b 0 s, 0.3 s and 0.7 s.

For each speech source, short utterances from a randomly selected speaker are concatenated to generate signals of 8 to 10 s duration. Each source signal was convolved with the RIR and mixed along with 30 dB sensor noise to generate the microphone signals. Eigenbeam signals are generated using the spherical harmonic (SH) transform [3,4]. The modal beamformers are directed at (90,90) in degrees for Source 1 and (90,90) in degrees for Source 2. Two microphones closest to each source were chosen for the comparative methods. The sequential matrix diagonalisation (SMD) algorithm [5] is used for all PEVD processing.