Setup

For this example, the room measures 3x4x3 m and has a reverberation time of 0.3 s. A version of the same room with more reverberant walls is used in the fourth example.
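To give a feel for what these room parameters imply, the following is a minimal sketch that estimates the average wall absorption coefficient from the stated dimensions and reverberation time. It assumes Sabine's formula, T60 = 0.161 V / (S a), applies and that all surfaces absorb equally. Neither assumption is stated in the setup above, and the function name `sabine_absorption` is hypothetical.

```python
def sabine_absorption(dims_m, rt60_s):
    """Average absorption coefficient implied by Sabine's formula.

    Assumes a rectangular (shoebox) room with uniform absorption on all
    six surfaces. This is an illustrative assumption, not the method
    used to generate the examples on this page.
    """
    lx, ly, lz = dims_m
    volume = lx * ly * lz                          # m^3
    surface = 2.0 * (lx * ly + lx * lz + ly * lz)  # m^2
    # Sabine: T60 = 0.161 * V / (S * a), solved for a
    return 0.161 * volume / (surface * rt60_s)


# Room from the setup above: 3x4x3 m, reverberation time 0.3 s
alpha = sabine_absorption((3.0, 4.0, 3.0), 0.3)
print(f"average absorption coefficient: {alpha:.3f}")
```

Under these assumptions the room would need an average absorption coefficient of roughly 0.29. A longer reverberation time, as in the more reverberant version of the room, corresponds to a proportionally smaller coefficient.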

Mod-MFCC Based Clusters

cluster	reference microphone	masked reference signal	DSB signal
1
2

Speaker Embedding Clusters

cluster	reference microphone	masked reference signal	DSB signal
1
2

Discussion

Here, both the Mod-MFCC based features and the speaker embedding features give good cluster allocations. Note, however, that the speaker embedding features result in a larger cluster for source 1.

ASPIRE

Setup

Mod-MFCC Based Clusters

Speaker Embedding Clusters

Discussion