TR#491: SOUND SCENE SEGMENTATION BY DYNAMIC DETECTION OF CORRELOGRAM COMODULATION

Eric D. Scheirer

To appear in Proceedings of the 1999 IJCAI Workshop on Computational Auditory Scene Analysis, Stockholm, Aug. 1999

A new technique for sound-scene analysis is presented. The technique discovers common modulation behavior among groups of frequency subbands in the autocorrelogram domain. It first estimates, from the autocorrelogram, the amplitude modulation and period modulation of each channel at each time step, and then uses dynamic clustering techniques to group together channels with similar modulation behavior. Implementation details of the analysis technique are presented, and its performance is demonstrated on a test sound.
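As a rough illustration of the two stages named in the abstract, the following Python sketch estimates per-channel modulation from an autocorrelogram and then groups channels whose modulation is similar. It is a minimal sketch under stated assumptions, not the implementation described in the paper. The function names (`modulation_features`, `group_channels`), the array layout `(frames, channels, lags)`, the use of zero-lag energy as an amplitude proxy and of the strongest nonzero-lag peak as a period proxy, and the `threshold` parameter are all hypothetical. The grouping step is a simple greedy threshold grouping over a whole excerpt, standing in for the dynamic clustering mentioned above, whose details the abstract does not give.

```python
import numpy as np

def modulation_features(correlogram, min_lag=1):
    """Estimate per-channel amplitude and period modulation.

    correlogram: array of shape (frames, channels, lags) holding
    nonnegative-lag autocorrelations (assumed layout, not from the paper).
    Returns (am, pm), each of shape (frames - 1, channels).
    """
    # Amplitude proxy (assumption): zero-lag autocorrelation, i.e. channel energy.
    energy = correlogram[:, :, 0]
    # Period proxy (assumption): lag of the strongest value at or beyond min_lag.
    period = np.argmax(correlogram[:, :, min_lag:], axis=2) + min_lag
    # Modulation is taken as the frame-to-frame change in the log domain,
    # so that it measures relative rather than absolute change.
    am = np.diff(np.log(energy + 1e-12), axis=0)
    pm = np.diff(np.log(period.astype(float)), axis=0)
    return am, pm

def group_channels(am, pm, threshold=0.2):
    """Greedy grouping of channels by modulation similarity (a stand-in
    for dynamic clustering). Returns one integer label per channel."""
    # One feature vector per channel: its AM and PM trajectories, concatenated.
    feats = np.concatenate([am, pm], axis=0).T
    labels = -np.ones(len(feats), dtype=int)
    seeds = []  # feature vector of the first channel assigned to each group
    for ch, f in enumerate(feats):
        for g, s in enumerate(seeds):
            # RMS distance between this channel and the group's seed.
            if np.linalg.norm(f - s) / np.sqrt(f.size) < threshold:
                labels[ch] = g
                break
        else:
            # No existing group is close enough: start a new one.
            seeds.append(f)
            labels[ch] = len(seeds) - 1
    return labels
```

Given a correlogram array `corr`, calling `am, pm = modulation_features(corr)` followed by `group_channels(am, pm)` yields one label per channel, and channels sharing a label are treated as comodulated. The greedy scheme depends on channel order and on the chosen threshold, so it should be read only as a placeholder for a proper clustering method.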