Speaker Segmentation Demo for Matlab

The target of the software is to divide speech into 3 classes: Silence, Male, Female. In Stage 1, speech is classified into voiced or unvoiced frames by applying Gabor filtering and energy tracking by a method of G. Evangelopoulos. In Stage 2, it is assumed that if two speakers exist, then they would have significant different fundamental frequency and energy below 150 Hz regions, i.e. one actor would tend to be bass and the other will tend to be soprano, these differences are tracked again with the GMM algorithm. This method can be found in [5] at Journal Publications Section.




Stackoverflow profile

profile for jimver04 at Stack Overflow, Q&A for professional and enthusiast programmers

Stackoverflow Reputation curve

Google scholar citations per year

Google scholar citations per year
Click image for more details

Total Blog Pageviews