Speaker Segmentation Demo for Matlab

The target of the software is to divide speech into 3 classes: Silence, Male, Female. In Stage 1, speech is classified into voiced or unvoiced frames by applying Gabor filtering and energy tracking by a method of G. Evangelopoulos. In Stage 2, it is assumed that if two speakers exist, then they would have significant different fundamental frequency and energy below 150 Hz regions, i.e. one actor would tend to be bass and the other will tend to be soprano, these differences are tracked again with the GMM algorithm. This method can be found in [5] at Journal Publications Section.

Posted atJanuary 01, 2010

Dimitrios A. Ververidis

Speaker Segmentation Demo for Matlab

My Profile

Job 2025 - ...

Job 2022 - 2025

Job 2025 - Now

2012 - 2022

2010-2011

2009-2010

2003-2008

2001-2003

1996-2001

Popular Posts

Pages

Google scholar citations per year

Total Blog Pageviews

Contact Form

Archive