Unsupervised speaker change detection for mobile device recorded speech

Research output: Chapter in Book/Report/Conference proceeding > Conference article in proceedings > Scientific > peer-review

5 Citations (Scopus)

Abstract

In this paper we propose an unsupervised speaker change detection (SCD) system developed for mobile device applications. We use the Bayesian information criterion (BIC) to find initial speaker changes, which are then verified or discarded in a second phase using a modified BIC and silence detector information. Using silence information after the initial BIC pass helps separate real changes from noise peaks. An enhanced peak detector adjusts the BIC penalty parameter automatically, which improves robustness and feasibility. Improved BIC-based false alarm compensation (FAC) effectively merges consecutive segments belonging to the same speaker. Our experiments show that the algorithm is robust and produces very satisfactory results on difficult speech data recorded with mobile phones.
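
As a rough illustration of the kind of BIC test the abstract refers to, the sketch below computes the standard ΔBIC score for a candidate change point under full-covariance Gaussian models, followed by a simple merge pass that drops boundaries whose ΔBIC is not positive. This is the textbook formulation only, not the authors' modified BIC, enhanced peak detector, or silence-based verification. The function names, the `penalty_weight` parameter, and the diagonal loading constant are assumptions made for this example.

```python
import numpy as np


def _logdet_cov(frames):
    """Log-determinant of the maximum-likelihood covariance of frames (n, d)."""
    cov = np.atleast_2d(np.cov(frames, rowvar=False, bias=True))
    # Assumption: small diagonal loading keeps the determinant finite
    # when a segment is short or nearly degenerate.
    cov = cov + 1e-6 * np.eye(cov.shape[0])
    _, logdet = np.linalg.slogdet(cov)
    return logdet


def delta_bic(frames, split, penalty_weight=1.0):
    """Standard delta-BIC for splitting frames (n, d) at index `split`.

    A positive value favours two separate Gaussians (a change point);
    a negative value favours a single Gaussian (no change).
    """
    n, d = frames.shape
    n1, n2 = split, n - split
    gain = 0.5 * (
        n * _logdet_cov(frames)
        - n1 * _logdet_cov(frames[:split])
        - n2 * _logdet_cov(frames[split:])
    )
    # Extra free parameters of the two-model hypothesis: mean + covariance.
    n_params = d + 0.5 * d * (d + 1)
    penalty = 0.5 * penalty_weight * n_params * np.log(n)
    return gain - penalty


def merge_false_alarms(frames, changes, penalty_weight=1.0):
    """Keep only candidate change points whose delta-BIC is positive.

    Hypothetical merge pass: each candidate is re-tested on the window
    spanning the current (possibly already merged) left segment and the
    segment to its right; rejected candidates merge the two segments.
    """
    kept = []
    start = 0
    bounds = sorted(changes) + [len(frames)]
    for cp, end in zip(bounds[:-1], bounds[1:]):
        if delta_bic(frames[start:end], cp - start, penalty_weight) > 0:
            kept.append(cp)
            start = cp
    return kept
```

With feature frames (for example, cepstral vectors) stacked row-wise in `frames`, `delta_bic(frames, split)` scores one candidate boundary, and `merge_false_alarms(frames, candidates)` returns the candidates that survive re-testing. Raising `penalty_weight` makes the test more conservative, which is the parameter the abstract says the peak detector tunes automatically.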
Original language: English
Title of host publication: IEEE International Conference on Acoustics, Speech, and Signal Processing. Honolulu, HI, USA, 15-20 April 2007. IEEE Cat. No. 07CH378
Publisher: Institute of Electrical and Electronic Engineers IEEE
ISBN (Electronic): 1-4244-0728-1
ISBN (Print): 1-4244-0727-3
DOIs: https://doi.org/10.1109/ICASSP.2007.366346
Publication status: Published - 2007
MoE publication type: A4 Article in a conference publication
Event: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 - Honolulu, United States
Duration: 15 Apr 2007 to 20 Apr 2007

Publication series

ISSN (Print): 1520-6149
ISSN (Electronic): 2379-190X

Conference

Conference: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
Abbreviated title: ICASSP '07
Country: United States
City: Honolulu
Period: 15/04/07 to 20/04/07

Fingerprint

Mobile devices
Detectors
Mobile phones
Decision making
Experiments
Compensation and Redress

Keywords

  • Metadata
  • Mobile audio segmentation
  • Multimedia database
  • Speaker change detection
  • Speaker segmentation

Cite this

Vuorinen, O., Peltola, J., & Mäkelä, S-M. (2007). Unsupervised speaker change detection for mobile device recorded speech. In IEEE International Conference on Acoustics, Speech, and Signal Processing. Honolulu, HI, USA, 15-20 April 2007. IEEE Cat. No. 07CH378 Institute of Electrical and Electronic Engineers IEEE. https://doi.org/10.1109/ICASSP.2007.366346
@inproceedings{a09f7772bb414373a144017453fb11bc,
title = "Unsupervised speaker change detection for mobile device recorded speech",
keywords = "Metadata, Mobile audio segmentation, Multimedia database, Speaker change detection, Speaker segmentation",
author = "Olli Vuorinen and Johannes Peltola and Satu-Marja M{\"a}kel{\"a}",
year = "2007",
doi = "10.1109/ICASSP.2007.366346",
language = "English",
isbn = "1-4244-0727-3",
publisher = "Institute of Electrical and Electronic Engineers IEEE",
booktitle = "IEEE International Conference on Acoustics, Speech, and Signal Processing. Honolulu, HI, USA, 15-20 April 2007. IEEE Cat. No. 07CH378",
address = "United States",

}


