Fusion of Sound Source Localization and Face Detection for Supporting Human Behavior Analysis

Markus Niiranen, Janne Vehkaperä, Satu-Marja Mäkelä, Johannes Peltola, Tomi Räty

    Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

    1 Citation (Scopus)


    This paper describes a demonstrated concept implementation that combines sound source localization and face detection from video stream for supporting human behavior analysis. System monitors space containing multiple persons using microphone array and video camera. The aim is to detect which person in the scene is producing the sound that is received by the microphones. For this task the microphone array localizes the sound in the environment. Simultaneously face detection is performed to the video signal produced by the monitoring video camera. If face is detected from the bearing of the sound the system may decide that the sound is produced by the person who's face is detected. Preliminary results indicate that the fusion may give useful information for human behavior analysis for space containing multiple persons.
    Original languageEnglish
    Title of host publicationProceedings MobiMedia 2008
    Number of pages4
    Publication statusPublished - 2008
    MoE publication typeA4 Article in a conference publication
    Event4th International Mobile Multimedia Communications Conference - Oulu, Finland
    Duration: 7 Jul 20089 Jul 2008
    Conference number: 4


    Conference4th International Mobile Multimedia Communications Conference


    • Audio localization
    • audio detection
    • microphone arrays
    • face detection


    Dive into the research topics of 'Fusion of Sound Source Localization and Face Detection for Supporting Human Behavior Analysis'. Together they form a unique fingerprint.

    Cite this