Fusion of Sound Source Localization and Face Detection for Supporting Human Behavior Analysis

Markus Niiranen, Janne Vehkaperä, Satu-Marja Mäkelä, Johannes Peltola, Tomi Räty

Research output: Chapter in Book/Report/Conference proceeding › Conference article in proceedings › Scientific › peer-review

1 Citation (Scopus)

Abstract

This paper describes a demonstrated concept implementation that combines sound source localization with face detection from a video stream to support human behavior analysis. The system monitors a space containing multiple persons using a microphone array and a video camera. The aim is to detect which person in the scene is producing the sound received by the microphones. For this task, the microphone array localizes the sound in the environment. Simultaneously, face detection is performed on the video signal produced by the monitoring camera. If a face is detected at the bearing of the sound, the system may decide that the sound is produced by the person whose face is detected. Preliminary results indicate that the fusion may give useful information for human behavior analysis in spaces containing multiple persons.
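The fusion step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not say how bearings are computed or compared, so the sketch assumes an ideal pinhole camera, a microphone array co-located with the camera and sharing its reference axis, and a fixed angular tolerance. The names `Face`, `face_bearing_deg`, and `match_speaker`, and all default parameter values, are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Face:
    """Horizontal extent of a detected face box, in pixels (hypothetical)."""
    x: float      # left edge of the bounding box
    width: float  # width of the bounding box


def face_bearing_deg(face: Face, image_width: int, hfov_deg: float) -> float:
    """Bearing of the face centre relative to the optical axis, in degrees.

    Assumes an ideal pinhole camera; negative values are left of centre.
    """
    focal_px = (image_width / 2) / math.tan(math.radians(hfov_deg) / 2)
    offset_px = face.x + face.width / 2 - image_width / 2
    return math.degrees(math.atan2(offset_px, focal_px))


def match_speaker(
    sound_bearing_deg: float,
    faces: Sequence[Face],
    image_width: int = 640,       # assumed frame width
    hfov_deg: float = 60.0,       # assumed horizontal field of view
    tolerance_deg: float = 10.0,  # assumed matching tolerance
) -> Optional[int]:
    """Return the index of the face closest to the sound bearing.

    Returns None when no face lies within the tolerance, in which case
    the sound is not attributed to any visible person.
    """
    best_index, best_error = None, tolerance_deg
    for i, face in enumerate(faces):
        bearing = face_bearing_deg(face, image_width, hfov_deg)
        error = abs(bearing - sound_bearing_deg)
        if error <= best_error:
            best_index, best_error = i, error
    return best_index
```

Given two faces near the left and right edges of a 640-pixel frame, a sound bearing close to the right-hand face's bearing would select that face, while a sound arriving from straight ahead would match neither. In a real system the tolerance would depend on the angular resolution of the microphone array and on the calibration between array and camera.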
Original language: English
Title of host publication: Proceedings MobiMedia 2008
Number of pages: 4
DOIs: https://doi.org/10.4108/ICST.MOBIMEDIA2008.4071
Publication status: Published - 2008
MoE publication type: A4 Article in a conference publication
Event: 4th International Mobile Multimedia Communications Conference - Oulu, Finland
Duration: 7 Jul 2008 → 9 Jul 2008
Conference number: 4

Conference

Conference: 4th International Mobile Multimedia Communications Conference
Country: Finland
City: Oulu
Period: 7/07/08 → 9/07/08

Fingerprint

Face recognition
Fusion reactions
Acoustic waves
Microphones
Video cameras
Bearings (structural)
Monitoring

Keywords

  • Audio localization
  • audio detection
  • microphone arrays
  • face detection

Cite this

@inproceedings{273a79989fe9411bafab226188d772f8,
title = "Fusion of Sound Source Localization and Face Detection for Supporting Human Behavior Analysis",
abstract = "This paper describes a demonstrated concept implementation that combines sound source localization with face detection from a video stream to support human behavior analysis. The system monitors a space containing multiple persons using a microphone array and a video camera. The aim is to detect which person in the scene is producing the sound received by the microphones. For this task, the microphone array localizes the sound in the environment. Simultaneously, face detection is performed on the video signal produced by the monitoring camera. If a face is detected at the bearing of the sound, the system may decide that the sound is produced by the person whose face is detected. Preliminary results indicate that the fusion may give useful information for human behavior analysis in spaces containing multiple persons.",
keywords = "Audio localization, audio detection, microphone arrays, face detection",
author = "Markus Niiranen and Janne Vehkaper{\"a} and Satu-Marja M{\"a}kel{\"a} and Johannes Peltola and Tomi R{\"a}ty",
year = "2008",
doi = "10.4108/ICST.MOBIMEDIA2008.4071",
language = "English",
isbn = "978-963-9799-25-7",
booktitle = "Proceedings MobiMedia 2008",
}

Niiranen, M, Vehkaperä, J, Mäkelä, S-M, Peltola, J & Räty, T 2008, Fusion of Sound Source Localization and Face Detection for Supporting Human Behavior Analysis. in Proceedings MobiMedia 2008. 4th International Mobile Multimedia Communications Conference, Oulu, Finland, 7/07/08. https://doi.org/10.4108/ICST.MOBIMEDIA2008.4071

Fusion of Sound Source Localization and Face Detection for Supporting Human Behavior Analysis. / Niiranen, Markus; Vehkaperä, Janne; Mäkelä, Satu-Marja; Peltola, Johannes; Räty, Tomi.

Proceedings MobiMedia 2008. 2008.

TY  - GEN
T1  - Fusion of Sound Source Localization and Face Detection for Supporting Human Behavior Analysis
AU  - Niiranen, Markus
AU  - Vehkaperä, Janne
AU  - Mäkelä, Satu-Marja
AU  - Peltola, Johannes
AU  - Räty, Tomi
PY  - 2008
Y1  - 2008
AB  - This paper describes a demonstrated concept implementation that combines sound source localization with face detection from a video stream to support human behavior analysis. The system monitors a space containing multiple persons using a microphone array and a video camera. The aim is to detect which person in the scene is producing the sound received by the microphones. For this task, the microphone array localizes the sound in the environment. Simultaneously, face detection is performed on the video signal produced by the monitoring camera. If a face is detected at the bearing of the sound, the system may decide that the sound is produced by the person whose face is detected. Preliminary results indicate that the fusion may give useful information for human behavior analysis in spaces containing multiple persons.
KW  - Audio localization
KW  - audio detection
KW  - microphone arrays
KW  - face detection
U2  - 10.4108/ICST.MOBIMEDIA2008.4071
DO  - 10.4108/ICST.MOBIMEDIA2008.4071
M3  - Conference article in proceedings
SN  - 978-963-9799-25-7
BT  - Proceedings MobiMedia 2008
ER  -