TREC 2002 video track experiments at MediaTeam Oulu and VTT

Mika Rautiainen, Jani Penttilä, Dmitri Vorobiev, Kai Noponen, Pertti Väyrynen, Matti Hosio, Esa Matinmikko, Satu-Marja Mäkelä, Johannes Peltola, Timo Ojala, Tapio Seppänen

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientific

Abstract

In TREC 2002 Video Track MediaTeam Oulu and VTT Technical Research Centre of Finland participated jointly in semantic feature extraction, manual search and interactive search tasks. In the semantic feature extraction task, we sent results for semantic categories of cityscape, landscape, people, speech and instrumental sound. Spatio-temporal correlation of oriented gradient occurrences was used with example shots to detect shots containing people, cityscape or landscape. The audio signal features consisted of various statistical measurements and were used to detect shots containing speech or instrumental sound. Our video
browsing and retrieval system, VIRE, was used for manual and interactive search tasks. Our system offers two techniques for video retrieval: 1. Multi-modal indexing based on self-organizing feature maps with semantic filtering. 2. An interactive navigating tool that combines two inter-shot properties, temporal
coherency and metric similarities, into a view where database shots are presented in a lattice structure. We tested our interactive navigating tool with eight persons to obtain results for 25 pre-defined search topics. In this paper we give an overview of the approaches and a summary of the results.
Original languageEnglish
Title of host publicationText Retrieval Conference TREC 2002 Video Track
Number of pages12
Publication statusPublished - 2002
MoE publication typeB3 Non-refereed article in conference proceedings
EventText Retrieval Conference TREC 2002 Video Track - Gaithersburg, United States
Duration: 19 Nov 200222 Nov 2002

Conference

ConferenceText Retrieval Conference TREC 2002 Video Track
CountryUnited States
CityGaithersburg
Period19/11/0222/11/02

Fingerprint

Semantics
Feature extraction
Experiments
Acoustic waves
Self organizing maps

Cite this

Rautiainen, M., Penttilä, J., Vorobiev, D., Noponen, K., Väyrynen, P., Hosio, M., ... Seppänen, T. (2002). TREC 2002 video track experiments at MediaTeam Oulu and VTT. In Text Retrieval Conference TREC 2002 Video Track
Rautiainen, Mika ; Penttilä, Jani ; Vorobiev, Dmitri ; Noponen, Kai ; Väyrynen, Pertti ; Hosio, Matti ; Matinmikko, Esa ; Mäkelä, Satu-Marja ; Peltola, Johannes ; Ojala, Timo ; Seppänen, Tapio. / TREC 2002 video track experiments at MediaTeam Oulu and VTT. Text Retrieval Conference TREC 2002 Video Track. 2002.
@inproceedings{71d71cc4c910492f8e807e6af18dfde3,
title = "TREC 2002 video track experiments at MediaTeam Oulu and VTT",
abstract = "In TREC 2002 Video Track MediaTeam Oulu and VTT Technical Research Centre of Finland participated jointly in semantic feature extraction, manual search and interactive search tasks. In the semantic feature extraction task, we sent results for semantic categories of cityscape, landscape, people, speech and instrumental sound. Spatio-temporal correlation of oriented gradient occurrences was used with example shots to detect shots containing people, cityscape or landscape. The audio signal features consisted of various statistical measurements and were used to detect shots containing speech or instrumental sound. Our videobrowsing and retrieval system, VIRE, was used for manual and interactive search tasks. Our system offers two techniques for video retrieval: 1. Multi-modal indexing based on self-organizing feature maps with semantic filtering. 2. An interactive navigating tool that combines two inter-shot properties, temporalcoherency and metric similarities, into a view where database shots are presented in a lattice structure. We tested our interactive navigating tool with eight persons to obtain results for 25 pre-defined search topics. In this paper we give an overview of the approaches and a summary of the results.",
author = "Mika Rautiainen and Jani Penttil{\"a} and Dmitri Vorobiev and Kai Noponen and Pertti V{\"a}yrynen and Matti Hosio and Esa Matinmikko and Satu-Marja M{\"a}kel{\"a} and Johannes Peltola and Timo Ojala and Tapio Sepp{\"a}nen",
year = "2002",
language = "English",
booktitle = "Text Retrieval Conference TREC 2002 Video Track",

}

Rautiainen, M, Penttilä, J, Vorobiev, D, Noponen, K, Väyrynen, P, Hosio, M, Matinmikko, E, Mäkelä, S-M, Peltola, J, Ojala, T & Seppänen, T 2002, TREC 2002 video track experiments at MediaTeam Oulu and VTT. in Text Retrieval Conference TREC 2002 Video Track. Text Retrieval Conference TREC 2002 Video Track, Gaithersburg, United States, 19/11/02.

TREC 2002 video track experiments at MediaTeam Oulu and VTT. / Rautiainen, Mika; Penttilä, Jani; Vorobiev, Dmitri; Noponen, Kai; Väyrynen, Pertti; Hosio, Matti; Matinmikko, Esa; Mäkelä, Satu-Marja; Peltola, Johannes; Ojala, Timo; Seppänen, Tapio.

Text Retrieval Conference TREC 2002 Video Track. 2002.

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientific

TY - GEN

T1 - TREC 2002 video track experiments at MediaTeam Oulu and VTT

AU - Rautiainen, Mika

AU - Penttilä, Jani

AU - Vorobiev, Dmitri

AU - Noponen, Kai

AU - Väyrynen, Pertti

AU - Hosio, Matti

AU - Matinmikko, Esa

AU - Mäkelä, Satu-Marja

AU - Peltola, Johannes

AU - Ojala, Timo

AU - Seppänen, Tapio

PY - 2002

Y1 - 2002

N2 - In TREC 2002 Video Track MediaTeam Oulu and VTT Technical Research Centre of Finland participated jointly in semantic feature extraction, manual search and interactive search tasks. In the semantic feature extraction task, we sent results for semantic categories of cityscape, landscape, people, speech and instrumental sound. Spatio-temporal correlation of oriented gradient occurrences was used with example shots to detect shots containing people, cityscape or landscape. The audio signal features consisted of various statistical measurements and were used to detect shots containing speech or instrumental sound. Our videobrowsing and retrieval system, VIRE, was used for manual and interactive search tasks. Our system offers two techniques for video retrieval: 1. Multi-modal indexing based on self-organizing feature maps with semantic filtering. 2. An interactive navigating tool that combines two inter-shot properties, temporalcoherency and metric similarities, into a view where database shots are presented in a lattice structure. We tested our interactive navigating tool with eight persons to obtain results for 25 pre-defined search topics. In this paper we give an overview of the approaches and a summary of the results.

AB - In TREC 2002 Video Track MediaTeam Oulu and VTT Technical Research Centre of Finland participated jointly in semantic feature extraction, manual search and interactive search tasks. In the semantic feature extraction task, we sent results for semantic categories of cityscape, landscape, people, speech and instrumental sound. Spatio-temporal correlation of oriented gradient occurrences was used with example shots to detect shots containing people, cityscape or landscape. The audio signal features consisted of various statistical measurements and were used to detect shots containing speech or instrumental sound. Our videobrowsing and retrieval system, VIRE, was used for manual and interactive search tasks. Our system offers two techniques for video retrieval: 1. Multi-modal indexing based on self-organizing feature maps with semantic filtering. 2. An interactive navigating tool that combines two inter-shot properties, temporalcoherency and metric similarities, into a view where database shots are presented in a lattice structure. We tested our interactive navigating tool with eight persons to obtain results for 25 pre-defined search topics. In this paper we give an overview of the approaches and a summary of the results.

M3 - Conference article in proceedings

BT - Text Retrieval Conference TREC 2002 Video Track

ER -

Rautiainen M, Penttilä J, Vorobiev D, Noponen K, Väyrynen P, Hosio M et al. TREC 2002 video track experiments at MediaTeam Oulu and VTT. In Text Retrieval Conference TREC 2002 Video Track. 2002