Detecting generic visual events with temporal cues

Lexing Xie*, Dong Xu, Shahram Ebadollahi, Katya Scheinberg, Shih-Fu Chang, John R. Smith

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

1 Citation (Scopus)

Abstract

We present novel algorithms for detecting generic visual events in video. The target event models produce a binary decision for each shot about classes of events that involve object actions and their interactions with the scene, such as an airplane taking off, a person exiting a car, or a riot. While event detection has been studied in scenarios with strong scene and imaging assumptions, the detection of generic visual events in an unconstrained domain such as broadcast news has not been explored. This work extends our recent work [3] on event detection by (1) using a novel bag-of-features representation together with the earth mover's distance to account for temporal variations within a shot, and (2) learning the importance of the input modalities with a double-convex combination over both different kernels and different support vectors, which is in turn solved via multiple kernel learning. Experiments show that the bag-of-features representation significantly outperforms the static baseline; multiple kernel learning yields a promising performance improvement while providing intuitive explanations for the importance of the input kernels.
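The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each shot is a small bag of per-frame feature vectors with uniform weights and equal bag sizes, in which case the earth mover's distance reduces to the best one-to-one matching. The function names, the exponential kernel form, and the fixed kernel weights are all hypothetical; in the paper the combination weights are learned by multiple kernel learning rather than set by hand.

```python
import itertools
import math


def euclidean(a, b):
    """Euclidean ground distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def emd_uniform(bag_a, bag_b):
    """EMD between two equal-size bags with uniform weights.

    Under these assumptions the optimal flow is a one-to-one matching,
    so the EMD is the minimum mean matched distance. The search is brute
    force over permutations and is only meant for tiny bags.
    """
    if not bag_a or len(bag_a) != len(bag_b):
        raise ValueError("bags must be non-empty and of equal size")
    n = len(bag_a)
    best = min(
        sum(euclidean(bag_a[i], bag_b[perm[i]]) for i in range(n))
        for perm in itertools.permutations(range(n))
    )
    return best / n


def emd_kernel(bag_a, bag_b, scale=1.0):
    """Assumed kernel form: exp(-EMD / scale). Not taken from the paper."""
    return math.exp(-emd_uniform(bag_a, bag_b) / scale)


def combine_kernels(values, weights):
    """Convex combination of kernel values, one per input modality.

    The weights are fixed here for illustration only; they must be
    non-negative and sum to 1.
    """
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    return sum(w * v for w, v in zip(weights, values))


# Three frames per shot, two-dimensional features (made-up values).
shot_a = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
shot_b = [(2.0, 0.0), (1.0, 0.0), (0.0, 0.0)]  # same frames, reversed order
shot_c = [(0.0, 1.0), (1.0, 1.0), (2.0, 1.0)]  # shifted by one unit

print(emd_uniform(shot_a, shot_b))  # frame order does not matter
print(emd_uniform(shot_a, shot_c))
print(combine_kernels([emd_kernel(shot_a, shot_c), 1.0], [0.5, 0.5]))
```

Because the bag is an unordered set, reordering the frames of a shot leaves the distance unchanged, which is how such a representation can tolerate temporal variation within a shot.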

Original language: English
Title of host publication: Conference Record of the 40th Asilomar Conference on Signals, Systems and Computers, ACSSC '06
Pages: 54-58
Number of pages: 5
Publication status: Published - 2006
Externally published: Yes
Event: 40th Asilomar Conference on Signals, Systems, and Computers, ACSSC '06 - Pacific Grove, CA, United States
Duration: 29 Oct 2006 to 1 Nov 2006

Publication series

Name: Conference Record - Asilomar Conference on Signals, Systems and Computers
ISSN (Print): 1058-6393

Conference

Conference: 40th Asilomar Conference on Signals, Systems, and Computers, ACSSC '06
Country/Territory: United States
City: Pacific Grove, CA
Period: 29/10/06 to 1/11/06
