Abstract
In this paper, we describe the system developed by the Australian National University (ANU), National ICT Australia (NICTA) for multimedia event detection applied to the TRECVID-2011 video retrieval benchmark. Our system uses five audio and visual features, leverages training events with cascaded classifier training, and sees performance improvements with spatial semantic features. A summary of our submitted runs can be found below: The best run from our system ranks fourth in mean-ActualNDC, and third in mean- F1 metric, averaged over all ten events among sixty runs from nineteen teams.
Original language | English |
---|---|
Publication status | Published - 2011 |
Event | TREC Video Retrieval Evaluation, TRECVID 2011 - Gaithersburg, MD, United States Duration: 5 Dec 2011 → 7 Dec 2011 |
Conference
Conference | TREC Video Retrieval Evaluation, TRECVID 2011 |
---|---|
Country/Territory | United States |
City | Gaithersburg, MD |
Period | 5/12/11 → 7/12/11 |