Intermediate semantics based distance metric learning for video annotation and similarity measurements

Wen Qu*, Xiangmin Zhou, Daling Wang, Shi Feng, Yifei Zhang, Ge Yu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The similarity metric between videos is integral to several key tasks,including video retrieval,classification and recommendation. Since there is no standard criterion for the similarity measurement between videos except measuring manually,it is difficult to collect large training dataset for distance metric learning algorithms. Moreover,the existing distance metric learning (DML) methods for multimedia data suffer from two critical limitations: (1) they typically attempt to learn a distance function on the single label setting,in which each item is only labeled with single label; (2) they are often designed for learning distance metrics on low-level features,which ignore the semantic similarity of the multimedia data. To address these problems,in this paper,we propose a novel framework of Intermediate Semantics based Distance Learning (ISDL) for video clips,which aims to integrate semantics of multiple modals optimally for distance metric learning. In particular,the proposed framework: (1) generates the training pairs automatically; (2) defines multi-modal concepts for similarity measure among videos; (3) learns the distance metric for video clips based on the intermediate semantics. We conduct an extensive set of experiments to evaluate the performance of the proposed algorithms,and the results validate the effectiveness of our proposed approach.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering – WISE 2016 - 17th International Conference, Proceedings
EditorsWojciech Cellary, Jianmin Wang, Mohamed F. Mokbel, Hua Wang, Rui Zhou, Yanchun Zhang
PublisherSpringer Verlag
Pages227-242
Number of pages16
ISBN (Print)9783319487397
DOIs
Publication statusPublished - 2016
Externally publishedYes
Event17th International Conference on Web Information Systems Engineering, WISE 2016 - Shanghai, China
Duration: 8 Nov 201610 Nov 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10041 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th International Conference on Web Information Systems Engineering, WISE 2016
Country/TerritoryChina
CityShanghai
Period8/11/1610/11/16

Fingerprint

Dive into the research topics of 'Intermediate semantics based distance metric learning for video annotation and similarity measurements'. Together they form a unique fingerprint.

Cite this