TY - GEN
T1 - Boosting retrieval of digital spoken content
AU - Pereira Nunes, Bernardo
AU - Mera, Alexander
AU - Casanova, Marco A.
AU - Kawase, Ricardo
PY - 2013
Y1 - 2013
N2 - Every day, the Internet expands as millions of new multimedia objects are uploaded in the form of audio, video and images. While traditional text-based content is indexed by search engines, this indexing cannot be applied to audio and video objects, resulting in a plethora of multimedia content that is inaccessible to a majority of online users. To address this issue, we introduce a technique of automatic, semantically enhanced, description generation for multimedia content. The objective is to facilitate indexing and retrieval of the objects with the help of traditional search engines. Essentially, the technique generates static Web pages automatically, which describe the content of the digital audio and video objects. These descriptions are then organized in such a way as to facilitate locating corresponding audio and video segments. The technique employs a combination of Web services and concurrently provides description translation and semantic enhancement. Thorough analysis of the click-data, comparing accesses to the digital content before and after automatic description generation, suggests a significant increase in the number of retrieval items. This outcome, however is not limited to the terms of visibility, but in supporting multilingual access, additionally decreases the number of language barriers.
AB - Every day, the Internet expands as millions of new multimedia objects are uploaded in the form of audio, video and images. While traditional text-based content is indexed by search engines, this indexing cannot be applied to audio and video objects, resulting in a plethora of multimedia content that is inaccessible to a majority of online users. To address this issue, we introduce a technique of automatic, semantically enhanced, description generation for multimedia content. The objective is to facilitate indexing and retrieval of the objects with the help of traditional search engines. Essentially, the technique generates static Web pages automatically, which describe the content of the digital audio and video objects. These descriptions are then organized in such a way as to facilitate locating corresponding audio and video segments. The technique employs a combination of Web services and concurrently provides description translation and semantic enhancement. Thorough analysis of the click-data, comparing accesses to the digital content before and after automatic description generation, suggests a significant increase in the number of retrieval items. This outcome, however is not limited to the terms of visibility, but in supporting multilingual access, additionally decreases the number of language barriers.
KW - publishing multimedia content
KW - spoken content retrieval
KW - spoken lecture processing
UR - http://www.scopus.com/inward/record.url?scp=84875847930&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-37343-5_16
DO - 10.1007/978-3-642-37343-5_16
M3 - Conference contribution
SN - 9783642373428
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 153
EP - 162
BT - Knowledge Engineering, Machine Learning and Lattice Computing with Applications - 16th International Conference, KES 2012, Revised Selected Papers
T2 - 16th International Conference on Knowledge Engineering, Machine Learning and Lattice Computing with Applications, KES 2012
Y2 - 10 September 2012 through 12 September 2012
ER -