Abstract
Conceptualization seeks to map a short text (i.e., a word or a phrase) to a set of concepts as a mechanism of understanding text. Most of prior research in conceptualization uses human-crafted knowledge bases that map instances to concepts. Such approaches to conceptualization have the limitation that the mappings are not context sensitive. To overcome this limitation, we propose a framework in which we harness the power of a probabilistic topic model which inherently captures the semantic relations between words. By combining latent Dirichlet allocation, a widely used topic model with Probase, a large-scale probabilistic knowledge base, we develop a corpus-based framework for context-dependent conceptualization. Through this simple but powerful framework, we improve conceptualization and enable a wide range of applications that rely on semantic understanding of short texts, including frame element prediction, word similarity in context, ad-query similarity, and query similarity.
Original language | English |
---|---|
Title of host publication | IJCAI International Joint Conference on Artificial Intelligence |
Place of Publication | USA |
Publisher | AAAI Press |
Pages | 2564-2661 |
Edition | Peer Reviewed |
ISBN (Print) | 9781577356332 |
DOIs | |
Publication status | Published - 2013 |
Event | 23rd International Joint Conference on Artificial Intelligence, IJCAI 2013 - Beijing China Duration: 1 Jan 2013 → … http://ijcai.org/papers13/contents.php |
Conference
Conference | 23rd International Joint Conference on Artificial Intelligence, IJCAI 2013 |
---|---|
Period | 1/01/13 → … |
Other | August 3-9 2013 |
Internet address |