AN APPROACH FOR DISCOVERING KEYWORDS FROM SPANISH TWEETS USING WIKIPEDIA

Authors:
Daniel AYALA, Juan C. ROLDÁN, David RUIZ, Fernando O. GALLEGO

DOI:
10.14201/ADCAIJ2015427388

Volume:
Regular Issue 4 (2), 2015

Keywords: 
Twitter; Social Media Analysis; Wikipedia; Keywords Discovery

Most approaches to keyword discovery in microblogging messages, including those from Twitter, rely on statistical and lexical information about the words that compose the text. Because the messages are short, they offer little context and words rarely co-occur, which can be problematic. In this paper, we present a new approach to keyword discovery from Spanish tweets that adds context information using Wikipedia as a knowledge base. We present four different ways to use Wikipedia and two ways to rank the new keywords. We tested these strategies on more than 60,000 Spanish tweets, measuring performance and analyzing the particularities of each strategy.

JCR

Position in 2022 Journal Citation Indicator (JCI) Ranking:
Category COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

