Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Detecting Temporal Trends of Technical Phrases by Using Importance Indices and Linear Regression

Hidenao Abe23 Contact Information and Shusaku Tsumoto23 Contact Information

(23)  Department of Medical Informatics, Shimane University, School of Medicine, 89-1 Enya-cho, Izumo, Shimane 693-8501, Japan
Abstract
In this paper, we propose a method for detecting temporal trends of technical terms based on importance indices and linear regression methods. In text mining, importance indices of terms such as simple frequency, document frequency including the terms, and tf-idf of the terms, play a key role for finding valuable patterns in documents. As for the documents, they are often published daily, monthly, annually, and irregularly for each purpose. Although the purposes of each set of documents are not changed, roles of terms and the relationship among them in the documents change temporally. In order to detect such temporal changes, we combined a method to extract terms, importance indices of terms, and trend identification based on linear regression analysis. Empirical results show that our method detected emergent and subsiding trends of extracted terms in a corpus of a research domain. By comparing this method with the existing burst detection method, we investigated the trend of phrases consisting of several burst words in the titles of AAAI and IJCAI.

Keywords  Text Mining - Trend Detection - TF-IDF - Jaccard Coefficient - Linear Regression


Contact Information Hidenao Abe
Email: abe@med.shimane-u.ac.jp

Contact Information Shusaku Tsumoto
Email: tsumoto@computer.org
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.113 • Server: MPWEB26
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)