Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Authorship Analysis in Cybercrime Investigation

Rong ZhengContact Information, Yi QinContact Information, Zan HuangContact Information and Hsinchun ChenContact Information

(4)  Artificial Intelligence Lab, Department of Management Information Systems, The University of Arizona, Tucson, Arizona 85721, USA
Abstract
Criminals have been using the Internet to distribute a wide range of illegal materials globally in an anonymous manner, making criminal identity tracing difficult in the cybercrime investigation process. In this study we propose to adopt the authorship analysis framework to automatically trace identities of cyber criminals through messages they post on the Internet. Under this framework, three types of message features, including style markers, structural features, and content-specific features, are extracted and inductive learning algorithms are used to build feature-based models to identify authorship of illegal messages. To evaluate the effectiveness of this framework, we conducted an experimental study on data sets of English and Chinese email and online newsgroup messages. We experimented with all three types of message features and three inductive learning algorithms. The results indicate that the proposed approach can discover real identities of authors of both English and Chinese Internet messages with relatively high accuracies.

Contact Information Rong Zheng
Email: rong@eller.arizona.edu

Contact Information Yi Qin
Email: yiqin@eller.arizona.edu

Contact Information Zan Huang
Email: zhuang@eller.arizona.edu

Contact Information Hsinchun Chen
Email: hchen@eller.arizona.edu
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Referenced by
1 newer article

  1. Tearle, M. (2008) An algorithm for automated authorship attribution using neural networks. Literary and Linguistic Computing
    [CrossRef]
Remote Address: 38.107.191.105 • Server: MPWEB25
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)