Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Missing Values: Proposition of a Typology and Characterization with an Association Rule-Based Model

Leila Ben Othman19, 20 Contact Information, François Rioult20 Contact Information, Sadok Ben Yahia19 Contact Information and Bruno Crémilleux20 Contact Information

(19)  Department of Computer Science, Faculty of Sciences of Tunis, Tunisia
(20)  GREYC - CNRS UMR, University of Caen Basse-Normandie, France, 6072
Abstract
Handling missing values when tackling real-world datasets is a great challenge arousing the interest of many scientific communities. Many works propose completion methods or implement new data mining techniques tolerating the presence of missing values. It turns out that these tasks are very hard. In this paper, we propose a new typology characterizing missing values according to relationships within the data. These relationships are automatically discovered by data mining techniques using generic bases of association rules. We define four types of missing values from these relationships. The characterization is made for each missing value. It differs from the well-known statistical methods which apply a same treatment for all missing values coming from a same attribute. We claim that such a local characterization enables us perceptive techniques to deal with missing values according to their origins: the way in which we deal with the missing values should depend on their origins (e.g., attribute meaningless w.r.t. other attributes, missing values depending on other data, missing values by accident). Experiments on a real-world medical dataset highlight the interests of such a characterization.

Keywords  Data mining - missing values - association rules


Contact Information Leila Ben Othman
Email: lbenothm@info.unicaen.fr

Contact Information François Rioult
Email: francois.Rioult@info.unicaen.fr

Contact Information Sadok Ben Yahia
Email: sadok.benyahia@fst.rnu.tn

Contact Information Bruno Crémilleux
Email: Bruno.Cremilleux@info.unicaen.fr
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.110 • Server: mpweb16
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)