Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Relational Database

Bias-Free Hypothesis Evaluation in Multirelational Domains

Christine KörnerContact Information and Stefan Wrobel1, 2 Contact Information

(1)  Fraunhofer Institut Autonome Intelligente Systeme, Germany
(2)  Dept. of Computer Science III, University of Bonn, Germany
Abstract
In propositional domains using a separate test set via random sampling or cross validation is generally considered to be an unbiased estimator of true error. In multirelational domains previous work has already noted that linkage of objects may cause these procedures to be biased and has proposed corrected sampling procedures. However, as we show in this paper, the existing procedures only address one particular case of bias introduced by linkage. In this paper we therefore introduce generalized subgraph sampling, a sampling procedure based on bin packing, which ensures that test sets are properly chosen to match the probability of reencountering previously seen objects and which includes previous approaches as a special case. Experiments with data from the Internet Movie Database illustrate the performance of our algorithm.

Contact Information Christine Körner
Email: christine.koerner@ais.fraunhofer.de

Contact Information Stefan Wrobel
Email: stefan.wrobel@ais.fraunhofer.de
Fulltext Preview (Small, Large)
Image of the first page of the fulltext


Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.114 • Server: MPWEB26
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)