View Related Documents

Abstract

In many cases synthetic data is more suitable than authentic data for the testing and training of fraud detection systems. At the same time synthetic data suffers from some drawbacks originating from the fact that it is indeed synthetic and may not have the realism of authentic data. In order to counter this disadvantage, we have developed a method for generating synthetic data that is derived from authentic data. We identify the important characteristics of authentic data and the frauds we want to detect and generate synthetic data with these properties.

Keywords  fraud detection - synthetic test data - data generation methodology - user simulation - system simulation

The author is also with Telia Research AB, SE-123 86 Farsta, Sweden

Fulltext Preview

Image of the first page of the fulltext document