A
co-location pattern is a group of spatial features/events that are frequently co-located in the same region. For example, human cases of West
Nile Virus often occur in regions with poor mosquito control and the presence of birds. For co-location pattern mining, previous
studies often emphasize the equal participation of every spatial feature. As a result, interesting patterns involving events
with substantially different frequency cannot be captured. In this paper, we address the problem of
mining co-location patterns with rare spatial features. Specifically, we first propose a new measure called the
maximal participation ratio (maxPR) and show that a co-location pattern with a relatively high maxPR value corresponds to a co-location pattern containing
rare spatial events. Furthermore, we identify a weak monotonicity property of the maxPR measure. This property can help to
develop an efficient algorithm to mine patterns with high maxPR values. As demonstrated by our experiments, our approach is
effective in identifying co-location patterns with rare events, and is efficient and scalable for large-scale data sets.
Keywords Spatial data mining - Co-location patterns - Spatial association rules
A preliminary version of the paper appeared as [13].
The research of the second author is supported in part by Natural Sciences and Engineering Research Council of Canada under
grant number 312194-05 and National Science Foundation of the United States under grant number IIS-0308001. All opinions,
findings, conclusions and recommendations in this paper are those of the authors and do not necessarily reflect the views
of the funding agency.