Volume 14, Number 1, 93-130, DOI: 10.1007/s10664-008-9095-3

An empirical analysis of information retrieval based concept location techniques in software comprehension

Brendan Cleary, Chris Exton, Jim Buckley and Michael English

View Related Documents

Abstract

Concept location, the problem of associating human oriented concepts with their counterpart solution domain concepts, is a fundamental problem that lies at the heart of software comprehension. Recent research has attempted to alleviate the impact of the concept location problem through the application of methods drawn from the information retrieval (IR) community. Here we present a new approach based on a complimentary IR method which also has a sound basis in cognitive theory. We compare our approach to related work through an experiment and present our conclusions. This research adapts and expands upon existing language modelling frameworks in IR for use in concept location, in software systems. In doing so it is novel in that it leverages implicit information available in system documentation. Surprisingly, empirical evaluation of this approach showed little performance benefit overall and several possible explanations are forwarded for this finding.

Keywords  Information retrieval - Software comprehension - Empirical analysis

Editors: Tim Menzies and Letha Etzkorn

Fulltext Preview

Image of the first page of the fulltext document