The aim of our research is to produce and assess short summaries that aid users’ relevance judgements, for example on a search
engine results page. In this paper we present a new metric for measuring summary quality, based on representativeness and
judgeability, and compare the summary quality of our system to that of Google. We discuss the basis for constructing our evaluation
methodology in contrast to previous relevant open evaluations, arguing that the elements which make up an evaluation methodology
(the tasks, data and metrics) are interdependent, and that the way in which they are combined is critical to the effectiveness of
the methodology. The paper discusses the relationship between these three factors as implemented in our own work, as well
as in the SUMMAC, MUC and DUC evaluations.