The analysis of blogs is emerging as an exciting new area in the text processing field which attempts to harness and exploit
the vast quantity of information being published by individuals. However, their particular characteristics (shortness, vocabulary
size and nature, etc.) make it difficult to achieve good results using automated clustering techniques. Moreover, the fact
that many blogs may be considered to be narrow domain means that exploiting external linguistic resources can have limited
value. In this paper, we present a methodology to improve the performance of clustering techniques on blogs, which does not
rely on external resources. Our results show that this technique can produce significant improvements in the quality of clusters
produced.