
Abstract

The University of Trier maintains the DBLP (Digital Bibliography & Library Project) Computer Science Bibliography, which offers bibliographic information about more than 870,000 scientific publications. This paper describes the DBLP WebCrawler, a meta search engine that searches the web for full-text publications in PDF format for each DBLP entry. Various search engines such as Google and Yahoo are used as data sources. The retrieved documents are then analysed and ranked according to their relevance. The proposed system differs from systems like CiteSeer in that the DBLP WebCrawler builds upon metadata and tries to find relevant full texts, whereas CiteSeer mainly starts with full texts and extracts metadata.
