An essential stage in any text extraction system is the manual verification of text converted from printed material by OCR.
This verification proves to be the most labor-intensive step in the process. In a system built and deployed at the National Library of Medicine
to automatically extract bibliographic data from scanned biomedical journals, alternative means of validating the text
were considered. This paper describes two such approaches and gives preliminary performance data.