Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Camera-based analysis of text and documents: a survey

Jian LiangContact Information, David DoermannContact Information and Huiping LiContact Information

(1) Language and Media Processing Laboratory, Institute for Advanced Computer Studies, University of Maryland, College Park, USA
(2) Applied Media Analysis, Inc., Ellicott City, Maryland, USA

Abstract.  The increasing availability of high-performance, low-priced, portable digital imaging devices has created a tremendous opportunity for supplementing traditional scanning for document image acquisition. Digital cameras attached to cellular phones, PDAs, or wearable computers, and standalone image or video devices are highly mobile and easy to use; they can capture images of thick books, historical manuscripts too fragile to touch, and text in scenes, making them much more versatile than desktop scanners. Should robust solutions to the analysis of documents captured with such devices become available, there will clearly be a demand in many domains. Traditional scanner-based document analysis techniques provide us with a good reference and starting point, but they cannot be used directly on camera-captured images. Camera-captured images can suffer from low resolution, blur, and perspective distortion, as well as complex layout and interaction of the content and background. In this paper we present a survey of application domains, technical challenges, and solutions for the analysis of documents captured by digital cameras. We begin by describing typical imaging devices and the imaging process. We discuss document analysis from a single camera-captured image as well as multiple frames and highlight some sample applications under development and feasible ideas for future development.
Received: 18 December 2003, Accepted: 1 November 2004, Published online: 21 June 2005

Contact InformationJian Liang
Email: lj@umiacs.umd.edu

Contact InformationDavid Doermann
Email: doermann@umiacs.umd.edu

Contact InformationHuiping Li
Email: Huiping.Li@appliedmediaanalysis.com
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this article
Export this article as RIS | Text
 
Referenced by
11 newer articles

  1. Chitrakala Gopalan (2010) Statistical modeling for the detection, localization and extraction of text from heterogeneous textual images using combined feature scheme. Signal Image and Video Processing
    [CrossRef]
  2. Hyung Il Koo (2009) . IEEE Transactions on Image Processing 18(7)
    [CrossRef]
  3. Xu, Changsheng (2008) . IEEE Transactions on Multimedia 10(3)
    [CrossRef]
  4. Jian Liang (2008) . IEEE Transactions on Pattern Analysis and Machine Intelligence 30(4)
    [CrossRef]
  5. Choudary, Chekuri (2007) . IEEE Transactions on Multimedia 9(7)
    [CrossRef]
  6. Brown, Michael S. (2007) . IEEE Transactions on Pattern Analysis and Machine Intelligence 29(11)
    [CrossRef]
  7. Ishida, H. (2008) Generation of templates for low-resolution text recognition using a hypothesis graph. Pattern Recognition and Image Analysis 18(4)
    [CrossRef]
  8. Luong, Hiêp Q. (2008) Robust reconstruction of low-resolution document images by exploiting repetitive character behaviour. International Journal of Document Analysis and Recognition (IJDAR)
    [CrossRef]
  9. Goto, Hideaki (2008) Redefining the DCT-based feature for scene text detection. International Journal of Document Analysis and Recognition (IJDAR)
    [CrossRef]
  10. Coughlan, James (2007) Color Targets: Fiducials to Help Visually Impaired People Find Their Way by Camera Phone. EURASIP Journal on Image and Video Processing 2007
    [CrossRef]
First | Next | Last
Remote Address: 38.107.191.97 • Server: mpweb22
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)