Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Adaptive and Interactive Approaches to Document Analysis

George NagyContact Information and Sriharsha VeeramachaneniContact Information

(4)  RPI ECSE DocLab, Troy, NY 12180, USA
(5)  SRA Division, ITC-IRST, Trento, 38050, Italy
This chapter explores three aspects of learning in document analysis: (1) field classification, (2) interactive recognition, and (3) portable and networked applications. Context in document classification conventionally refers to language context, i.e., deterministic or statistical constraints on the sequence of letters in syllables or words, and on the sequence of words in phrases or sentences. We show how to exploit other types of statistical dependence, specifically the dependence between the shape features of several patterns due to the common source of the patterns within a field or a document. This type of dependence leads to field classification, where the features of some patterns may reveal useful information about the features of other patterns from the same source but not necessarily from the same class. We explore the relationship between field classification and the older concepts of unsupervised learning and adaptation. Human interaction is often more effective interspersed with algorithmic processes than only before or after the automated parts of the process. We develop a taxonomy for interaction during training and testing, and show how either human-initiated and machine-initiated interaction can lead to human and machine learning. In a section on new technologies, we discuss how new cameras and displays, web-wide access, interoperability, and essentially unlimited storage provide fertile new approaches to document analysis.

Contact Information George Nagy
Email: nagy@ecse.rpi.edu

Contact Information Sriharsha Veeramachaneni
Email: sriharsha@itc.it
Fulltext Preview (Small, Large)
Image of the first page of the fulltext


Export this chapter
Export this chapter as RIS | Text
 
Referenced by
1 newer article

  1. Gelernter, Judith (2009) Image indexing in article component databases. Journal of the American Society for Information Science and Technology
    [CrossRef]
Remote Address: 38.107.191.114 • Server: MPWEB26
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)