Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Importing Documents and Metadata into Digital Libraries: Requirements Analysis and an Extensible Architecture

Ian H. WittenContact Information, David BainbridgeContact Information, Gordon PaynterContact Information and Stefan BoddieContact Information

(6)  Computer Science Department, University of Waikato, New Zealand
(7)  Universtiy of California Science Library, Riverside, California, USA
Abstract
Flexible digital library systems need to be able to accept, or “import,” documents and metadata in a variety of forms, and associate metadata with the appropriate documents. This paper analyzes the requirements of the import process for general digital libraries. The requirements include (a) format conversion for source documents, (b) the ability to incorporate existing conversion utilities, (c) provision for metadata to be specified in the document files themselves and/or in separate metadata files, (d) format conversion for metadata files, (e) provision for metadata to be computed from the document content, and (f) flexible ways of associating metadata with documents or sets of documents. We argue that these requirements are so open-ended that they are best met by an extensible architecture that facilitates the addition of new document formats and metadata facilities to existing digital library systems. An implementation of this architecture is briefly described.

Contact Information Ian H. Witten
Email: ihw@cs.waikato.ac.nz

Contact Information David Bainbridge
Email: davidb@cs.waikato.ac.nz

Contact Information Gordon Paynter
Email: gordon.paynter@ucr.edu

Contact Information Stefan Boddie
Email: sjboddie@cs.waikato.ac.nz
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.106 • Server: mpweb01
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)