Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
|
 |
A Search Engine for Indian Languages
| |
|
A Search Engine for Indian Languages
Ashwani Mujoo7 , Manoj Kumar Malviya7 , Rajat Moona7 and T V Prabhakar7 
| (7) |
Department of Computer Science and Engineering, Indian Institute of Technology, Kanpur, India |
Abstract
There is a great need for a search engine for web documents written in languages other than English. In this paper, we describe
the design issues of a Search Engine for Indian Languages. We also describe the implementation of two Search Engines for Indian
Languages, one for documents in ISCII and the other for documents in Unicode. The software allows full-text indexing and searching
of a database of documents written in any Brahmi-based Indian Language. The Search engine gathers the HTML documents from the web, indexes and compresses the documents and then searches for the given keywords. The main features
of the search engines are phonetic tolerance, morphological analysis, compression and indexing, leading and trailing substring
matches for keywords, search through compressed documents. The implementation includes a search server architecture, which
can be accessed from a WYSIWYG front end, which is a Java swing applet. Performance results show that the search engine achieves
a compression of almost 80 percent and has an appreciable precision and recall.
Fulltext Preview (Small, Large)
 References secured to subscribers.
|
|
|
|
|
|