
|
|
|
|
Home / Computers / Software / Information Retrieval / Fulltext
|
| Categories |
|
|
|
|
|
|
|
|
Web Sites
|
- Building Task-Specific Interfaces to High Volume Conversational Data - The philosophy behind http://www.phoaks.com/ People Helping One Another Know Stuff.
www.acm.org/sigchi/chi97/proceedings/paper/lgt.htm
- Cheshire II Project Home Page - Cheshire II is a "Next-Generation Online Catalog and Full-Text Information Retrieval System." It features advanced IR techniques, including support for Boolean and probabilistic 'best match' ranked searching, SGML/XML as the primary data base format, and a client/server architecture that uses the Z39.50 Information Retrieval Protocol.
cheshire.berkeley.edu
- ClusterClick - Full-text indexing of desktop documents for researchers, journalists, and historians with low indexing overhead 13 percent beyond document space. Also displays the most important words from each document. [Windows 95/98]
www.clusterclick.com
- Dataware Technologies - Search engine vendor of BRS/Search, a text based core product, and web enabled products.
www.dataware.com
- dtSearch - Searches all popular file types, with features including hit highlighting, natural language, fuzzy, phonic, boolean, proximity, field, numeric range.
www.dtsearch.com
- Glimpse - A Unix based indexing and query system. It is good for indexing relatively small amounts of data. Different types of indexes allow you to trade off search speed for index size. The default search engine used in Harvest.
glimpse.cs.arizona.edu
- ht://Dig - A complete world wide web indexing and searching system for a small domain or intranet. Source code (GPL).
www.htdig.org
- IB Search Engine - High speed, fully featured, multilingual fielded fulltext engine. Available for many platforms including Solaris, BSD, Linux and Windows-NT.
www.bsn.com/Z39.50
- Inventory of Full-Text Information Retrieval Software Vendors - A project of the IFLA Section on Information Technology
www.ifla.org/VII/s21/p1996/fulltext.htm
- Isearch - Software for indexing and searching text documents, using full text and field based search, relevance ranked results, Boolean queries, and heterogeneous databases. Support for document types such as HTML, SGML, mail folders, and USMARC.
www.etymon.com/Isearch
- ISYS - Web Search Technology.
www.isgweb.com
- LegalProNet.Com - A secure web based service, allowing attorneys to store, manage, search and access their documents through the internet.
www.LegalProNet.com
- Locus Search Engine Software - Locus is a Linux based full text search engine.
power.locus.cz/locus
- Lucene Search Engine - Created by Doug Cutting previously of Apple computer and Xerox PARC, Lucene is a commercial Java based search engine.
jakarta.apache.org/lucene/docs
- Mark-A-Tex - Extraction-indexing software allows users to search, highlight, extract, and index a variety of file formats, with options to print, save, or re-search output.
www.freshstart.com/markatext
- Megaputer Intelligence - Megaputer provides a complete family of unique solutions for Natural Language Text Retrieval and Analysis, Data Mining and Knowledge Discovery in Databases.
www.megaputer.com
- MicroISIS by UNESCO - Non-numerical information storage and retrieval software developed to allow institutions, especially in developing countries, to streamline their information processing activities.
www.unesco.org/webworld/isis/isis.htm
- OpenText - Supplier of information retrieval and collaborative software.
www.opentext.com
- PLS - Personal Library Software is one of the major old line players in the full text search and retrieval market place. They were recently purchased by AOL. AOL is now licensing the PLS search engine free of charge to all users. There is no Linux version, however.
www.pls.com
- SearchExpress - Provides document scanning, optical character recognition and full-text searching.
www.searchexpress.com
- Simple Web Indexing System for Humans - SWISH-Enhanced is a fast, powerful, flexible, free, and easy to use system for indexing collections of Web pages or other text files.
swish-e.org
- SimpleScan Software, Inc. - SimpleScan Software, Inc - providing powerful, cost effective, enterprise wide document management software solutions.
www.SimpleScan.com
- The Combine Harvesting Robot - Combine is an open system for harvesting and threshing (indexing) Internet resources.
www.lub.lu.se/combine
- Thunderstone Software - Thunderstone has a number of full text search related products including their flagship text/relational database, Texis.
www.thunderstone.com
- Ultraseek Server - The tools they use at their site for sale. Demo version available for download.
software.infoseek.com/products/ultraseek/ultratop.htm
- Web Search Engine Software - Create and maintain your own search engine quickly and easily using our perl script and database management tool.
www.web-search.com/websoft.html
- WinOcular Document Imaging - Combined Computer Resources, Inc. (CCR), a software developer and integrator, specializes in customizing and integrating document imaging, COLD report management and workflow software products.
www.winocular.com
- Zebra Z39.50 Search Engine - Zebra is a fulltext and free-text indexing and retrieval system that conforms to ANSI standard Z39.50. It is very good for indexing and searching highly structured data such as MARC records, GILS records, etc. The Zebra server is freely available for noncommercial applications.
www.indexdata.dk/zebra
- ZyLAB Europe - Develops and markets ZyIMAGE, a suite of programs that allows you to efficiently and easily convert paper documents and computer-generated files into full-text searchable collections that can also be accessed from the web, distributed on optical media, or emailed after filtering according to a user's profile. Highlights hits within the document or on the original scanned image.
www.zylab.nl
- Managing Gigabytes - An excellent book about indexing techniques and a software package that implements the algorithms. The software was not updated since 1996 but still contains valuable functions and libraries. Source code (GPL).
www.cs.mu.oz.au/mg
|
|
|
|
|