|Commercial, no source code available
|GPL, open source
|Number of documents indexed and pricing
- up to 50,000 for $1,995
- up to 100,000 for 2,995
- up to 200,000 for $5,995
- up to 300,000 for $8,995
|up to several millions, depending of hardware used. Free software
|File formats indexing
|220 different file formats, including HTML, PDF and Microsoft Office documents.
|Plain text, HTML, XML, MP3, GIF + any other with external parsers
|25 language groups, can segment sentences in Chinese, Japanese, Korean and Thai.
|Accessing files via
|HTTP, HTTPS, networked file systems.
|HTTP, HTTPS, FTP, NNTP, HTTP Proxy, local file system, htdb:// scheme for SQL databases.
|Accessing content protected by
|HTTP Basic, NTLM v1 and v2, LDAP
|Yes, each collection may be divided onto subsections (tags and categories)
|Integrate search results into your sites's look and feel
|users XSLT style sheet, export results in XML
|own template language to produce result pages in any text based format.
|Display key attributes of search results
|meta tags, specified HTML attributes, specified XML tags, regex excerpts from text (all those so called the sections)
|Filter results through meta tags
|Yes, + through any section or combination of sections.
|Assign different weights for meta tags/sections
|Integration with Google Desktop and Google Toolbar for Enterprise
|Excluding pages from the search index
|Cached versions of documents
|Number Range Search
|Date Range Search
|Sort search results by
|Revevence, Date, Popularity, Importance and by all those in reverse order
- Total number of searches and unique queries
- Number of searches on particular day
- Average number of searches at different hours of the day
- Top 100 keywords and queries
|No reports. Each query can be tracked along with all search parameters for futher processing.
|Automaticaly sitemap construction
|OneBox for Enterprise
|Customer support site; email support; guaranteed replacements in the case of any hardware failure
|A phorum on project's site
|Addendum, 15 Mar. 2007
|Automatic document summarization
|Yes, the Summary Extraction Algorithm
|HTTP Content negotiation for specified languages
|Link analysis algorithm
|Yes, the Neo PopRank and the Goo PopRank
//Google Mini features, Google Mini Administrator features, DataparkSearch.