dpsearch-4.51-27092008

Changes from version 4.50, added to the previous snapshot:

  • SubDocCnt command has been added. Use it to specity the maximal number of sub-documents indexed per one document.
  • SubDocLevel command has been added. Use it to specify maximal nesting level for sub-documents.
  • HrefSection proccessing has been fixed in XML parser.
  • $(url.directory) meta-variable has been added.
  • storedoc.cgi accepts now the name of template in &tmplt= CGI-parameter.
  • Accept: HTTP header has been fixed for case when pattern is used for Content-Type in MIME command.
  • A bug in result merging has been fixed for multi-dbaddr mode.
  • Read the rest of this entry »

    Popularity: 2% [?]

Color maps of Neo PopRank

In addition to maps of the New Zealand’s Internet (TLD .nz) I made similar maps for the Belgian Internet (TLD .be). Each point on a map matching a site in one or another segment of the Internet. The color of point depends on the value of Neo PopRank for corresponding site in search engine 43N 39E. The color scale is similar to coloring of geophysical maps - higher popularity rating of a site corresponds to a “higher” area on the map.
Read the rest of this entry »

Popularity: 6% [?]

dpsearch-4.51-11082008

Changes since version 4.50 are:

  • allin<section>: operator has been added to the search query language.
  • storedoc.cgi takes now document from remote host if it unable to fetch it from stored database.

Read the rest of this entry »

Popularity: 12% [?]

www/dpsearch

FreeBSD’s port www/dpsearch has been updated to the latest version of DataparkSearch released, 4.50.

Popularity: 13% [?]

Text Messaging Outrage

My solution for the Text Messaging Outrage problem of Round 1C of Google Code Jam 2008 contest: Read the rest of this entry »

Popularity: 14% [?]

DataparkSearch 4.50

A new version, 4.50, of DataparkSearch Engine has been released. Changes since version 4.49 are:

  • Default value for PopRankSkipSameSite command has been changed to “yes”.
  • Possible memory leak has been fixed for a sub-document indexed from stored database.
  • The strict option has been added for Section command.
  • A word break has been added for French-style contractions.
  • Big lists of Russian and English synonyms have been added.
  • MaxSiteLevel command accept now a negative argument to group URLs on subdirectory basis.
  • The SkipUnreferred command has been extended to delete unreferred documents if necessary.
  • Del log processing has been fixed in splitter for case when cache log is empty.
  • Some German letters automatically replace by bi-letter combinations in accent-free search mode.
    ß -> ss, ä -> ae, ö -> oe, ü -> ue.
  • SQLite3 support has been added. Use –with-sqlite3 option for configure to enable it.
  • Indexing has been fixed for documents with several versions in different languages. You need to execute “indexer -Erehashstored” command when upgrade.
  • HTML parser understands now <!– google_ad_section_start –>, <!– google_ad_section_start(weight=ignore) –> and
    <!– google_ad_section_end –> comments as tags to include/exclude content for indexing.
  • Relevance calculation has been improved for case when acronyms and abbreviations are used.
  • Popularity: 13% [?]

Numbers

My solution for the Numbers problem of Round 1A of Google Code Jam 2008 contest. It works too long for Large input.
Read the rest of this entry »

Popularity: 15% [?]

Minimum Scalar Product

My solution for the Minimum Scalar Product problem of Round 1A of Google Code Jam 2008:
Read the rest of this entry »

Popularity: 16% [?]

Train Timetable

My solution for the Train Timetable task of Google Code Jam 2008 qualification round:
Read the rest of this entry »

Popularity: 16% [?]

Saving the Universe

My solution for the Saving the Universe task of Google Code Jam 2008 qualification round:
Read the rest of this entry »

Popularity: 17% [?]


Close
E-mail It

Fatal error: Allowed memory size of 268435456 bytes exhausted (tried to allocate 140737488309696 bytes) in Unknown on line 0