Sphider-plus version 4.2021b - The PHP Search Engine

All required information.

[ Change Log Summary ]


- Actual release:    4.2021b

- Former versions:

          Version 4.2021a

          Version 3.2020d       Version 3.2020c

          Version 3.2020b       Version 3.2020a

          Version 3.2019c

          Version 3.2019b       Version 3.2019a

          Version 3.2018b       Version 3.2018a

          Version 3.2017b        Version 3.2017a


          Version 3.2016d        Version 3.2016c

          Version 3.2016b        Version 3.2016a


          Version 3.2015e        Version 3.2014c

          Version 3.2015d        Version 3.2014b

          Version 3.2015c        Version 3.2014a

          Version 3.2015b        Version 3.2013b

          Version 3.2015a        Version 3.2013a



- Older versions:


          Version 2.9          Version 1.9

          Version 2.8          Version 1.8

          Version 2.7          Version 1.7

          Version 2.6          Version 1.6

          Version 2.5          Version 1.5

          Version 2.4          Version 1.4

          Version 2.3          Version 1.3

          Version 2.2          Version 1.2

          Version 2.1          Version 1.1

          Version 2.0          Version 1.0


Version v.2.1

Release date: September 03, 2009

In front of Sphider-plus version 2.0 the following items have been added / modified:


New item in Admin settings:

Perform a segmentation of Chinese and Korean text during index / re-index procedure.

Will divide phrases like 帽子和服装 into the base words 帽子 and 和 and 服装 ,

so that all will become searchable.

Valid for Chinese sites with charset: GB2312, GBK and GB18030

Valid for Korean sites with charset: EUC-KR and ISO10646-1933


New item in Admin setting:

Index password protected sites.

If enabled, Sphider-plus will index also .htacces protected sites (basic authorization).

Up to 3 different zones could be registered in Admin settings and will be indexed.


New options in Admin settings:

- Index framesets

- Index iframes

If enabled, both options will index html and image frames.

Not available for dynamically reloaded frames (e.g. by JavaScript).


New item in Admin setting:

Enable to decode BBCode during index / re-index into standard HTML

If selected, code like


will be converted to

<;a href="http://abc.de">;<;strong>abc.de<;/strong>;<;/a>;


New item in Admin settings:

Enable to decode entity coded sites into standard HTML characters.

If selected, entity coded text like Čapek and D&#246;hl

will be converted to Čapek and Döhl


New options in Admin settings:

- Use whitelist in order to enable index / re-index only those pages

  that include any the words in whitelist

- Use whitelist in order to enable index / re-index only those pages

  that include all the words in whitelist


Improved 'Follow sitemap.xml' procedure:

If <;sitemapindex . . >; is detected in a sitemap.xml file, and if multiple Sitemap files are available,

Sphider-plus will process the secondary Sitemaps and extract all links for index / re-index.

Also gzip-compressed files (Index Sitemap files as well as the Sitemap files) will be processed.


Improved index / re-index procedure:

If charset of a site to be indexed is undetectable, because it is not HTML standard conform

or missing HTML tag, the index procedure will no longer been interrupted.

Preferred charset as defined in Admin settings will be used for the involved link.


Improved index / re-index procedure:

If Sphider-plus is relocated by http 301 or 302, links found at the relocated site

will also be followed.


For new sites, as per default the spider-depth is now set to 'full'.


Improved UTF-8 support:

Conversion into UTF-8 charset now is obligatory.


Improved index and re-index procedures for Cyrillic and Greek languages to support upper and lower case characters.


Bug fixed that prevented to continue suspended index procedures.


'Continue suspended index procedure' enabled now also for 'Re-index' and 'Erase & Re-index'.


Improved search functions for search with wildcards and for strict search.


Improved category search:

- Selected category name is highlighted in headline of result listing.

- If activated in Admin setting, categories which would also deliver results

   are presented individual for each result link in the result listing.

- If search in category is performed, sub-categories which would also deliver results

   are presented individual for each result link in the result listing.


If media search is enabled in Admin settings, text search with wildcards will also present media results.


Improved search utility:

Queries with and without hyphen will deliver the same results,

so that queries like 'make-up' and 'make up' do have equal rights.

The same behaviour is performed for queries containing dots, commas and question marks.


Maximum length for site and link URLs to be indexed is now increased to 1024 characters.


Maximum length for link 'title' increased to 255 characters.


Code rewritten to cooperate with PHP 5.3.x


Error corrected de-language file. Thanks to Carl D. Erling



Involved files that have been modified / added for this release:

Nearly all, because of PHP 5.3 compatibility.


In order to enable the two new items:

- For new sites, as per default, the spider-depth is set to 'full'.

- URLs will be accepted for a length of up to 1024 characters.

this release requires the installation of new table sets for each database.