Fullscreen
[Show/Hide Right Column]

Search Index


The search engine in Tiki was rebuilt for Tiki7 this documentation applies only to versions 7 or later.


The search capabilities in tiki, such as those used by PluginList or Unified Search rely on a search index.

Tiki can support multiple search engines internally. Each of those will have different capabilities and limitations. The default engine should provide capabilities good enough for small and medium sites. Larger sites may need additional infrastructure to get the most performance.

Lucene (PHP Implementation)

The default implementation is based on Zend_Search_Lucene, a PHP implementation of the Java Lucene index engine. The engine has no external dependencies and can run on all hosts. However, some configuration may be required to reach acceptable performance.

The time required to build a complete index will vary depending on the content of the site. As a reference,

  • doc.tiki.org (this site), with over 1400 pages, reindexes in around 3 minutes
  • themes.tiki.org, with some pages and several hundreds of forum posts, reindexes in around 15 seconds
  • corporate intranets with several gigabytes of data in file galleries can take over an hour

However, incremental indexing can be used, in which case a complete re-index is only required periodically or upgrades.

Alexander Veremy, the author of the component provided some insight on how to adjust the parameters.

Fields


Below is a matrix between the fields and the object types.

Legend:
  • Tokenized, as in decomposed in words for full text search
  • Sortable

TokenizedSortablewiki pageforum postblog postarticlefiletrackeritemsheetcommentAvailable in Tiki version
Generic
object_type XXXXXXXXX7
object_id XXXXXXXXX7
titleXXXXXXXXX?7
language XX//X/?//7
modification_date XXXXXXXXX7
contributors XXXXXXXX7
descriptionX X XX X 7
contentsX XXXXXXXX7
Specific
wiki_content X 7
wiki_uptodateness XX 7
post_content X X7
parent_thread_id (not to be confused with parent_object_id) X 8
blog_id X X 7
blog_excerpt X 7
blog_content X 7
topic_id X X 7
article_content X 7
article_topline X 7
article_subtitle X 7
article_author X 9
gallery_id X X 7
filename_id X X 7
filetypeXX X 7
file_comment X 7
file_content X 7
tracker_id X X 7
tracker_status X X 7
tracker_field_IDX? X 7
sheet_content X 7
comment_contentX X7
Internal
view_permission X X X 7
parent_object_type X XXXXX X7
parent_object_id X XXXXX X7
parent_view_permission XXXXX X7
global_view_permission X7
hash X 7
url X X7
Global
categories XXXXXXXX7
deep_categories XXXXXXXX7
allowed_groups XXXXXXXX7
freetags XXXXXXXX7
freetags_text XXXXXXXX7
adv_rating_ID XXXXXXXXX7
comment_count XXXXXXXXX8
relations XXXXXXXX8
attachments XXXXXXXX7
attachment_contentsX XXXXXXXX7
geo_located XXXXXXXX9
geo_location XXXXXXXX9


X Available
/ Static value
? Depends on the data


The indexing for tracker fields will vary depending on the field type. As a general rule, tracker_field_ID will be used as the field and will be sortable. However, there are a few exceptions:
  • Image and File fields are not indexed
  • TextArea is not sortable
  • Multilingual fields are indexed as multiple fields:
    • The normal one contains all languages
    • tracker_field_ID_lang contains one language only (tracker_field_12_fr for example)
  • Rating and related fields store as multiple fields
    • tracker_field_ID contains the average
    • tracker_field_ID_sum contains the vote totals
    • tracker_field_ID_count contains the number of votes
  • If a language field is set for the tracker item, that language is indexed as the item language.

Some used in buildQuery/tiki-searchindex.php (need explanation on whether these are real fields or just helpers):
type: refers to object_type
deep: if this is set, categories will be considered deep-categories
autocomplete: Will search for items with title starting with this

Related:

alias


Contributors to this page: Nelson810 points  , Marc Laporte9146 points  , Louis-Philippe Huberdeau1046 points  and mlpvolt4388 points  .
Page last modified on Friday 27 April, 2012 14:34:09 UTC by Nelson810 points .
The content on this page is licensed under the terms of the Creative Commons Attribution-ShareAlike License.

Site Language

Reference Guide

Keywords

These keywords serve as "hubs" for navigation within the Tiki documentation. They correspond to development keywords (bug reports and feature requests):



Tiki Newsletter

Delivered fresh to your email inbox!
Newsletter subscribe icon
Don't miss major announcements and other news!
Contribute to Tiki