- url="http://snowball.tartarus.org">Snowball site for more
- information). The Snowball project supplies a large number of stemmers for
- many languages. A Snowball dictionary requires a language parameter to
- identify which stemmer to use, and optionally can specify a stopword file name.
+ for the English language. Snowball now provides stemming algorithms for
+ many languages (see the
Snowball
+ site for more information). Each algorithm understands how to
+ reduce common variant forms of words to a base, or stem, spelling within
+ its language. A Snowball dictionary requires a language parameter to
+ identify which stemmer to use, and optionally can specify a stopword file
+ name that gives a list of words to eliminate.
+ (
PostgreSQL's standard stopword lists are also
+ provided by the Snowball project.)
For example, there is a built-in definition equivalent to
=> \dF
- List of fulltext configurations
+ List of text search configurations
Schema | Name | Description
---------+------+-------------
public | pg |
Information about full text searching objects can be obtained
- in <literal>psql> using a set of commands:
+ in <application>psql> using a set of commands:
- \dF{,d,p}+ PATTERN
+ \dF{d,p,t}+ PATTERN
An optional + produces more details.
The optional parameter PATTERN should be the name of
- a full text searching object, optionally schema-qualified. If
+ a text searching object, optionally schema-qualified. If
PATTERN is not specified then information about all
- visible objects will be displayed. PATTERN can be a
- regular expression and can apply separately to schema
- names and object names. The following examples illustrate this:
+ visible objects will be displayed. PATTERN can be a
+ regular expression and can provide separate patterns
+ for the schema and object names. The following examples illustrate this:
=> \dF *fulltext*
- List of fulltext configurations
+ List of text search configurations
Schema | Name | Description
--------+--------------+-------------
public | fulltext_cfg |
=> \dF *.fulltext*
- List of fulltext configurations
+ List of text search configurations
Schema | Name | Description
----------+----------------------------
fulltext | fulltext_cfg |
- List full text searching configurations (add "+" for more detail)
-
- By default (without PATTERN), information about
- all visible full text configurations will be
- displayed.
+ List text searching configurations (add +> for more detail).
+
=> \dF russian
- List of fulltext configurations
- Schema | Name | Description
-------------+---------+-----------------------------------
- pg_catalog | russian | default configuration for Russian
+ List of text search configurations
+ Schema | Name | Description
+------------+---------+------------------------------------
+ pg_catalog | russian | configuration for russian language
=> \dF+ russian
- Configuration "pg_catalog.russian"
- Parser name: "pg_catalog.default"
- Token | Dictionaries
---------------+-------------------------
- email | pg_catalog.simple
- file | pg_catalog.simple
- float | pg_catalog.simple
- host | pg_catalog.simple
- hword | pg_catalog.russian_stem
- int | pg_catalog.simple
- lhword | public.tz_simple
- lpart_hword | public.tz_simple
- lword | public.tz_simple
- nlhword | pg_catalog.russian_stem
- nlpart_hword | pg_catalog.russian_stem
- nlword | pg_catalog.russian_stem
- part_hword | pg_catalog.simple
- sfloat | pg_catalog.simple
- uint | pg_catalog.simple
- uri | pg_catalog.simple
- url | pg_catalog.simple
- version | pg_catalog.simple
- word | pg_catalog.russian_stem
+Text search configuration "pg_catalog.russian"
+Parser: "pg_catalog.default"
+ Token | Dictionaries
+--------------+--------------
+ email | simple
+ file | simple
+ float | simple
+ host | simple
+ hword | russian_stem
+ int | simple
+ lhword | english_stem
+ lpart_hword | english_stem
+ lword | english_stem
+ nlhword | russian_stem
+ nlpart_hword | russian_stem
+ nlword | russian_stem
+ part_hword | russian_stem
+ sfloat | simple
+ uint | simple
+ uri | simple
+ url | simple
+ version | simple
+ word | russian_stem
\dFd[+] [PATTERN]
- List full text dictionaries (add "+" for more detail).
-
- By default (without PATTERN), information about
- all visible dictionaries will be displayed.
+ List text search dictionaries (add +> for more detail).
=> \dFd
- List of fulltext dictionaries
- Schema | Name | Description
-------------+------------+-----------------------------------------------------------
- pg_catalog | danish | Snowball stemmer for danish language
- pg_catalog | dutch | Snowball stemmer for dutch language
- pg_catalog | english | Snowball stemmer for english language
- pg_catalog | finnish | Snowball stemmer for finnish language
- pg_catalog | french | Snowball stemmer for french language
- pg_catalog | german | Snowball stemmer for german language
- pg_catalog | hungarian | Snowball stemmer for hungarian language
- pg_catalog | italian | Snowball stemmer for italian language
- pg_catalog | norwegian | Snowball stemmer for norwegian language
- pg_catalog | portuguese | Snowball stemmer for portuguese language
- pg_catalog | romanian | Snowball stemmer for romanian language
- pg_catalog | russian | Snowball stemmer for russian language
- pg_catalog | simple | simple dictionary: just lower case and check for stopword
- pg_catalog | spanish | Snowball stemmer for spanish language
- pg_catalog | swedish | Snowball stemmer for swedish language
- pg_catalog | turkish | Snowball stemmer for turkish language
+ List of text search dictionaries
+ Schema | Name | Description
+------------+-----------------+-----------------------------------------------------------
+ pg_catalog | danish_stem | snowball stemmer for danish language
+ pg_catalog | dutch_stem | snowball stemmer for dutch language
+ pg_catalog | english_stem | snowball stemmer for english language
+ pg_catalog | finnish_stem | snowball stemmer for finnish language
+ pg_catalog | french_stem | snowball stemmer for french language
+ pg_catalog | german_stem | snowball stemmer for german language
+ pg_catalog | hungarian_stem | snowball stemmer for hungarian language
+ pg_catalog | italian_stem | snowball stemmer for italian language
+ pg_catalog | norwegian_stem | snowball stemmer for norwegian language
+ pg_catalog | portuguese_stem | snowball stemmer for portuguese language
+ pg_catalog | romanian_stem | snowball stemmer for romanian language
+ pg_catalog | russian_stem | snowball stemmer for russian language
+ pg_catalog | simple | simple dictionary: just lower case and check for stopword
+ pg_catalog | spanish_stem | snowball stemmer for spanish language
+ pg_catalog | swedish_stem | snowball stemmer for swedish language
+ pg_catalog | turkish_stem | snowball stemmer for turkish language
\dFp[+] [PATTERN]
- List full text parsers (add "+" for more detail)
-
- By default (without PATTERN), information about
- all visible full text parsers will be displayed.
+ List text search parsers (add +> for more detail).
+
- => \dFp
- List of fulltext parsers
- Schema | Name | Description
+=> \dFp
+ List of text search parsers
+ Schema | Name | Description
------------+---------+---------------------
pg_catalog | default | default word parser
- (1 row)
=> \dFp+
- Fulltext parser "pg_catalog.default"
- Method | Function | Description
--------------------+---------------------------+-------------
- Start parse | pg_catalog.prsd_start |
- Get next token | pg_catalog.prsd_nexttoken |
- End parse | pg_catalog.prsd_end |
- Get headline | pg_catalog.prsd_headline |
- Get lexeme's type | pg_catalog.prsd_lextype |
-
- Token's types for parser "pg_catalog.default"
- Token name | Description
+ Text search parser "pg_catalog.default"
+ Method | Function | Description
+------------------+----------------+-------------
+ Start parse | prsd_start |
+ Get next token | prsd_nexttoken |
+ End parse | prsd_end |
+ Get headline | prsd_headline |
+ Get lexeme types | prsd_lextype |
+
+ Token types for parser "pg_catalog.default"
+ Token name | Description
--------------+-----------------------------------
blank | Space symbols
email | Email
+
+
+ \dFt[+] [PATTERN]
+
+ List text search templates (add +> for more detail).
+
+
+=> \dFt
+ List of text search templates
+ Schema | Name | Description
+------------+-----------+-----------------------------------------------------------
+ pg_catalog | ispell | ispell dictionary
+ pg_catalog | simple | simple dictionary: just lower case and check for stopword
+ pg_catalog | snowball | snowball stemmer
+ pg_catalog | synonym | synonym dictionary: replace word by its synonym
+ pg_catalog | thesaurus | thesaurus dictionary: phrase by phrase substitution
+
+
+
+
+
- ts_debug type defined as:
+ ts_debug's result type is defined as:
CREATE TYPE ts_debug AS (
SELECT * FROM ts_debug('public.english','The Brightest supernovaes');
- Alias | Description | Token | Dicts list | Lexized token
+ Alias | Description | Token | Dictionaries | Lexized token
-------+---------------+-------------+---------------------------------------+---------------------------------
lword | Latin word | The | {public.english_ispell,pg_catalog.english_stem} | public.english_ispell: {}
blank | Space symbols | | |