Documentation Source Text

Check-in [ad0172e592]
Login

Many hyperlinks are disabled.
Use anonymous login to enable hyperlinks.

Overview
Comment:Merge fixes off of the 3.11 branch.
Downloads: Tarball | ZIP archive
Timelines: family | ancestors | descendants | both | trunk
Files: files | file ages | folders
SHA1: ad0172e5922c0f076fe40177ba0c5391c1cf1ed6
User & Date: drh 2016-03-22 17:23:44.597
Context
2016-03-23
11:51
Update testing statistics. (check-in: ad4c9929f2 user: drh tags: trunk)
2016-03-22
17:23
Merge fixes off of the 3.11 branch. (check-in: ad0172e592 user: drh tags: trunk)
17:23
Fix typos. Add sqlite3_system_errno() to the change log. (check-in: d5d9bb664f user: drh tags: trunk)
2016-03-21
20:02
Fix the description of the case folding performed by the unicode61 tokenizer in FTS3. (check-in: 37a01760c6 user: drh tags: branch-3.11)
Changes
Unified Diff Ignore Whitespace Patch
Changes to pages/fts3.in.
2200
2201
2202
2203
2204
2205
2206
2207
2208
2209
2210
2211
2212
2213
2214
  processing is required, for example to implement stemming or
  discard punctuation, this can be done by creating a tokenizer
  implementation that uses the ICU tokenizer as part of its implementation.

<tcl>hd_fragment unicode61 unicode61</tcl>
<p>
  The "unicode61" tokenizer is available beginning with SQLite [version 3.7.13].
  Unicode61 works very much like "simple" except that it does full unicode
  case folding according to rules in Unicode Version 6.1 and it recognizes
  unicode space and punctuation characters and uses those to separate tokens.
  The simple tokenizer only does case folding of ASCII characters and only
  recognizes ASCII space and punctuation characters as token separators.

<p>
  By default, "unicode61" also removes all diacritics from Latin script







|







2200
2201
2202
2203
2204
2205
2206
2207
2208
2209
2210
2211
2212
2213
2214
  processing is required, for example to implement stemming or
  discard punctuation, this can be done by creating a tokenizer
  implementation that uses the ICU tokenizer as part of its implementation.

<tcl>hd_fragment unicode61 unicode61</tcl>
<p>
  The "unicode61" tokenizer is available beginning with SQLite [version 3.7.13].
  Unicode61 works very much like "simple" except that it does simple unicode
  case folding according to rules in Unicode Version 6.1 and it recognizes
  unicode space and punctuation characters and uses those to separate tokens.
  The simple tokenizer only does case folding of ASCII characters and only
  recognizes ASCII space and punctuation characters as token separators.

<p>
  By default, "unicode61" also removes all diacritics from Latin script