Overview
Comment: | Merge fixes off of the 3.11 branch. |
---|---|
SHA1: | ad0172e5922c0f076fe40177ba0c5391 |
User & Date: | drh 2016-03-22 17:23:44.597 |
Context
2016-03-23
11:51 | Update testing statistics. (check-in: ad4c9929f2 user: drh tags: trunk) |
2016-03-22
17:23 | Merge fixes off of the 3.11 branch. (check-in: ad0172e592 user: drh tags: trunk) |
17:23 | Fix typos. Add sqlite3_system_errno() to the change log. (check-in: d5d9bb664f user: drh tags: trunk) |
2016-03-21
20:02 | Fix the description of the case folding performed by the unicode61 tokenizer in FTS3. (check-in: 37a01760c6 user: drh tags: branch-3.11) |
Changes
Changes to pages/fts3.in.
︙
processing is required, for example to implement stemming or
discard punctuation, this can be done by creating a tokenizer implementation
that uses the ICU tokenizer as part of its implementation.

<tcl>hd_fragment unicode61 unicode61</tcl>
<p>
The "unicode61" tokenizer is available beginning with SQLite [version 3.7.13].
Unicode61 works very much like "simple" except that it does simple unicode
case folding according to rules in Unicode Version 6.1 and it recognizes
unicode space and punctuation characters and uses those to separate tokens.
The simple tokenizer only does case folding of ASCII characters and only
recognizes ASCII space and punctuation characters as token separators.

<p>
By default, "unicode61" also removes all diacritics from Latin script
︙
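
The difference between the "simple" and "unicode61" tokenizers described in the changed paragraph can be checked with a short script. This is a minimal sketch, not part of the check-in. It assumes Python's standard sqlite3 module is linked against an SQLite build that includes FTS4 and the unicode61 tokenizer. The helper name fts_match and the sample strings are hypothetical, chosen only for illustration.

```python
import sqlite3
from typing import Optional


def fts_match(tokenizer: str, stored: str, query: str) -> Optional[bool]:
    """Report whether `query` matches `stored` in an FTS4 table using `tokenizer`.

    Returns None when the virtual table cannot be created, which is assumed
    to mean that FTS4 or the named tokenizer is missing from this build.
    """
    con = sqlite3.connect(":memory:")
    try:
        try:
            con.execute(
                f"CREATE VIRTUAL TABLE t USING fts4(body, tokenize={tokenizer})"
            )
        except sqlite3.OperationalError:
            return None
        con.execute("INSERT INTO t(body) VALUES (?)", (stored,))
        (count,) = con.execute(
            "SELECT count(*) FROM t WHERE body MATCH ?", (query,)
        ).fetchone()
        return count > 0
    finally:
        con.close()


if __name__ == "__main__":
    samples = [
        ("ASCII case", "HELLO", "hello"),
        # Cyrillic upper case vs. lower case: needs Unicode case folding.
        ("non-ASCII case", "МОСКВА", "москва"),
        # U+00A0 (no-break space) is a separator only under Unicode rules.
        ("non-ASCII separator", "foo\u00a0bar", "bar"),
    ]
    for tokenizer in ("simple", "unicode61"):
        for label, stored, query in samples:
            print(tokenizer, label, fts_match(tokenizer, stored, query))
```

On a build where both tokenizers are available, the ASCII sample should match under either tokenizer, while the two non-ASCII samples should match only under "unicode61". Cyrillic text is used for the case-folding sample so that the default diacritic removal does not affect the result.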