Documentation Source Text

Check-in [569262e571]
Login

Many hyperlinks are disabled.
Use anonymous login to enable hyperlinks.

Overview
Comment:Fix "asterix" typo in fts5.in.
Downloads: Tarball | ZIP archive
Timelines: family | ancestors | descendants | both | trunk
Files: files | file ages | folders
SHA3-256: 569262e57133a155e6d6c3c8bb5afd8fa2ecfc184c134589e1d0940a18d3170c
User & Date: dan 2019-02-11 13:24:30.845
Context
2019-02-12
12:27
Clarify how the SQLITE_OK_LOAD_PERMANENTLY return value should be used. (check-in: 8182293dd8 user: drh tags: trunk)
2019-02-11
13:24
Fix "asterix" typo in fts5.in. (check-in: 569262e571 user: dan tags: trunk)
13:21
Add documentation for the fts3/4/5 remove_diacritic options. (check-in: 34f973966a user: dan tags: trunk)
Changes
Unified Diff Ignore Whitespace Patch
Changes to pages/fts5.in.
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
  "2", then diacritics are correctly removed from all Latin characters.

  <tr><td> categories
  <td>This option may be used to modify the set of Unicode general categories
  that are considered to correspond to token characters. The argument must
  consist of a space separated list of two-character general category
  abbreviations (e.g. "Lu" or "Nd"), or of the same with the second character
  replaced with an asterix ("*"), interpreted as a glob pattern. The default
  value is "L* N* Co".

  <tr><td> tokenchars
  <td> This option is used to specify additional unicode characters that 
  should be considered token characters, even if they are white-space or
  punctuation characters according to Unicode 6.1. All characters in the
  string that this option is set to are considered token characters.







|







600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
  "2", then diacritics are correctly removed from all Latin characters.

  <tr><td> categories
  <td>This option may be used to modify the set of Unicode general categories
  that are considered to correspond to token characters. The argument must
  consist of a space separated list of two-character general category
  abbreviations (e.g. "Lu" or "Nd"), or of the same with the second character
  replaced with an asterisk ("*"), interpreted as a glob pattern. The default
  value is "L* N* Co".

  <tr><td> tokenchars
  <td> This option is used to specify additional unicode characters that 
  should be considered token characters, even if they are white-space or
  punctuation characters according to Unicode 6.1. All characters in the
  string that this option is set to are considered token characters.