Artifact 0113d3acf13429e6dc38e0647d1bc71211c31a4d:
- File ext/fts3/fts3_unicode2.c — part of check-in [6cfd9af5] at 2013-06-05 16:17:21 on branch trunk — Up until now the fts4 "unicode61" tokenizer has treated all private use codepoints except the first and last of each of the three ranges as alphanumeric (eligible to be part of tokens). This commit fixes this so that all private use codepoints are considered alphanumeric. In other words, it fixes the handling of codepoints 0xE000, 0xF8FF, 0xF0000, 0xFFFFD, 0x100000 and 0x10FFFD. (user: dan size: 16670) [more...]
A hex dump of this file is not available. Please download the raw binary file and generate a hex dump yourself.