mirrors/Nim - Nim - Kyren's Code

mirrors/Nim

Fork 0

mirror of https://github.com/nim-lang/Nim.git synced 2026-02-12 22:33:49 +00:00

Commit Graph

Author SHA1 Message Date

Author	SHA1	Message	Date
Alexander Kernozhitsky	b172b34a24	Treat CJK Ideographs as letters in `isAlpha()` (#23651 ) Because of the bug in `tools/parse_unicodedata.nim`, CJK Ideographs were not considered letters in `isAlpha()`, even though they have category Lo. This is because they are specified as range in `UnicodeData.txt`, not as separate characters: ``` 4E00;<CJK Ideograph, First>;Lo;0;L;;;;;N;;;;; 9FEF;<CJK Ideograph, Last>;Lo;0;L;;;;;N;;;;; ``` The parser was not prepared to parse such ranges and thus omitted almost all CJK Ideographs from consideration. To fix this, we need to consider ranges from `UnicodeData.txt` in `tools/parse_unicodedata.nim`.	2024-05-29 06:42:07 +02:00
Gianmarco	4c38569229	Change unicode lookup tables to have int32 elements to support platforms where sizeof(int) < 4 (#23433 ) Fixes an issue that comes up when using strutils.`%` or any other strutils/strformat feature that uses the unicode lookup tables behind the scenes, on systems where ints are than 32-bit wide. Tested with: ```bash ./koch test cat lib ``` Refer to the discussion in #23125.	2024-03-25 10:59:48 +01:00
Miran	aeb30a72c0	update unicode.nim (#10921 ) * update unicode.nim * create a script to create the needed unicode data * make unicode.nim compatible with Unicode v12.0.0 * slightly improve unicode.nim documentation (fixes #4795) * more documentation	2019-03-31 08:36:04 +02:00

Alexander Kernozhitsky

b172b34a24

Treat CJK Ideographs as letters in isAlpha() (#23651 )

Because of the bug in `tools/parse_unicodedata.nim`, CJK Ideographs were
not considered letters in `isAlpha()`, even though they have category
Lo. This is because they are specified as range in `UnicodeData.txt`,
not as separate characters:

```
4E00;<CJK Ideograph, First>;Lo;0;L;;;;;N;;;;;
9FEF;<CJK Ideograph, Last>;Lo;0;L;;;;;N;;;;;
```

The parser was not prepared to parse such ranges and thus omitted almost
all CJK Ideographs from consideration.

To fix this, we need to consider ranges from `UnicodeData.txt` in
`tools/parse_unicodedata.nim`.

2024-05-29 06:42:07 +02:00

Gianmarco

4c38569229

Change unicode lookup tables to have int32 elements to support platforms where sizeof(int) < 4 (#23433 )

Fixes an issue that comes up when using strutils.`%` or any other
strutils/strformat feature that uses the unicode lookup tables behind
the scenes, on systems where ints are than 32-bit wide.

Tested with:

```bash
./koch test cat lib
```

Refer to the discussion in #23125.

2024-03-25 10:59:48 +01:00

Miran

aeb30a72c0

update unicode.nim (#10921 )

* update unicode.nim

* create a script to create the needed unicode data
* make unicode.nim compatible with Unicode v12.0.0
* slightly improve unicode.nim documentation (fixes #4795)

* more documentation

2019-03-31 08:36:04 +02:00

3 Commits