tinycc

github-repos/tinycc

Fork 0

mirror of https://github.com/mirror/tinycc.git synced 2025-01-09 04:50:07 +08:00

Commit Graph

Author	SHA1	Message	Date
Petr Skocik	704c8163fd	re-add accidentally deleted printf("\n"); to tests2/97*.c	2021-01-18 08:32:50 +01:00
Petr Skocik	ffb95c2e0c	Better handling of UCNs in strings As the standard requires, take 4 hex digits after the \u opener of a Universal Character Name, or take 8 hex digits after \U, but reject smaller counts and don't consume more (https://port70.net/~nsz/c/c11/n1570.html#6.4.3, https://port70.net/~nsz/c/c99/n1256.html#6.4.3). The unicode codepoint used to get truncated to 1 byte. Now it gets expanded into UTF-8, matching gcc & clang behavior on Linux. TODO: Universal character names should also be supported in identifiers, as in, e.g., char \u010dau_sv\u011bte[]="čau_světe";	2021-01-18 00:49:24 +01:00
Zhang Boyang	978d1ecce0	Add test case for wide char handling in wide string literal	2017-09-10 16:50:19 +08:00

Author

SHA1

Message

Date

Petr Skocik

704c8163fd

re-add accidentally deleted printf("\n"); to tests2/97*.c

2021-01-18 08:32:50 +01:00

Petr Skocik

ffb95c2e0c

Better handling of UCNs in strings

As the standard requires, take 4 hex digits after the \u opener of a
Universal Character Name, or take 8 hex digits after \U, but reject
smaller counts and don't consume more (https://port70.net/~nsz/c/c11/n1570.html#6.4.3,
https://port70.net/~nsz/c/c99/n1256.html#6.4.3).

The unicode codepoint used to get truncated to 1 byte. Now it gets expanded into UTF-8,
matching gcc & clang behavior on Linux.

TODO: Universal character names should also be supported in identifiers,
as in, e.g., char \u010dau_sv\u011bte[]="čau_světe";

2021-01-18 00:49:24 +01:00

Zhang Boyang

978d1ecce0

Add test case for wide char handling in wide string literal

2017-09-10 16:50:19 +08:00

3 Commits