Add full Unicode support #89

jtojnar · 2024-04-14T15:50:36Z

dconf dump uses g_variant_print, which prints most Unicode characters verbatim. dconf2nix would read those and then use show to serialize the parsed strings. But show encodes Unicode characters as a decimal number preceded by a backslash, (e.g. \129315), which means nothing to Nix.

Let’s encode strings as UTF-8 when dumping them to Nix.

Also fix the test data from e2b5065, they were copied as reported by parserTraced but the actual data was mostly Unicode with few escape sequences.

`dconf dump` uses `g_variant_print`, which prints most Unicode characters verbatim. dconf2nix would read those and then use `show` to serialize the parsed strings. But `show` encodes Unicode characters as a decimal number preceded by a backslash, (e.g. `\129315`), which means nothing to Nix. We have previously implemented special handling of strings consisting of just a single emoji code point, to be able to import GNOME Characters history. But an emoji glyph can consist of multiple code points, which was not handled. Let’s revert the emoji hack and add systematic Unicode support in next commit. Reverts 8a33e7c Reverts 9b44d67

`dconf dump` uses `g_variant_print`, which prints most Unicode characters verbatim. dconf2nix would read those and then use `show` to serialize the parsed strings. But `show` encodes Unicode characters as a decimal number preceded by a backslash, (e.g. `\129315`), which means nothing to Nix. Let’s encode strings as UTF-8 when dumping them to Nix. Also fix the test data from e2b5065, they were copied as reported by `parserTraced` but the actual data was mostly Unicode with few escape sequences.

jtojnar mentioned this pull request Apr 14, 2024

Add full Variant support #90

Merged

gvolpe approved these changes Apr 15, 2024

View reviewed changes

jtojnar added 2 commits April 16, 2024 00:07

jtojnar merged commit e8a5dd1 into nix-community:master Apr 15, 2024
1 check passed

jtojnar deleted the unicode branch April 15, 2024 22:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add full Unicode support #89

Add full Unicode support #89

jtojnar commented Apr 14, 2024 •

edited

Loading

Add full Unicode support #89

Add full Unicode support #89

Conversation

jtojnar commented Apr 14, 2024 • edited Loading

jtojnar commented Apr 14, 2024 •

edited

Loading