Skip to content

Use of C++20 UTF-8 types #50

Open
Open
@Hawk777

Description

@Hawk777

C++20 adds the char8_t type, which has the same size, signedness, and alignment as unsigned char (not char, which might be signed), and is intended to be used to hold UTF-8 code units. Along with that are the u8string and u8string_view classes. It might be nice if libxml++ allowed users to use those classes, rather than std::string, for moving data into and out of XML documents, since it makes it more clear that UTF-8 is in use (i.e. acting as a marker), and also avoids the annoying hazards of char potentially being signed.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions