• Re: Wondering ...

    From hrtuybxi@hrtuybxi@outlook.com to Lawrence D'Oliveiro on Wed May 28 20:18:54 2025
    From Newsgroup: comp.misc

    On 24/04/2025 02:16, Lawrence D'Oliveiro wrote:
    I was slightly disillusioned when I found that there were certain
    characters that were not allowed in XML files, even when entity-encoded.

    Not even in CDATA section? I wonder what characters those might be.
    --- Synchronet 3.21a-Linux NewsLink 1.2
  • From Rich@rich@example.invalid to comp.misc on Wed May 28 19:30:20 2025
    From Newsgroup: comp.misc

    hrtuybxi@outlook.com wrote:
    On 24/04/2025 02:16, Lawrence D'Oliveiro wrote:
    I was slightly disillusioned when I found that there were certain
    characters that were not allowed in XML files, even when entity-encoded.

    Not even in CDATA section? I wonder what characters those might be.

    https://www.w3.org/TR/REC-xml/#charsets

    Character Range
    [2] Char ::= #x9 | #xA | #xD | [#x20-#xD7FF] |
    [#xE000-#xFFFD] | [#x10000-#x10FFFF]
    /* any Unicode character, excluding the surrogate blocks, FFFE, and FFFF. */

    Most of the control characters are excluded (everything below #x20
    except tab, line feed, and carriage return), plus the surrogate block
    and the code points FFFE and FFFF.
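
    A minimal sketch of that production as a Python check, in case it helps
    to see the ranges spelled out. The name is_xml_char is made up for this
    example; it is not part of any library.

```python
def is_xml_char(ch: str) -> bool:
    """Return True if ch matches production [2] Char of XML 1.0."""
    cp = ord(ch)
    return (
        cp in (0x9, 0xA, 0xD)          # tab, line feed, carriage return
        or 0x20 <= cp <= 0xD7FF        # everything up to the surrogate block
        or 0xE000 <= cp <= 0xFFFD      # skips surrogates, stops before FFFE/FFFF
        or 0x10000 <= cp <= 0x10FFFF   # supplementary planes
    )


# The control characters below #x20 that the production leaves out.
excluded_controls = [cp for cp in range(0x20) if not is_xml_char(chr(cp))]
```

    Here excluded_controls covers every code point below #x20 other than
    #x9, #xA, and #xD.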

    And, yes, even in CDATA sections, because CDATA references the above
    "Char" definition to define what characters are allowed in CDATA:

    [20] CData ::= (Char* - (Char* ']]>' Char*))
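
    A quick way to check this against a real parser, sketched with the
    ElementTree module from the Python standard library (which sits on top
    of expat in CPython). The helper name parses and the element name c are
    just placeholders for this example. It tries CTRL/C as a literal
    character, as a character reference, and inside a CDATA section.

```python
import xml.etree.ElementTree as ET


def parses(doc: str) -> bool:
    """Return True if the parser accepts doc as well-formed XML."""
    try:
        ET.fromstring(doc)
        return True
    except ET.ParseError:
        return False


CTRL_C = chr(3)

results = {
    "literal": parses(f"<c>{CTRL_C}</c>"),            # raw character in content
    "char_ref": parses("<c>&#3;</c>"),                # numeric character reference
    "cdata": parses(f"<c><![CDATA[{CTRL_C}]]></c>"),  # inside a CDATA section
    "plain": parses("<c>ok</c>"),                     # control case, should parse
}
```

    If the parser follows the spec, only the "plain" entry should come back
    True.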

  • From Lawrence D'Oliveiro@ldo@nz.invalid to comp.misc on Wed May 28 23:00:36 2025
    From Newsgroup: comp.misc

    On Wed, 28 May 2025 20:18:54 +0200, hrtuybxi wrote:

    On 24/04/2025 02:16, Lawrence D'Oliveiro wrote:

    I was slightly disillusioned when I found that there were certain
    characters that were not allowed in XML files, even when
    entity-encoded.

    Not even in CDATA section? I wonder what characters those might be.

    The one I encountered was chr(3), a.k.a. CTRL/C. Somehow one of the
    operators had managed to enter that into a comments field, causing my
    export-to-XML function to fail.
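
    One possible workaround for this kind of failure is to strip (or
    replace) anything outside the XML 1.0 Char production before the text
    reaches the exporter. The sketch below is not the export function
    mentioned above; sanitize_for_xml, export_comment, and the comment
    element name are all hypothetical, and whether silently dropping
    characters is acceptable depends on the data.

```python
import re
import xml.etree.ElementTree as ET

# Matches any single character NOT allowed by XML 1.0 production [2] Char.
_INVALID_XML_CHARS = re.compile(
    r"[^\x09\x0A\x0D\x20-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def sanitize_for_xml(text: str, replacement: str = "") -> str:
    """Remove or replace characters that XML 1.0 cannot represent."""
    return _INVALID_XML_CHARS.sub(replacement, text)


def export_comment(comment: str) -> str:
    """Serialize a (hypothetical) comments field as an XML element."""
    elem = ET.Element("comment")
    elem.text = sanitize_for_xml(comment)
    return ET.tostring(elem, encoding="unicode")


# A comment with a stray CTRL/C in it, as in the case described above.
xml_text = export_comment("abort" + chr(3) + " now")
```

    Passing a replacement such as "\ufffd" instead of the empty string keeps
    a visible marker where a character was dropped.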