Document support for escaped UTF-16 and UTF-32 Unicode characters

Fixes #5607
2026-01-07 02:12:07 +03:00 · 2023-03-30 06:29:48 +02:00
parent fd55ac760a
commit 562dfdcbcc
1 changed files with 11 additions and 1 deletions
--- a/tutorials/scripting/gdscript/gdscript_basics.rst
+++ b/tutorials/scripting/gdscript/gdscript_basics.rst
@@ -487,9 +487,19 @@ Strings can contain the following escape sequences:
 +---------------------+---------------------------------+
 | ``\\``              | Backslash                       |
 +---------------------+---------------------------------+
-| ``\uXXXX``          | Unicode codepoint ``XXXX``      |
+| ``\uXXXX``          | UTF-16 Unicode codepoint        |
+|                     | ``XXXX``                        |
 |                     | (hexadecimal, case-insensitive) |
 +---------------------+---------------------------------+
+| ``\UXXXXXX``        | UTF-32 Unicode codepoint        |
+|                     | ``XXXXXX``                      |
+|                     | (hexadecimal, case-insensitive) |
+---------------------+---------------------------------+
+
+There are two ways to represent an escaped Unicode character above 0xFFFF:
+
+- as a `UTF-16 surrogate pair <https://en.wikipedia.org/wiki/UTF-16#Code_points_from_U+010000_to_U+10FFFF>`_ ``\uXXXX\uXXXX``.
+- as a single UTF-32 codepoint ``\UXXXXXX``.

 Also, using ``\`` followed by a newline inside a string will allow you to continue it in the next line, without
 inserting a newline character in the string itself.