Fix escaping of names in IR by tlively · Pull Request #8880 · WebAssembly/binaryen

tlively · 2026-07-01T01:13:19Z

The binary reader (but not the text parser) would previously escape names read from the names section and use those escaped names in the IR. But there is no reason to escape names in the IR; we can use the unmodified byte sequences instead. Remove this unnecessary escaping and always store names unescaped in the IR.

The binary writer was partially unescaping names when writing them back to the names section, but this is no longer necessary. After removing this, we simply preserve whatever names were present in the input, modulo any changes we might have made to avoid duplicate names.

Asyncify and wasm-split read function names as input from the command line and from manifest files and would previously escape these names on reading so they would match up with the escaped names in the IR. Now that the IR does not store escaped names, fix Asyncify and wasm-split to no longer escape these names. As a follow-on change we might consider unescaping these names instead, which would allow users to more easily pass names containing unprintable characters.
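
To make the idea concrete, here is a minimal sketch (not Binaryen's actual code) of storing names as raw bytes and escaping only when a name is displayed. The helpers `escapeForDisplay` and `hasFunction`, and the exact set of bytes that gets escaped, are assumptions made for illustration.

```cpp
#include <string>
#include <string_view>
#include <unordered_set>

// Hypothetical helper: render a raw name for display, writing every byte
// that is not printable ASCII (plus '\\' and '"') as a `\xx` hex escape.
std::string escapeForDisplay(std::string_view raw) {
  static const char* hex = "0123456789abcdef";
  std::string out;
  for (unsigned char c : raw) {
    if (c < 0x20 || c > 0x7e || c == '\\' || c == '"') {
      out.push_back('\\');
      out.push_back(hex[c >> 4]);
      out.push_back(hex[c & 0xf]);
    } else {
      out.push_back(char(c));
    }
  }
  return out;
}

// Hypothetical stand-in for a lookup against names stored in the IR. The
// names are kept as unmodified bytes, so user input matches only if it is
// compared without escaping it first.
bool hasFunction(const std::unordered_set<std::string>& functions,
                 std::string_view name) {
  return functions.count(std::string(name)) != 0;
}
```

With names stored this way, a function whose raw name is `a\b` is found by passing `a\b` directly, while looking up its escaped form `a\5cb` fails. That is the mismatch the Asyncify and wasm-split changes avoid by no longer escaping their inputs.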

The binary reader (but not the text parser) would previously escape names read from the names section and use those escaped names in the IR. But there is no reason to escape names in the IR; we can use the unmodified byte sequences instead. Remove this unnecessary escaping and always store names unescaped in the IR.

The binary writer was further escaping names when writing them back to the names section. Remove this escaping so that we simply preserve whatever names were present in the input, modulo any deduplication we might have done.

Asyncify and wasm-split read function names as input from the command line and from manifest files and would previously escape these names on reading so they would match up with the escaped names in the IR. Now that the IR does not store escaped names, fix Asyncify and wasm-split to no longer escape these names. As a follow-on change we might consider _unescaping_ these names instead, which would allow users to more easily pass names containing unprintable characters.

As a drive-by, fix the printing in MinifyImportsAndExports to properly use JSON escaping.

tlively · 2026-07-01T01:15:34Z

@aheejin, it looks like you were right earlier about the IR storing escaped names. I was confused because that does not happen when the input is text. This will make everything more consistent and punt on the question of how and whether we should accept escaped names in the wasm-split manifests.

aheejin

Nice! This makes things cleaner.

Can we also remove the escape and unescape functions in https://github.com/aheejin/binaryen/blob/main/src/ir/names.cpp and https://github.com/aheejin/binaryen/blob/main/src/ir/names.h that I added in #8868?

aheejin · 2026-07-01T01:29:46Z

+        String::printEscapedJSON(std::cout, key.first.view()) << ", ";
+        String::printEscapedJSON(std::cout, key.second.view()) << ", ";
+        String::printEscapedJSON(std::cout, new_.view()) << "]";


Why should this be printEscapedJSON and not printEscaped? And what are the differences?

This pass is printing JSON, so it should be printing JSON escape sequences rather than WebAssembly text format escape sequences. An example of the difference is that JSON uses "\uABCD" for Unicode character U+ABCD, whereas the WebAssembly text format would use "\u{ABCD}".

However, I've reverted this part of the change. printEscapedJSON expects the representation of its string argument to be WTF-16, but that is not the case here.
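
The difference between the two escape syntaxes can be sketched as follows. This is an illustration only, not the implementation of `printEscapedJSON` or `printEscaped`; the function names `jsonUnicodeEscape` and `watUnicodeEscape` are hypothetical, and the sketch does not validate that its input is a Unicode scalar value.

```cpp
#include <cstdint>
#include <cstdio>
#include <string>

// JSON escapes have the form \uXXXX with exactly four hex digits, one escape
// per UTF-16 code unit, so code points above U+FFFF need a surrogate pair.
std::string jsonUnicodeEscape(uint32_t codePoint) {
  char buf[16];
  if (codePoint <= 0xFFFF) {
    std::snprintf(buf, sizeof(buf), "\\u%04X", unsigned(codePoint));
    return buf;
  }
  uint32_t v = codePoint - 0x10000;
  unsigned high = 0xD800 + (v >> 10);   // top 10 bits
  unsigned low = 0xDC00 + (v & 0x3FF);  // bottom 10 bits
  std::snprintf(buf, sizeof(buf), "\\u%04X\\u%04X", high, low);
  return buf;
}

// The WebAssembly text format writes the code point itself inside braces,
// with a variable number of hex digits and no surrogate pairs.
std::string watUnicodeEscape(uint32_t codePoint) {
  char buf[16];
  std::snprintf(buf, sizeof(buf), "\\u{%X}", unsigned(codePoint));
  return buf;
}
```

For U+ABCD the two functions return `\uABCD` and `\u{ABCD}` respectively. For U+1F600 the JSON form becomes the surrogate pair `\uD83D\uDE00`, which is why a JSON printer has to reason in terms of 16-bit code units.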

aheejin · 2026-07-01T01:39:24Z

-void WasmBinaryWriter::writeEscapedName(std::string_view name) {
-  if (name.find('\\') == std::string_view::npos) {
-    writeInlineString(name);
-    return;
-  }
-  // decode escaped by escapeName (see below) function names
-  std::string unescaped;
-  for (size_t i = 0; i < name.size();) {
-    char ch = name[i++];
-    // support only `\xx` escapes; ignore invalid or unsupported escapes
-    if (ch != '\\' || i + 1 >= name.size() || !isHexDigit(name[i]) ||
-        !isHexDigit(name[i + 1])) {
-      unescaped.push_back(ch);
-      continue;
-    }
-    unescaped.push_back(
-      char((decodeHexNibble(name[i]) << 4) | decodeHexNibble(name[i + 1])));
-    i += 2;
-  }
-  writeInlineString({unescaped.data(), unescaped.size()});
-}
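
For reference, the decoding performed by the removed function can be reproduced standalone. The sketch below is adapted from the diff above: `isHexDigit` and `decodeHexNibble` are reimplemented locally as assumptions about the Binaryen helpers of the same names, the function is renamed to the hypothetical `unescapeHexName`, and the result is returned instead of being written to the binary.

```cpp
#include <cstddef>
#include <string>
#include <string_view>

// Local stand-ins for the helpers used in the diff (assumed behavior).
static bool isHexDigit(char c) {
  return (c >= '0' && c <= '9') || (c >= 'a' && c <= 'f') ||
         (c >= 'A' && c <= 'F');
}
static int decodeHexNibble(char c) {
  if (c >= '0' && c <= '9') return c - '0';
  if (c >= 'a' && c <= 'f') return c - 'a' + 10;
  return c - 'A' + 10;
}

// Decode `\xx` escapes; copy invalid or unsupported escapes through as-is.
std::string unescapeHexName(std::string_view name) {
  std::string unescaped;
  for (size_t i = 0; i < name.size();) {
    char ch = name[i++];
    // After the increment, `i` indexes the first candidate hex digit.
    if (ch != '\\' || i + 1 >= name.size() || !isHexDigit(name[i]) ||
        !isHexDigit(name[i + 1])) {
      unescaped.push_back(ch);
      continue;
    }
    unescaped.push_back(
      char((decodeHexNibble(name[i]) << 4) | decodeHexNibble(name[i + 1])));
    i += 2;
  }
  return unescaped;
}
```

In this sketch `a\20b` decodes to `a b`, while `a\zzb` is returned unchanged. A name whose raw bytes happen to contain a backslash followed by two hex digits, such as `\41`, is also rewritten (to `A`), so decoding on write does not preserve such names byte for byte.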


The binary writer was further escaping names when writing them back to the names section.

Isn't this unescaping names, even though the function name suggests the opposite? Anyway, if the IR has unescaped names now and the name section should have the same, removing this looks correct.

Oh yes, you're right. I'll update the description.

aheejin · 2026-07-01T05:27:34Z

-Name escape(Name name);
 // Unescapes a WebAssembly identifier back into its original human-readable
 // string.
 std::string unescape(Name name);


Can't we remove unescape?

It looks like it's used in wasm-split --print-profile --unescape. Do we still need this given that the IR has unescaped names already?

tlively requested a review from a team as a code owner July 1, 2026 01:13

tlively requested review from aheejin, kripken and stevenfontanella and removed request for a team July 1, 2026 01:13

Merge branch 'main' into no-escape-names-section

b835480

aheejin reviewed Jul 1, 2026


aheejin mentioned this pull request Jul 1, 2026

[wasm-split] Use escapes names for manifest files #8876

Closed

tlively added 3 commits June 30, 2026 20:26

undo MinifyImportsAndExports change

8f13032

update binaryen.js tests

f9c003a

remove old escape code

3069f22

aheejin reviewed Jul 1, 2026


tlively wants to merge 5 commits into main from no-escape-names-section
