Cranelift: x64: fix user-controlled recursion in cmp emission. #12333

cfallin · 2026-01-13T17:49:04Z

We had a set of rules introduced in #11097 that attempted to optimize the case of testing the result of an `icmp` for a nonzero value. This allowed optimization of, for example, `(((x == 0) == 0) == 0 ...)` to a single level, either `x == 0` or `x != 0` depending on even/odd nesting depth.
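
To make the even/odd claim concrete, here is a minimal Rust sketch. It is not Cranelift code: the `Node` arena, `Test`, `collapse_eqz_chain`, and `build_chain` are hypothetical names invented for illustration. It walks a chain of `== 0` tests with a loop rather than recursion, so its stack use does not grow with the nesting depth.

```rust
/// Hypothetical toy IR: nodes live in an arena and refer to each other by index.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Node {
    /// An opaque input value `x`.
    Var,
    /// `(arg == 0)`, where the payload is the arena index of `arg`.
    EqZero(usize),
}

/// The single-level test that a chain of `== 0` comparisons reduces to.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Test {
    /// `x == 0` (odd nesting depth).
    IsZero,
    /// `x != 0` (even, nonzero nesting depth).
    IsNonzero,
}

/// Walks from `root` down to the underlying variable with a loop and returns
/// the equivalent one-level test, or `None` if `root` is not a comparison.
fn collapse_eqz_chain(arena: &[Node], root: usize) -> Option<Test> {
    let mut depth = 0usize;
    let mut cur = root;
    while let Node::EqZero(arg) = arena[cur] {
        depth += 1;
        cur = arg;
    }
    match depth {
        0 => None,
        d if d % 2 == 1 => Some(Test::IsZero),
        _ => Some(Test::IsNonzero),
    }
}

/// Builds `x` wrapped in `depth` layers of `== 0`; returns the arena and the root index.
fn build_chain(depth: usize) -> (Vec<Node>, usize) {
    let mut arena = vec![Node::Var];
    for i in 0..depth {
        // Each new node compares the previously pushed node against zero.
        arena.push(Node::EqZero(i));
    }
    let root = arena.len() - 1;
    (arena, root)
}
```

For example, `build_chain(3)` models `((x == 0) == 0) == 0`, which collapses to `Test::IsZero`, while `build_chain(2)` collapses to `Test::IsNonzero`.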

Unfortunately this kind of recursion in the backend has a depth bounded only by the user input, hence creates a compiler DoS vulnerability: the wrong kind of compiler input can cause a stack overflow in Cranelift at compilation time. This case is reachable from Wasmtime's Wasm frontend via the `i32.eqz` operator (for example) as well.

Ideally, this kind of deep rewrite is best done in our mid-end optimizer, where we think carefully about bounds for recursive rewrites. The left-hand sides for the backend rules should really be fixed shapes that correspond to machine instructions, rather than ad-hoc peephole optimizations in their own right.
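
As an illustration of what a bound on a recursive rewrite can look like, the sketch below gives the recursion an explicit depth budget. This is only one common technique, shown under assumptions: the thread does not describe how the mid-end actually bounds its rewrites, and `Node`, `MAX_REWRITE_DEPTH`, and `strip_eqz` are hypothetical names.

```rust
/// Hypothetical toy IR: nodes live in an arena and refer to each other by index.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Node {
    /// An opaque input value.
    Var,
    /// `(arg == 0)`, where the payload is the arena index of `arg`.
    EqZero(usize),
}

/// Hypothetical cap on how many layers a single rewrite may look through.
const MAX_REWRITE_DEPTH: usize = 8;

/// Looks through at most `budget` layers of `== 0` starting at `idx`.
///
/// Returns `(base, negated)`, meaning "`idx` is nonzero" is equivalent to
/// "`base` is nonzero" when `negated` is false, and to "`base` is zero" when
/// `negated` is true. When the budget runs out the function stops
/// simplifying and returns the current node unchanged, which is still correct.
fn strip_eqz(arena: &[Node], idx: usize, budget: usize) -> (usize, bool) {
    if budget == 0 {
        return (idx, false);
    }
    match arena[idx] {
        Node::EqZero(arg) => {
            // The recursion depth is at most `budget`, whatever the input looks like.
            let (base, negated) = strip_eqz(arena, arg, budget - 1);
            (base, !negated)
        }
        Node::Var => (idx, false),
    }
}
```

The trade-off is that very deep chains are only partially simplified, but the stack depth no longer depends on the input.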

This fix thus simply removes the recursion case that causes the blowup. The patch includes two tests: one with optimizations disabled, showing correct compilation (without the fix, this case fails to compile with a stack overflow), and one with optimizations enabled, showing that the mid-end properly cleans up the nested expression and we get the expected one-level result anyway.

Note: this was reported as a security issue by @venkkatesh-sekar (thanks!); per our security policy, compilation DoS is not covered, so this is being fixed in a public PR. I will subsequently make a patch release to our currently supported versions (v36, v39, v40, v41).

alexcrichton · 2026-01-13T18:19:18Z

I haven't dug deeply into the fallout here, but it looks like just removing these rules isn't enough to retain the codegen we have from before. A rough hunch is that there's a number of entrypoints for using these ISLE helpers and while the wasm test case here is one which is also cleaned up by the optimizer I think that these rules might be load-bearing for other codegen patterns unrelated to the optimizer.

It might need to be the case that some entrypoints use one rule, which includes these deleted rules, but then these rules don't recurse on themselves?

alexcrichton · 2026-01-13T18:21:34Z

Although, from another angle, this seems fine as a thing to backport since it's clearly such low impact. I'm not aware of these codegen patterns being load bearing in terms of performance so a temporary minor regression while a "full fix" is developed for main I think would also be reasonable. In such a case though there's a number of codegen/etc tests to update to get CI passing.

cfallin · 2026-01-13T18:59:31Z

This broke some existing lowerings; playing a bit more with various ways to break the cycle.

cfallin · 2026-01-13T19:00:17Z

(On refresh, what Alex said -- somehow those comments didn't appear for me.)

cfallin · 2026-01-13T20:35:32Z

OK, I've reworked things a bit to preserve exactly the same codegen on branches -- I effectively peeled one layer off of `is_nonzero_cmp` to continue to use with branches, but the previous backedge from `emit_cmp` to `is_nonzero_cmp` now invokes `is_nonzero`, which has only the base cases. Together with explicit LHS patterns to peel one level of `uextend`, this leaves us with no test-output changes now.

cfallin · 2026-01-13T20:37:34Z

Or, well, spoke too soon -- no Cranelift filetest changes. A few disas tests that do FP compares do regress to materializing the bool.

cfallin · 2026-01-13T21:58:17Z

OK, I've gotten full equivalence on the disas tests too -- this needed one new mid-end rule to see through a pattern (`fcmp(...) != 0`) that we were previously only ever simplifying with this peephole rule in the backend.

@fitzgen mind giving this another look, since it's a little heavier than before?

Re: backport, I could certainly backport the rule deletions only, but I'd be a little concerned about performance regressions in that case because this affects many branches. If we feel ok about these rewrites I think the full patch should be applicable back to v36 (LTS).
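
For readers unfamiliar with the `fcmp(...) != 0` pattern mentioned above, the following Rust sketch models why such a rewrite is sound and why it needs no recursion. It assumes a scalar compare yields 0 or 1; the `Node` arena and the names `simplify_ne_zero`, `fcmp_lt`, and `icmp_ne` are hypothetical, and the actual mid-end rule added in the PR is not shown in this thread.

```rust
/// Hypothetical toy IR: nodes live in an arena and refer to each other by index.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Node {
    /// An opaque floating-point input.
    Input,
    /// The integer constant 0.
    Zero,
    /// `fcmp lt lhs, rhs`, producing 0 or 1.
    FcmpLt(usize, usize),
    /// `icmp ne lhs, rhs`, producing 0 or 1.
    IcmpNe(usize, usize),
}

/// Rewrites `icmp ne (fcmp ...), 0` to the inner `fcmp`. It matches one fixed
/// shape and never calls itself, so its cost does not depend on nesting depth.
fn simplify_ne_zero(arena: &[Node], v: usize) -> usize {
    if let Node::IcmpNe(lhs, rhs) = arena[v] {
        if matches!(arena[lhs], Node::FcmpLt(..)) && arena[rhs] == Node::Zero {
            return lhs;
        }
    }
    v
}

/// Reference semantics used to check the rewrite: compares produce 0 or 1.
fn fcmp_lt(a: f64, b: f64) -> u8 {
    (a < b) as u8
}

/// Reference semantics for integer inequality.
fn icmp_ne(a: u8, b: u8) -> u8 {
    (a != b) as u8
}
```

Because `fcmp_lt` only ever returns 0 or 1, `icmp_ne(fcmp_lt(a, b), 0)` equals `fcmp_lt(a, b)` for every pair of inputs, including NaN.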

cranelift/codegen/src/isa/x64/inst.isle

alexcrichton · 2026-01-13T22:44:46Z

Also, logistically, @cfallin are you ok running point on backporting/point releases? And 24.0.0, while supported, is unaffected right?

cfallin · 2026-01-13T22:50:10Z

Yes, I'm happy to do the backports too. (Sorry, very slow today taking scraps of time between meetings and other interrupts; can hopefully do backports by end of tomorrow at latest.)

alexcrichton · 2026-01-13T22:53:01Z

Oh no worries! Just want to make sure there's explicitly an owner. Tomorrow is totally fine and I can be around to hit approvals and help babysit CI. We've had some CI changes recently like trusted publishing which raises the risk of broken CI on other branches, so something to watch out for.

This change works by splitting a rule so that the entry point used by `brif` lowering can still peel off one layer of `icmp` and emit it directly, without entering the unbounded structural recursion. It also adds a mid-end rule to catch one case that we were previously catching in the backend only: `fcmp(...) != 0`.
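
The split described above can be modeled roughly as follows. This is a Rust sketch of the call structure only, under stated assumptions: the real rules are written in ISLE in `cranelift/codegen/src/isa/x64/inst.isle` and differ in detail, the `Node`, `Cc`, and `Flags` types are hypothetical, and the function names merely echo the helpers named in this thread.

```rust
/// Hypothetical condition codes.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Cc {
    Eq,
    Ne,
}

/// Hypothetical toy IR: nodes live in an arena and refer to each other by index.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Node {
    /// An opaque integer value.
    Var,
    /// The integer constant 0.
    Zero,
    /// `icmp cc lhs, rhs`.
    Icmp(Cc, usize, usize),
}

/// How the branch condition gets computed.
#[derive(Debug, PartialEq)]
enum Flags {
    /// Emit `cmp lhs, rhs`, then branch on `cc`.
    Cmp(Cc, usize, usize),
    /// Emit `test v, v`, then branch if nonzero.
    TestNonzero(usize),
}

/// Base case only: tests a value against zero without inspecting how it was produced.
fn is_nonzero(v: usize) -> Flags {
    Flags::TestNonzero(v)
}

/// Entry point for branch lowering: peels at most one `icmp` layer. The
/// operands are used as already-computed values and are not inspected
/// further, so the call depth is constant.
fn is_nonzero_cmp(arena: &[Node], v: usize) -> Flags {
    match arena[v] {
        Node::Icmp(cc, lhs, rhs) => Flags::Cmp(cc, lhs, rhs),
        _ => is_nonzero(v),
    }
}
```

With an arena holding `x`, `0`, `x == 0`, and `(x == 0) == 0`, lowering a branch on the outermost node compares the inner `x == 0` value against zero directly; the inner compare is materialized as a value rather than folded in the backend, which is the job left to the mid-end.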

cfallin · 2026-01-13T23:00:21Z

OK, this should be good to go -- if r+ here (on new changes) then I can put up backports momentarily.

…odealliance#12333) * Cranelift: x64: fix user-controlled recursion in cmp emission. * Preserve codegen on branches.

… (#12338) * Cranelift: x64: fix user-controlled recursion in cmp emission. * Preserve codegen on branches.

… (#12341) * Cranelift: x64: fix user-controlled recursion in cmp emission. * Preserve codegen on branches.

… (#12339) * Cranelift: x64: fix user-controlled recursion in cmp emission. * Preserve codegen on branches.

… (#12342) * Cranelift: x64: fix user-controlled recursion in cmp emission. * Preserve codegen on branches.

adambratschikaye · 2026-01-14T10:46:37Z

Thanks for taking care of this @cfallin, @alexcrichton, and @fitzgen!

This is a patch version bump of wasmtime which contains the backport fix for the compiler bug reported in bytecodealliance/wasmtime#12333. Co-authored-by: IDX GitHub Automation <infra+github-automation@dfinity.org>

cfallin requested a review from a team as a code owner January 13, 2026 17:49

cfallin requested review from alexcrichton and fitzgen and removed request for a team January 13, 2026 17:49

fitzgen approved these changes Jan 13, 2026

cfallin enabled auto-merge January 13, 2026 17:51

cfallin requested a review from a team as a code owner January 13, 2026 20:37

cfallin added this pull request to the merge queue Jan 13, 2026

cfallin removed this pull request from the merge queue due to a manual request Jan 13, 2026

github-actions bot added the cranelift and cranelift:area:x64 labels Jan 13, 2026

cfallin force-pushed the icmp-recursion branch from 48b4f6c to 96a2b7e Compare January 13, 2026 21:55

alexcrichton reviewed Jan 13, 2026

cfallin force-pushed the icmp-recursion branch from 96a2b7e to 8453886 Compare January 13, 2026 22:59

alexcrichton approved these changes Jan 13, 2026

cfallin enabled auto-merge January 13, 2026 23:11

cfallin added this pull request to the merge queue Jan 13, 2026

Merged via the queue into bytecodealliance:main with commit 55105fb Jan 13, 2026
58 checks passed

cfallin deleted the icmp-recursion branch January 13, 2026 23:36

alexcrichton mentioned this pull request Jan 13, 2026

[41.0.0] Fix ISLE icmp optimization rules for vector inputs #12340

Merged

alexcrichton mentioned this pull request Jan 14, 2026

[40.0.x] Backport a few codegen fixes #12345

Merged

mmcloughlin mentioned this pull request Jan 18, 2026

Cranelift: aarch64: user-controlled recursion in lower_fmla #12368

Closed

cfallin mentioned this pull request Jan 20, 2026

x64/aarch64: Remove recursion in fma backend rules #12369

Merged

venkkatesh-sekar mentioned this pull request Jan 20, 2026

chore(ic): Bump wasmtime to v40.0.2 dfinity/ic#8425

Merged

mmcloughlin mentioned this pull request Jan 29, 2026

Cranelift: ISLE recursion check #12474

Merged
