[eagle overlap spec] wip impl top k > 1 in overlap eagle worker(v2) #11839

vincentzed · 2025-10-19T22:31:37Z

Summary

Work-in-progress attempt to implement top_k > 1 in the EAGLE overlap spec worker (v2), which is meant to replace the non-overlap v1 worker.

Accuracy Tests

SGLANG_ALLOW_OVERWRITE_LONGER_CONTEXT_LEN=1 python -m sglang.launch_server --dtype float16 --model-path unsloth/Meta-llama-3.1-8b-instruct --attention-backend triton --decode-log-interval 1 --disable-cuda-graph --speculative-algorithm EAGLE --speculative-draft-model-path lmsys/sglang-EAGLE-LLAMA3-instruct-8B --mem-fraction-static 0.8 --speculative-num-steps 3 --speculative-eagle-topk 2 --speculative-num-draft-tokens 4 --page-size 1 --disable-radix-cache --enable-beta-spec

Right now it still produces gibberish with the command above (topk = 2).

With topk = 1 the output is fine. Use page size 1 only for now.
FlashInfer is not explicitly supported.
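
One way to make "gibberish" measurable: with greedy decoding, speculative decoding is expected to reproduce the target model's output, so the spec server can be compared against a non-speculative baseline and the first point of divergence reported. The snippet below is a minimal sketch, not part of this PR. It assumes two servers are already running at hypothetical URLs (for example a baseline on port 30000 and the command above on port 30001) and that both expose SGLang's native /generate endpoint. The helper names are made up for illustration. Small float16 or kernel differences could also cause late divergence, so an early mismatch is the stronger signal.

```python
import json
import urllib.request


def generate(base_url: str, prompt: str, max_new_tokens: int = 64) -> str:
    """Greedy-decode `prompt` through the server's /generate endpoint."""
    payload = {
        "text": prompt,
        # temperature 0 makes decoding greedy, so outputs are comparable
        "sampling_params": {"temperature": 0, "max_new_tokens": max_new_tokens},
    }
    req = urllib.request.Request(
        f"{base_url}/generate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req, timeout=120) as resp:
        return json.loads(resp.read())["text"]


def first_divergence(reference: str, candidate: str) -> int:
    """Return the index of the first differing character, or -1 if identical."""
    for i, (a, b) in enumerate(zip(reference, candidate)):
        if a != b:
            return i
    if len(reference) != len(candidate):
        # One string is a strict prefix of the other.
        return min(len(reference), len(candidate))
    return -1


def compare_servers(baseline_url: str, spec_url: str, prompts: list[str]) -> dict[str, int]:
    """Map each prompt to the first divergence index between the two servers."""
    return {
        p: first_divergence(generate(baseline_url, p), generate(spec_url, p))
        for p in prompts
    }
```

Calling `compare_servers("http://localhost:30000", "http://localhost:30001", ["The capital of France is"])` would return -1 for a prompt whose outputs match exactly and a small index when the topk > 1 path goes wrong early.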

Baseline result

Backend	Condition	Status
Triton	Page size > 1	✔
Triton	Page size = 1	✔
Triton	Topk = 1	✔
Triton	Topk > 1	✗
Triton	Topk > 1 and page size > 1	✗
FlashInfer	Topk = 1	✔
FlashInfer	Topk > 1	✗
FlashInfer	Different page sizes	untested

Current Status

Output is still gibberish for topk > 1. The current focus is the Triton backend.

Signed-off-by: vincentzed <207368749+vincentzed@users.noreply.github.com>

upd

0e34b44

Signed-off-by: vincentzed <207368749+vincentzed@users.noreply.github.com>

hnyls2002 mentioned this pull request Oct 21, 2025

[Feature] Overlap Spec Support #11762

Open

26 tasks

JustinTong0323 mentioned this pull request Oct 24, 2025

[Bug] beta spec overlap gpt-oss fa3 topk>1 crash #11690

Closed

5 tasks

attack204 mentioned this pull request Nov 17, 2025

Jet-Nemotron — EAGLE3 + Varlen Dynamic Conv #13025

Open

amartyashankha mentioned this pull request Dec 18, 2025

EAGLE v2 overlap Tree Compaction: fix tree mode (topk > 1) #15360

Open

Terry-Uv mentioned this pull request Dec 23, 2025

[Overlap Spec V2 Eagle] Support Triton spec v2 top k >1 and pagesize > 1 #15664

Open

8 tasks

hnyls2002 closed this Dec 23, 2025
