
Conversation

@hero78119 (Collaborator) commented Dec 4, 2025

Related to #1145
Built on top of #1202

design rationale

Make the init chip span shards, and ensure the initialized addresses are consecutive across shards without overlap. The mechanism for reading from a previous shard works as usual.
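
Below is a minimal sketch of that address split, under the assumption that each shard records a watermark of the highest address it touches; the names `ShardInitRange` and `split_init_ranges` are hypothetical and not the actual Ceno API.

```rust
/// Hypothetical sketch: shard i initializes exactly [prev_watermark, watermark_i),
/// so per-shard init ranges are consecutive and never overlap.
#[derive(Debug)]
struct ShardInitRange {
    shard_id: usize,
    start: u32, // inclusive
    end: u32,   // exclusive
}

/// `region_start` would be e.g. platform.heap.start; `watermarks[i]` is the
/// highest address (exclusive) touched up to and including shard i.
fn split_init_ranges(region_start: u32, watermarks: &[u32]) -> Vec<ShardInitRange> {
    let mut ranges = Vec::new();
    let mut prev = region_start;
    for (shard_id, &wm) in watermarks.iter().enumerate() {
        ranges.push(ShardInitRange { shard_id, start: prev, end: wm });
        prev = wm; // the next shard starts exactly where this one ends
    }
    ranges
}

fn main() {
    let ranges = split_init_ranges(0x2800_0000, &[0x2800_1000, 0x2800_1000, 0x2800_4000]);
    for pair in ranges.windows(2) {
        assert_eq!(pair[0].end, pair[1].start); // consecutive: no gap, no overlap
    }
}
```

A shard that touches no new addresses simply gets an empty range (start == end), as in the second entry above.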

remove pitfall of Platform clone

Wrap Platform's prog_data in an Arc; otherwise each Platform clone carries a massive, hidden prover cost that we were not aware of.
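
A minimal sketch of the fix (the field layout shown here is illustrative, not the exact struct in the codebase): putting the large program image behind an `Arc` makes `Platform::clone` copy a pointer instead of the whole byte buffer.

```rust
use std::sync::Arc;

// Before: every `Platform` clone deep-copied the whole program image.
// #[derive(Clone)]
// struct Platform { prog_data: Vec<u8>, /* ... */ }

// After: clones only bump a reference count; the bytes are shared.
#[derive(Clone)]
struct Platform {
    prog_data: Arc<Vec<u8>>, // hypothetical field layout
}

fn main() {
    let platform = Platform { prog_data: Arc::new(vec![0u8; 64 << 20]) };
    let cheap_copy = platform.clone(); // O(1), no 64 MiB memcpy
    assert!(Arc::ptr_eq(&platform.prog_data, &cheap_copy.prog_data));
}
```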

benchmark

Run on 23817600 with K=6, cache trace = none, 1x4090

| Component | Base Time | Init-Across-Shard Time | Custom Serde Time | Init-Across-Shard Improvement | Custom Serde Improvement |
|---|---|---|---|---|---|
| app.prove | 203 s | 175 s | 167 s | ↓ 13.8% | ↓ 17.7% |
| emulator.preflight-execute | 43.9 s | 35.8 s | 36.7 s | ↓ 18.5% | ↓ 16.4% |
| app_prove.inner | 158 s | 139 s | 130 s | ↓ 12.0% | ↓ 17.7% |
| create_proof_of_shard (shard_id = 0) | 14.9 s | 4.91 s | 8.02 s | ↓ 67.0% | ↓ 46.2% |
| create_proof_of_shard (shard_id = 1) | 16.4 s | 12.7 s | 3.56 s | ↓ 22.6% | ↓ 78.3% |

verifier soundness TODOs in next PR

This PR includes some verifier changes, but it is not fully sound yet. A refactor to constrain a specific chip's number of instances is on the way; in short, it will support pi[<chip_id>].num_instance == chip_proof.num_instance. This is needed to ensure the init heap/hint in each shard is constrained within [platform.heap.start, platform.heap.end) or [platform.hint.start, platform.hint.end). A rough sketch of these checks follows the list below.

  • constrain pi[heap_length] and pi[hint_length] to equal the number of instances of the respective init chip.
  • constrain pi[xxx_start_addr] + pi[xxx_length] to stay within range.
  • apply to both the Rust and recursion verifiers.
  • constrain the init chip across shards to allow only one chip proof.
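
As referenced above, here is a hedged sketch of what such checks could look like on the Rust-verifier side. All names and the public-input layout here are hypothetical, and the unit of `pi_length` (entries vs. bytes) is glossed over; this is not the actual verifier code.

```rust
use std::ops::Range;

/// Assumed per-chip proof metadata (illustrative).
struct ChipProof {
    num_instances: u64,
}

/// Sketch of the planned checks for one init chip (heap or hint):
/// pin the instance count to the public input and range-check start + length.
fn check_init_chip(
    pi_start_addr: u64,
    pi_length: u64,
    region: Range<u64>, // e.g. platform.heap.start..platform.heap.end
    chip_proof: &ChipProof,
) -> Result<(), String> {
    // pi[<chip_id>].num_instance == chip_proof.num_instance
    if pi_length != chip_proof.num_instances {
        return Err("init-chip instance count does not match public input".into());
    }
    // pi[xxx_start_addr] + pi[xxx_length] must stay inside the platform region.
    let end = pi_start_addr
        .checked_add(pi_length)
        .ok_or_else(|| "start + length overflows".to_string())?;
    if pi_start_addr < region.start || end > region.end {
        return Err("init range escapes the platform region".into());
    }
    Ok(())
}
```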

Another TODO for hint reads and rkyv

rkyv serializes data from hint_end -> hint_start, so right now hint reads still mostly happen in the first shard, because on the first read max_hint_addr jumps to the end address. This makes the first shard still need to write a bunch of records to the shard RAM circuit. To improve this further, we need to find a way to make rkyv go from lower addresses to higher ones, which would solve the first-shard issue entirely.
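
To illustrate the issue, here is a toy computation of the hint watermark (purely illustrative; the actual max_hint_addr tracking in the emulator may look different). A single early metadata read near hint_end drags the watermark to the top of the region, so the first shard has to cover almost the whole hint segment.

```rust
/// Toy illustration: the watermark is just the highest hint address read so far.
fn hint_watermark(reads: &[u32]) -> u32 {
    reads.iter().copied().max().unwrap_or(0)
}

fn main() {
    // Sequential reads keep the watermark near hint_start...
    let sequential = [0x2800_0004, 0x2800_0008, 0x2800_0010];
    // ...but rkyv's early metadata fetch near hint_end inflates it immediately.
    let with_meta_fetch = [0x2800_0004, 0x2800_0008, 0x280c_8770, 0x2800_0010];

    assert_eq!(hint_watermark(&sequential), 0x2800_0010);
    assert_eq!(hint_watermark(&with_meta_fetch), 0x280c_8770);
}
```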

@hero78119 hero78119 marked this pull request as draft December 4, 2025 02:02
@hero78119 hero78119 changed the title from "track heap watermark" to "OMC: init heap table across different shard" Dec 5, 2025
@hero78119 (Collaborator, Author):
need to figure out soundness #1178

@hero78119 hero78119 added the speed label Dec 5, 2025
@hero78119 (Collaborator, Author):
TODO

  • check whether anything remains to revert from debugging, e.g. debug logs or par -> seq changes

@hero78119 hero78119 marked this pull request as ready for review December 22, 2025 13:28
@hero78119 hero78119 changed the title from "OMC: init heap table across different shard" to "OMC: init heap/hint table across different shard" Dec 22, 2025
@kunxian-xia (Collaborator) left a comment:
1st round of review

} else {
    system_config
        .mmu_config
        .assign_init_table_circuit(
Collaborator:
In fact we can just avoid this step as we only assign static init tables in the 0th shard.

@hero78119 (Collaborator, Author), Dec 29, 2025:
Although we could technically omit this for shard id > 0, in some places we need every chip present with an empty RMM, so we still need it; e.g. removing this causes mock_prover to fail.

// │ │ └─ later rw? YES (rw in >0 exists) -> ShardRAM
// │ │
// │ └─ rw occurs in current shard (current shard may be >0)
// │ ├─ later rw? NO (no rw in later) -> ShardRAM + LocalFinalize
Collaborator:
typo? It should be "LocalFinalize".

@hero78119 (Collaborator, Author):
"ShardRAM + LocalFinalize" correct, for example 0th shard is previous shard, then need to global read from ShardRAM

// │ └─ later rw? YES -> ShardRAM
// │
// └─ NO: init in a previous shard
// ├─ later rw? NO -> LocalFinalize
@hero78119 (Collaborator, Author):
TODO: will update this to `later rw? NO -> ShardRAM + LocalFinalize`.
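
For reference, a hedged restatement of the visible branches of the quoted decision tree as code; the enum and function names are made up, and the real routing logic in the diff has more branches and inputs than shown here.

```rust
/// Hypothetical restatement of the quoted comment, not the actual routing code.
#[derive(Debug, PartialEq)]
enum MemChips {
    ShardRam,
    LocalFinalize,
    ShardRamAndLocalFinalize,
}

/// `rw_in_current` / `rw_in_later`: whether the address is read/written in the
/// current shard, resp. in any later shard.
fn route(rw_in_current: bool, rw_in_later: bool) -> MemChips {
    match (rw_in_current, rw_in_later) {
        // rw in the current shard, nothing later: expose the value via ShardRAM
        // and also finalize it locally ("ShardRAM + LocalFinalize").
        (true, false) => MemChips::ShardRamAndLocalFinalize,
        // rw in a later shard: ShardRAM only; finalization belongs to that shard.
        (_, true) => MemChips::ShardRam,
        // init in a previous shard, no rw afterwards: currently LocalFinalize,
        // to be updated to ShardRAM + LocalFinalize per the TODO above.
        (false, false) => MemChips::LocalFinalize,
    }
}
```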

@kunxian-xia (Collaborator) left a comment:
LGTM 👍

@kunxian-xia kunxian-xia added this pull request to the merge queue Dec 30, 2025
Merged via the queue into master with commit fe4b65d Dec 30, 2025
4 checks passed
@kunxian-xia kunxian-xia deleted the feat/shard_mem_init branch December 30, 2025 08:20
github-merge-queue bot pushed a commit that referenced this pull request Dec 31, 2025
Follow-up on #1171.
Replace `rkyv` with a custom serde for two reasons:
- Serialization to bytes requires `rkyv`-friendly structs, or an external library, e.g. "rkyv + bincode". `rkyv`-friendly structs are not friendly to existing applications, particularly when the structs are defined in third-party libraries. We therefore previously integrated bincode for the serialization to bytes, which adds extra effort for the guest program, since we need bincode to deserialize back into an owned struct.
- During deserialization, `rkyv` accesses a high address to retrieve some meta information before the sequential read. Below is an example read pattern:
 ```
 hint address 28000004
 hint address 28000008
 hint address 280c8770 <- high addr accessed to fetch meta data
 hint address 280c876c
 hint address 280c876c
 hint address 280c8770
 hint address 28000010
 hint address 28000010
 hint address 28000010
 hint address 28000010
 hint address 28000014
 hint address 28000014
 hint address 28000014
 ...
 ```
The high-address access is not friendly if we want to record the max accessed
address in each shard for memory regions initialized across shards.
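
A minimal sketch of the idea behind a word-oriented serde (inspired by the openvm serde linked below; the actual `ceno_serde` API may differ): values are encoded into a flat stream of `u32` words that the guest consumes strictly front-to-back, so every hint read touches a monotonically increasing address.

```rust
/// Illustrative word-stream reader: the guest only ever advances `cursor`,
/// so hint addresses are consumed from low to high with no look-ahead.
struct WordReader<'a> {
    words: &'a [u32],
    cursor: usize,
}

impl<'a> WordReader<'a> {
    fn new(words: &'a [u32]) -> Self {
        Self { words, cursor: 0 }
    }

    fn read_u32(&mut self) -> u32 {
        let w = self.words[self.cursor];
        self.cursor += 1;
        w
    }

    /// Length-prefixed byte vector: one length word, then ceil(len / 4) data words.
    fn read_bytes(&mut self) -> Vec<u8> {
        let len = self.read_u32() as usize;
        let mut out = Vec::with_capacity(len);
        while out.len() < len {
            let word = self.read_u32().to_le_bytes();
            let take = (len - out.len()).min(4);
            out.extend_from_slice(&word[..take]);
        }
        out
    }
}

fn main() {
    // "hi" encoded as: a length word (2), then one data word holding the bytes.
    let words = [2u32, u32::from_le_bytes(*b"hi\0\0")];
    let mut r = WordReader::new(&words);
    assert_eq!(r.read_bytes(), b"hi".to_vec());
}
```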


The new `ceno_serde` crate is credited to
https://github.com/openvm-org/openvm/tree/main/crates/toolchain/openvm/src/serde

## benchmark

Run on 23817600 with K=6, cache trace = none, 1x4090

| Component | Base Time | Init-Across-Shard Time | Custom Serde Time | Init-Across-Shard Improvement | Custom Serde Improvement |
|---|---|---|---|---|---|
| app.prove | 203 s | 175 s | 160 s | ↓ 13.8% | ↓ 21.2% |
| emulator.preflight-execute | 43.9 s | 35.8 s | 36.6 s | ↓ 18.5% | ↓ 16.6% |
| app_prove.inner | 158 s | 139 s | 123 s | ↓ 12.0% | ↓ 22.2% |
| create_proof_of_shard (shard_id = 0) | 14.9 s | 4.91 s | 8.46 s | ↓ 67.0% | ↓ 43.2% |
| create_proof_of_shard (shard_id = 1) | 16.4 s | 12.7 s | 3.74 s | ↓ 22.6% | ↓ 77.2% |

Overall cycles: `322620252` -> `315955342` (↓ 2.06%)

> Custom serde updated setting: max cells changed from `(1 << 30) * 8 / 4 / 2` to `(1 << 30) * 10 / 4 / 2`; keccak blowup K=6 unchanged.