Entry Point
Scheduler
Model Runner
Prepare Inputs
Compute query_start_loc
QKV Projection
Reorder Batch
ROCM_AITER_FA Backend
Classify Request Type
PREFILL PATH
EXTEND PATH
DECODE PATH
Output & Loop
query_start_loc: [0]
Sampling Check:
-
R1
R2
R3
R4
R5
✅ Generation Complete!
"Hallucinations significantly decreased"
Start from:
Iteration 1
Iteration 2
Iteration 3
Iteration 4
Iteration 5
Iteration 6
Iteration 7
▶️ Start
⏸️ Pause
Press spacebar to pause/resume
R1
▼
R2
▼
R3
▼
R4
▼
R5
▼
Time
R1
P
D
D
D
D
D
D
...
R2
P
D
D
D
D
D
...
R3
P
E
E
D
D
...
R4
P
D
D
D
D
...
R5
P
E
D
D
...
Step 1
Step 2
Step 3
Step 4
Step 5
Step 6
Step 7
...
Iteration step: -
R1
R2
R3
R4
R5
Input
-
-
-
-
-
Output
-
-
-
-
-
Starts
-
-
-
-
-
Path
-
-
-
-
-
Path History
-
-
-
-
-
Prefill Token Threshold
100
100
100
100
100
Prompt Token
-
-
-
-
-
Remaining Token
-
-
-
-
-
Scheduled Token
-
-
-
-
-
Computed Token
-
-
-
-
-
Output Token
-
-
-
-
-