TokenRouter: Efficient Serving System for Token-Level LLM Routing
Papers
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
models 16
nics-efc/CoLLM_Qwen3_0_6B
0.8B • Updated • 14
nics-efc/CITER_Qwen3_0_6B_Qwen3_32B
nics-efc/VPR-Tic-Tac-Toe
Text Generation • 4B • Updated • 15
nics-efc/VPR-Sudoku
Text Generation • 4B • Updated • 14
nics-efc/VPR-Minesweeper
Text Generation • 4B • Updated • 16
nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B
Text Generation • 4B • Updated • 5
nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B
Text Generation • 4B • Updated • 40
nics-efc/MARSHAL-Tic-Tac-Toe-Qwen3-4B
Text Generation • 4B • Updated • 22
nics-efc/MARSHAL-Generalist-Qwen3-8B
Text Generation • 8B • Updated • 12
nics-efc/MARSHAL-Generalist-Qwen3-4B
Text Generation • 4B • Updated • 12
datasets 8
nics-efc/R2R_Router_Training_Qwen3-0.6B_Qwen3-30B-A3B
Viewer • Updated • 9.3M • 1.4k
nics-efc/R2R_Router_Training_Qwen3-4B_Qwen3-32B
Viewer • Updated • 18.3M • 1.29k
nics-efc/R2R_Router_Training_Qwen3-1.7B_Qwen3-8B
Viewer • Updated • 21.9M • 818
nics-efc/R2R_Router_Training_Qwen3-0.6B_Qwen3-8B
Viewer • Updated • 22.2M • 511
nics-efc/R2R_query
Viewer • Updated • 2.93k • 56
nics-efc/R2R_Router_Training
Viewer • Updated • 8.19M • 423 • 4
nics-efc/MoA_Long_HumanQA
Viewer • Updated • 3.5k • 111 • 4
nics-efc/MoA_Long_Retrieval
Viewer • Updated • 4.4k • 18 • 4