2026.TA.gemma2_2b_tc8192_decb_l1w0.001_tarbb_lb2.0_ln1_dr10000_lr8e-04_bs4_sl14754660
A sparse transcoder adapter for Gemma 2 2B, trained in bridging mode.
Model Details
Transcoder Configuration
- n_features: 8192
- dec_bias: True
- l1_weight: 0.001
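
To make the three settings above concrete, here is a minimal pure-Python sketch of how they might enter a transcoder's forward pass. It is an illustration under assumptions, not this adapter's implementation: the ReLU encoder, the weight layout, the L1 penalty on feature activations, and the names `TranscoderConfig` and `transcoder_forward` are all hypothetical. Only `n_features`, `dec_bias`, and `l1_weight` come from the configuration.

```python
from dataclasses import dataclass


@dataclass
class TranscoderConfig:
    # Values copied from the configuration list above.
    n_features: int = 8192
    dec_bias: bool = True
    l1_weight: float = 0.001


def transcoder_forward(x, w_enc, b_enc, w_dec, b_dec, cfg):
    """Return (output, feature activations, weighted L1 penalty).

    Shapes (hypothetical layout): w_enc is d_in x n_features,
    w_dec is n_features x d_out, b_enc has n_features entries,
    b_dec has d_out entries.
    """
    # Encoder: ReLU(x @ w_enc + b_enc). The ReLU is an assumption.
    acts = [
        max(0.0, sum(xi * w_enc[i][j] for i, xi in enumerate(x)) + b_enc[j])
        for j in range(len(b_enc))
    ]
    # Decoder: acts @ w_dec, plus a bias only when dec_bias is enabled.
    out = [
        sum(a * w_dec[j][k] for j, a in enumerate(acts))
        + (b_dec[k] if cfg.dec_bias else 0.0)
        for k in range(len(b_dec))
    ]
    # Sparsity term: l1_weight times the sum of the non-negative activations.
    l1_penalty = cfg.l1_weight * sum(acts)
    return out, acts, l1_penalty
```

With `dec_bias: True` the decoder adds a learned offset to every output; the `l1_weight` of 0.001 scales how strongly dense activations would be penalised under this assumed form of the sparsity term.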
Training
- Learning rate: 0.0008
- Batch size: 4
- Epochs: 1
- Warmup ratio: 0.05
- Loss type: kl
- lambda_adapt: 1.0
- lambda_bridge: 2.0
- lambda_nmse: 1
- n_cutoffs: 1
- backbone: target
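
The sketch below shows one plausible reading of the warmup ratio and the three `lambda_*` weights. The card does not state how the loss terms are combined or what schedule follows warmup, so the weighted sum, the linear warmup, the constant rate afterwards, and the function names `lr_at_step` and `total_loss` are assumptions made for illustration. Only the numeric values come from the list above.

```python
# Values copied from the training list above.
LEARNING_RATE = 0.0008
WARMUP_RATIO = 0.05
LAMBDA_ADAPT = 1.0
LAMBDA_BRIDGE = 2.0
LAMBDA_NMSE = 1.0


def lr_at_step(step, total_steps):
    """Learning rate at a 0-indexed optimizer step.

    Assumes linear warmup over the first WARMUP_RATIO of training and a
    constant rate afterwards; the real post-warmup schedule is not stated.
    """
    warmup_steps = max(1, round(WARMUP_RATIO * total_steps))
    if step < warmup_steps:
        return LEARNING_RATE * (step + 1) / warmup_steps
    return LEARNING_RATE


def total_loss(adapt_loss, bridge_loss, nmse_loss):
    """Weighted sum of the three loss terms (assumed combination)."""
    return (
        LAMBDA_ADAPT * adapt_loss
        + LAMBDA_BRIDGE * bridge_loss
        + LAMBDA_NMSE * nmse_loss
    )
```

Under these assumptions, a run of 1000 optimizer steps would warm up over the first 50, and the bridge term would count twice as much as the other two terms in the objective.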
Training Data