arXiv help :)

#1
by BoggersTheFish - opened

Hey everyone,

I'm an independent researcher who just submitted "TensionLM: Sigmoid Tension as Constraint Relaxation for Language Modelling" to arXiv (cs.AI + cs.LG).

  • 117M model with full public code + weights
  • Sigmoid tension attention (replaces softmax) + TS-native auxiliary losses
  • Logic β†’ Language β†’ Math curriculum gives 96Γ— better first-contact math PPL

Would really appreciate an endorsement so the paper can go live.

Endorsement code: RLU86M

Paper + code: https://github.com/BoggersTheFish/bozo
Model: https://huggingface.co/BoggersTheFish/TensionLM-117M-Curriculum

Thanks so much!

Sign up or log in to comment