TensionLM-117M-TS-Reasoner-v5

This is the CPU-only TS graph/program reasoner v5 for the frozen TensionLM-117M-Reasoning-v2 substrate.

v5 adds a separate interpreter engine:

graph_parser: query-selected typed-edge graph following.
arithmetic_parser: operation-log and word-trace arithmetic.
safe_python: bounded Python trace/loop/branch/data-flow semantics.
controller: tries v5 operators first, then legacy v4 fallback.

Eval receipts

All scores are system scores, not raw LLM scores.

System	TAC v2	TAC v3	TAC v4
GPT-2 124M	3/120	0/120	0/120
Base TensionLM 117M	7/120	1/120	2/120
TensionLM-117M-Reasoning-v2	20/120	2/120	0/120
TS-Reasoner-v4	120/120	120/120	0/120
TS-Reasoner-v5	120/120	120/120	120/120

TAC v4 is adversarial to v4: it uses query-selected graph chains, operation logs, and Python traces. The v4 failure ledger is included in eval/.

Usage

python inference.py --prompt "Graph ledger: main(alpha,beta); side(alpha,zeta); main(beta,gamma); main(gamma,delta). Resolve main* from alpha; terminal node:" --category transitivity --show_trace

python inference.py --prompt "Counter starts at 5. Ops: +2; *3; -1. Counter ends as" --category arithmetic --show_trace

python inference.py --prompt "Python loop: total=0; for i in range(5): total += i. total =" --category code_reasoning --show_trace

Limitations

This artifact is narrow and inspectable by design. It is not a chat assistant, not raw LLM improvement, and not proof that dense weights solved TAC v4. The claim is the no-GPU path: frozen language substrate plus explicit TS graph and program operators can move formal-task capability without retraining the model.

Downloads last month: 17

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support