TensionLM-117M-TS-Reasoner-v5

This is the CPU-only TS graph/program reasoner v5 for the frozen TensionLM-117M-Reasoning-v2 substrate.

v5 adds a separate interpreter engine:

  • graph_parser: query-selected typed-edge graph following.
  • arithmetic_parser: operation-log and word-trace arithmetic.
  • safe_python: bounded Python trace/loop/branch/data-flow semantics.
  • controller: tries v5 operators first, then legacy v4 fallback.

Eval receipts

All scores are system scores, not raw LLM scores.

System TAC v2 TAC v3 TAC v4
GPT-2 124M 3/120 0/120 0/120
Base TensionLM 117M 7/120 1/120 2/120
TensionLM-117M-Reasoning-v2 20/120 2/120 0/120
TS-Reasoner-v4 120/120 120/120 0/120
TS-Reasoner-v5 120/120 120/120 120/120

TAC v4 is adversarial to v4: it uses query-selected graph chains, operation logs, and Python traces. The v4 failure ledger is included in eval/.

Usage

python inference.py --prompt "Graph ledger: main(alpha,beta); side(alpha,zeta); main(beta,gamma); main(gamma,delta). Resolve main* from alpha; terminal node:" --category transitivity --show_trace

python inference.py --prompt "Counter starts at 5. Ops: +2; *3; -1. Counter ends as" --category arithmetic --show_trace

python inference.py --prompt "Python loop: total=0; for i in range(5): total += i. total =" --category code_reasoning --show_trace

Limitations

This artifact is narrow and inspectable by design. It is not a chat assistant, not raw LLM improvement, and not proof that dense weights solved TAC v4. The claim is the no-GPU path: frozen language substrate plus explicit TS graph and program operators can move formal-task capability without retraining the model.

Downloads last month
17
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support