TensionLM-117M-TS-Reasoner-v8

This is the explainable-boundary CPU TS reasoner for the frozen TensionLM-117M-Reasoning-v2 substrate.

v8 keeps the v6/v7 solver path and adds richer abstention decisions:

explicit refusal rules such as graph:cycle_detected, graph:ambiguous_branch, arithmetic:division_by_zero, and code:unknown_function,
low-confidence bands for unsupported or partial prompts,
a mixed TAC-GEN distribution that interleaves solvable standard/paraphrase prompts with unsolvable unknown prompts.

The point is still no-GPU reasoning: frozen language substrate plus explicit TS graph/program operators, boundary detection, and inspectable confidence.

Eval receipts

Fixed benchmark scores:

System	TAC v2	TAC v3	TAC v4
TS-Reasoner-v8	120/120	120/120	120/120

Generated benchmark scores:

Distribution	Engine	Score	Solve rate
TAC-GEN paraphrase, seed 9101	v8	3000/3000	100%
TAC-GEN unknown, seeds 9201-9204	v8	12000/12000	0%
TAC-GEN mixed, seeds 9301-9304	v8	12000/12000	65.7%

For the unknown distribution, correctness means returning <ABSTAIN>. In the mixed distribution, standard/paraphrase prompts are solved and unknown prompts are refused in the same pressure field.

These are system scores, not raw LLM scores.

Usage

python inference.py --prompt "Handoff log: a hands off to b; b hands off to c. Ignore the separate handoff x hands off to b. The handoff chain beginning at a ends at" --category transitivity --show_trace

python inference.py --prompt "Graph ledger: main(a,b); main(a,c). Resolve main* from a; terminal node:" --category transitivity --show_trace

python demo_ts_reasoner_v8.py

Limitations

This artifact handles generated formal prompt families covered by the included operators. It is not a chat assistant, not raw model improvement, and not a claim of open-ended natural language understanding. The confidence values are rule-calibrated system signals over these families, not probabilities over all natural language.

Downloads last month: 15

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support