TensionLM-117M-TS-Reasoner-v6

This is the paraphrase-aware CPU TS reasoner for the frozen TensionLM-117M-Reasoning-v2 substrate.

v6 adds paraphrase pressure over v5:

  • graph relations expressed as pipelines, handoffs, primary edges, and audit trails,
  • arithmetic phrased as ledger/recipe/token/quotient/remainder traces,
  • code phrased as augmented assignment, even-loop accumulation, branch, dict mutation, and list pop traces.

The point is still no-GPU reasoning: frozen language substrate plus explicit TS graph/program operators.

Eval receipts

Fixed benchmark scores:

System TAC v2 TAC v3 TAC v4
TS-Reasoner-v6 120/120 120/120 120/120

Generated benchmark scores:

Distribution Engine Score
TAC-GEN standard, seeds 9001-9004 v5 12000/12000
TAC-GEN paraphrase, seed 9101 n=100/category v5 0/300
TAC-GEN paraphrase, seeds 9101-9104 v6 12000/12000

These are system scores, not raw LLM scores.

Usage

python inference.py --prompt "Handoff log: a hands off to b; b hands off to c. Ignore the separate handoff x hands off to b. The handoff chain beginning at a ends at" --category transitivity --show_trace

python inference.py --prompt "Ledger entry: begin 20, pay out 5, add back 3, scale by 2. The ledger closes at" --category arithmetic --show_trace

python inference.py --prompt "Trace Python: total=0; for i in range(8): add i only when i is even. total is" --category code_reasoning --show_trace

Limitations

This artifact handles generated formal prompt families covered by the included operators. It is not a chat assistant, not raw model improvement, and not a claim of open-ended natural language understanding. The next pressure step is unknown grammar detection and robust refusal/fallback.

Downloads last month
16
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support