Safetensors
GGUF
lfm2
llama.cpp
unsloth
conversational

LFM2.5 350M Base

Finetuned on the following datasets:

  • TeichAI/lordx64-claude-opus-4.7-max-cleaned
  • Jackrong/DeepSeek-V4-Distill-8000x

Training Settings

  • Trained with the ChatML chat format
  • Effective batch size: 16
  • Training epochs: 2

LoRA Settings

  • 16-bit LoRA
  • r = 16
  • lora_alpha = 32

Warning

This model sometimes gets stuck in thinking loops, mostly on very simple prompts.


Sample Output

Prompt:
Is 9.11 bigger than 9.9?

We need to determine if 9.11 is bigger than 9.9. The question asks "Is 9.11 bigger than 9.9?" This is a simple comparison. We can check if 9.11 is greater than 9.9 by comparing the values. Since 9.11 is less than 9.9, the answer is no. So the answer is "No".

Answer: No.

Explanation:

9.11 is less than 9.9. Therefore, 9.11 is not greater than 9.9.

LFM2.5-350M-THINKING : GGUF

This model was finetuned and converted to GGUF format using Unsloth.

Example usage:

  • For text only LLMs: llama-cli -hf finnianx/LFM2.5-350M-THINKING --jinja
  • For multimodal models: llama-mtmd-cli -hf finnianx/LFM2.5-350M-THINKING --jinja

Available Model files:

  • LFM2.5-350M-THINKING.Q8_0.gguf
  • LFM2.5-350M-THINKING.F16.gguf
  • LFM2.5-350M-THINKING.Q4_K_M.gguf This was trained 2x faster with Unsloth
Downloads last month
308
Safetensors
Model size
0.4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for finnianx/LFM2.5-350M-THINKING

Quantized
(1)
this model
Quantizations
1 model

Datasets used to train finnianx/LFM2.5-350M-THINKING