Safetensors
GGUF
lfm2
llama.cpp
unsloth
conversational
How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="finnianx/LFM2.5-350M-THINKING",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

LFM2.5 350M Base

Finetuned on the following datasets:

  • TeichAI/lordx64-claude-opus-4.7-max-cleaned
  • Jackrong/DeepSeek-V4-Distill-8000x

Training Settings

  • Trained with the ChatML chat format
  • Effective batch size: 16
  • Training epochs: 2

LoRA Settings

  • 16-bit LoRA
  • r = 16
  • lora_alpha = 32

Warning

This model sometimes gets stuck in thinking loops, mostly on very simple prompts.


Sample Output

Prompt:
Is 9.11 bigger than 9.9?

We need to determine if 9.11 is bigger than 9.9. The question asks "Is 9.11 bigger than 9.9?" This is a simple comparison. We can check if 9.11 is greater than 9.9 by comparing the values. Since 9.11 is less than 9.9, the answer is no. So the answer is "No".

Answer: No.

Explanation:

9.11 is less than 9.9. Therefore, 9.11 is not greater than 9.9.

LFM2.5-350M-THINKING : GGUF

This model was finetuned and converted to GGUF format using Unsloth.

Example usage:

  • For text only LLMs: llama-cli -hf finnianx/LFM2.5-350M-THINKING --jinja
  • For multimodal models: llama-mtmd-cli -hf finnianx/LFM2.5-350M-THINKING --jinja

Available Model files:

  • LFM2.5-350M-THINKING.Q8_0.gguf
  • LFM2.5-350M-THINKING.F16.gguf
  • LFM2.5-350M-THINKING.Q4_K_M.gguf This was trained 2x faster with Unsloth
Downloads last month
344
Safetensors
Model size
0.4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for finnianx/LFM2.5-350M-THINKING

Quantized
(1)
this model
Quantizations
1 model

Datasets used to train finnianx/LFM2.5-350M-THINKING