Update README.md
README.md
CHANGED
@@ -6,3 +6,4 @@ base_model:
pipeline_tag: text-generation
---
This model was fine‑tuned with GRPO for only 50 steps using 4 samples per step. The result is exceptionally high accuracy on JEE‑level mathematics problems, though its broader context handling and instruction‑following abilities were diminished. In essence, it has become a compact powerhouse — a “mini‑tank” built for raw mathematical problem‑solving rather than nuanced reasoning.
+
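
The README text says the model was tuned with GRPO using 4 samples per step. As a rough illustration of what "group-relative" means there, the sketch below scores a group of 4 sampled answers against their own group mean. The function name, the reward values, and the choice of population standard deviation with a small `eps` are assumptions for illustration, not details taken from this repository's training setup.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group.

    Hypothetical helper: subtracts the group mean and divides by the
    group's population standard deviation (plus eps to avoid dividing
    by zero when every sample gets the same reward).
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for 4 sampled answers to one problem (1.0 = correct).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)

# Correct answers end up with positive advantages, incorrect ones negative.
for r, a in zip(rewards, advantages):
    print(f"reward={r:.1f} advantage={a:+.3f}")
```

With only 4 samples per group, a group in which every answer earns the same reward yields all-zero advantages under this normalization, so such a group provides no preference signal for that step.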