QizhiPei commited on
Commit
54f4029
·
verified ·
1 Parent(s): bee8598

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -14,6 +14,10 @@ model-index:
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
 
 
 
 
17
  # DiffScale-7B
18
 
19
  This model is a fine-tuned version of [QizhiPei/Qwen2.5-Math-7B-Instruct-RoPE-300k](https://huggingface.co/QizhiPei/Qwen2.5-Math-7B-Instruct-RoPE-300k) on the DiffScale-Math dataset.
@@ -58,3 +62,4 @@ The following hyperparameters were used during training:
58
  - Pytorch 2.4.0+cu121
59
  - Datasets 3.1.0
60
  - Tokenizers 0.20.3
 
 
14
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
  should probably proofread and complete it, then remove this comment. -->
16
 
17
+ Paper: [ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning](https://arxiv.org/abs/2509.21070)
18
+
19
+ Code: https://github.com/QizhiPei/ScaleDiff
20
+
21
  # DiffScale-7B
22
 
23
  This model is a fine-tuned version of [QizhiPei/Qwen2.5-Math-7B-Instruct-RoPE-300k](https://huggingface.co/QizhiPei/Qwen2.5-Math-7B-Instruct-RoPE-300k) on the DiffScale-Math dataset.
 
62
  - Pytorch 2.4.0+cu121
63
  - Datasets 3.1.0
64
  - Tokenizers 0.20.3
65
+