Ctrl+K

1 contributor

History: 8 commits

NotoriousH2

Update README with detailed data pipeline and reproduction steps

2d3e79d verified 2 months ago

.gitattributes

1.57 kB
SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
README.md

2.76 kB
Update README with detailed data pipeline and reproduction steps 2 months ago
added_tokens.json

35 Bytes
SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
chat_template.jinja

1.53 kB
SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
config.json

1.6 kB
SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
eval.py

3.47 kB
Add eval.py 2 months ago
generation_config.json

217 Bytes
SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
pytorch_model.bin
Detected Pickle imports (3)
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2",
- "torch.BFloat16Storage"
What is a pickle import?
2 GB
xet

SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
rs_sample.py

3.72 kB
Add rs_sample.py 2 months ago
special_tokens_map.json

548 Bytes
SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
tokenizer.json

33.4 MB
xet

SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
tokenizer.model

4.69 MB
xet

SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
tokenizer_config.json

1.16 MB
SFT + Rejection Sampling SFT (5x teacher replay). GSM8K avg ~46.6%, best 48.9% 2 months ago
train_rs_sft.py

6.41 kB
Add train_rs_sft.py 2 months ago
train_sft.py

3.32 kB
Add train_sft.py 2 months ago

Detected Pickle imports (3)