b-mc2/sql-create-context
How to use nakcnx/typhoon-sql-qlora with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("scb10x/typhoon-7b")
model = PeftModel.from_pretrained(base_model, "nakcnx/typhoon-sql-qlora")

Typhoon-7b QLoRA fine-tune, trained with Unsloth on the SQL Context dataset.
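Once the adapter is loaded, generating SQL could look roughly like the sketch below. The prompt template in `build_prompt` is hypothetical: this card does not state the template used during fine-tuning, so adjust it to match the training format. The helper names `build_prompt`, `generate_sql`, and `demo` are illustrative, and loading the tokenizer from `scb10x/typhoon-7b` is an assumption.

```python
def build_prompt(question, context):
    # Hypothetical template: the card does not document the training prompt,
    # so this layout (schema, then question, then answer slot) is an assumption.
    return (
        "### Context:\n"
        f"{context}\n\n"
        "### Question:\n"
        f"{question}\n\n"
        "### SQL:\n"
    )


def generate_sql(model, tokenizer, question, context, max_new_tokens=128):
    """Generate a SQL query for `question` given a CREATE TABLE `context`."""
    prompt = build_prompt(question, context)
    # Place the inputs on the same device as the model weights.
    device = next(model.parameters()).device
    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Drop the prompt tokens and decode only the newly generated ones.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()


def demo():
    # Not called automatically: running this downloads the model weights.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the base model's tokenizer is the right one for the adapter.
    tokenizer = AutoTokenizer.from_pretrained("scb10x/typhoon-7b")
    base_model = AutoModelForCausalLM.from_pretrained("scb10x/typhoon-7b")
    model = PeftModel.from_pretrained(base_model, "nakcnx/typhoon-sql-qlora")
    print(generate_sql(
        model,
        tokenizer,
        question="How many heads of departments are older than 56?",
        context="CREATE TABLE head (age INTEGER)",
    ))
```

Calling `demo()` runs the full pipeline. The helpers are kept separate so the prompt format can be changed without touching the generation code.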
Batch Size: 48 (per-device batch size 4 x gradient accumulation steps 4 x 3 GPUs)
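The effective batch size is the product of the three factors listed. A minimal sketch of that arithmetic follows; the function and parameter names are illustrative and do not come from the training script.

```python
def effective_batch_size(per_device_batch_size, gradient_accumulation_steps, num_gpus):
    # Each optimizer step accumulates gradients over several micro-batches
    # on every GPU, so the three factors multiply.
    return per_device_batch_size * gradient_accumulation_steps * num_gpus


# Values reported in this card: 4 (BS) x 4 (GAS) x 3 (GPU).
total = effective_batch_size(
    per_device_batch_size=4,
    gradient_accumulation_steps=4,
    num_gpus=3,
)
print(total)  # 48
```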
The following bitsandbytes quantization config was used during training:
Step              Training Loss   Eval Loss
1550 (Epoch: 1)   0.4295          0.4367
3110 (Epoch: 2)   0.4057          0.4217
Base model
typhoon-ai/typhoon-7b