Model Card for Model ID

This LoRA model is a product of the research paper "LLM QLoRA Fine-Tuning of Llama, DeepSeek, and Qwen: A Skyrim Case Study" (to appear in IEEE Access). The study investigates the interplay between model scale (~8B, ~13B, ~33B), architecture, and data formatting in adapting Large Language Models to knowledge-intensive domains.

By utilizing a multi-stage data cycling strategy and 4-bit Quantized Low-Rank Adaptation (QLoRA), this model was fine-tuned to master the lore of The Elder Scrolls V: Skyrim. The training process involved rigorous hyperparameter optimization, comparing unstructured, structured, and summarized datasets across varying LoRA ranks (16, 32, 64) to determine the optimal configuration for factual recall and narrative fluency. The resulting model demonstrates the capability to generate high-fidelity, hallucination-resistant character biographies and attribute lists, as validated by a comprehensive ensemble LLM-as-a-Judge evaluation framework.

Model Details

Model Description

Skyrim Lore - DeepSeek-Coder-V2-16B
Description: The top-performing model of the entire study. Despite being a "Coder" model, it achieved the highest Overall Score (3.16) when fine-tuned on the Summarized dataset with a LoRA Rank of 64. Its logic-heavy pre-training allowed it to disentangle dense narrative facts with superior accuracy.
Best Configuration: Summarized Dataset | Rank 64
Base Model: unsloth/DeepSeek-Coder-V2-Lite-Instruct
Note: You can also use the pre-quantized version unsloth/DeepSeek-Coder-V2-Lite-Instruct-bnb-4bit.

Framework versions

PEFT 0.15.2

Downloads last month: -

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MarcosEdu/Skyrim_D-Coder-V2-16B

Base model

deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct

Adapter

(5)

this model

Collection including MarcosEdu/Skyrim_D-Coder-V2-16B

Skyrim Lore QLoRA Collection

Collection

LoRA models from "LLM QLoRA Fine-Tuning of Llama, DeepSeek, and Qwen: A Skyrim Case Study," trained on the Mantella dataset for Skyrim. • 9 items • Updated Jan 27 • 1