Pashto BERT
This is a monolingual Pashto BERT (BERT-Base) model trained on a large Pashto corpus. As an encoder-only model, it produces contextual representations of Pashto text rather than generating text, which makes it suitable for fine-tuning on downstream Natural Language Processing (NLP) tasks.
Special tokens: [CLS], [SEP], [PAD], [MASK], [UNK]
This model can be fine-tuned for various Pashto-specific NLP tasks.
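The special tokens listed above follow the usual BERT vocabulary. As a rough illustration, the sketch below shows how three of them conventionally frame an input sequence. It assumes the standard BERT layout ([CLS] at the start, [SEP] at the end, [PAD] up to a fixed length); the helper name `frame_sequence` and the word pieces are hypothetical, and the real tokenizer may segment the sentence differently.

```python
# Assumed convention (standard BERT): [CLS] tokens [SEP], then [PAD] up to max_len.
CLS, SEP, PAD = "[CLS]", "[SEP]", "[PAD]"


def frame_sequence(word_pieces, max_len):
    """Wrap word pieces in [CLS]/[SEP], pad to max_len, and build the attention mask."""
    body = word_pieces[: max_len - 2]  # leave room for [CLS] and [SEP]
    tokens = [CLS] + body + [SEP]
    attention_mask = [1] * len(tokens)  # 1 marks real tokens
    pad_count = max_len - len(tokens)
    tokens += [PAD] * pad_count
    attention_mask += [0] * pad_count  # 0 marks padding the model should ignore
    return tokens, attention_mask


# Hypothetical word pieces; the real tokenizer's segmentation may differ.
tokens, attention_mask = frame_sequence(["ستاسو", "ورځ", "څنګه", "وه"], max_len=8)
```

In practice the tokenizer does this framing itself; the sketch only makes the role of each token visible.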
This model can be loaded using the transformers library from Hugging Face:
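from transformers import AutoModel, AutoTokenizer

# Replace with the actual Hub repository ID of this model
model_name = "your-huggingface-username/pashto-bert-base"
# Load the tokenizer and the model from the same checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name)

# Example sentence: "How was your day today?"
text = "ستاسو نننۍ ورځ څنګه وه؟"
tokens = tokenizer(text, return_tensors="pt")
out = model(**tokens)  # out.last_hidden_state holds the contextual token embeddings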
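The model returns one vector per token. One common way to turn these into a single sentence vector is masked mean pooling, sketched below. This is an illustrative choice, not something the model card prescribes; the helper name `mean_pool` is hypothetical, and the toy tensors stand in for `out.last_hidden_state` and `tokens["attention_mask"]`.

```python
import torch


def mean_pool(last_hidden_state: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    """Average token vectors per sequence, ignoring padded positions."""
    # (batch, seq_len) -> (batch, seq_len, 1) so the mask broadcasts over the hidden dimension
    mask = attention_mask.unsqueeze(-1).to(last_hidden_state.dtype)
    summed = (last_hidden_state * mask).sum(dim=1)
    counts = mask.sum(dim=1).clamp(min=1e-9)  # guard against division by zero
    return summed / counts


# Toy tensors: one sequence, three positions (the last one is padding), hidden size 2
hidden = torch.tensor([[[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]])
mask = torch.tensor([[1, 1, 0]])
sentence_vec = mean_pool(hidden, mask)
```

With the loading snippet above, the same helper could be called as `mean_pool(out.last_hidden_state, tokens["attention_mask"])`, which would yield one vector per input sentence.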
Learning rate scheduler: linear_schedule_with_warmup
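Assuming this name refers to the usual linear warmup followed by linear decay (the behavior of `get_linear_schedule_with_warmup` in the transformers library), the learning rate multiplier can be sketched as follows. The function name `linear_warmup_multiplier`, the base learning rate, and the step counts are hypothetical; the model card does not state the values actually used in training.

```python
def linear_warmup_multiplier(step: int, warmup_steps: int, total_steps: int) -> float:
    """Multiplier applied to the base learning rate at a given optimizer step."""
    if step < warmup_steps:
        # Warmup phase: ramp linearly from 0 up to 1
        return step / max(1, warmup_steps)
    # Decay phase: fall linearly from 1 to 0 at total_steps, never below 0
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


base_lr = 5e-5  # hypothetical value, not taken from the model card
steps = (0, 50, 100, 550, 1000)
multipliers = [linear_warmup_multiplier(s, warmup_steps=100, total_steps=1000) for s in steps]
learning_rates = [base_lr * m for m in multipliers]
```

The multiplier rises from 0 to 1 over the warmup steps and then decreases linearly, reaching 0 at the final training step.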