mom-multilingual-class
Collection
long context models for MoM multilingual classifier (domain, jailbreak, pii, factual, feedback) • 12 items • Updated
How to use llm-semantic-router/mmbert32k-factcheck-classifier-merged with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="llm-semantic-router/mmbert32k-factcheck-classifier-merged") # Load model directly
from transformers import AutoTokenizer, AutoModelForSequenceClassification
tokenizer = AutoTokenizer.from_pretrained("llm-semantic-router/mmbert32k-factcheck-classifier-merged")
model = AutoModelForSequenceClassification.from_pretrained("llm-semantic-router/mmbert32k-factcheck-classifier-merged")This is the merged version of the mmBERT-32K factcheck classifier model, ready for direct inference without PEFT.
from transformers import AutoModelForSequenceClassification, AutoTokenizer
model = AutoModelForSequenceClassification.from_pretrained(
"llm-semantic-router/mmbert32k-factcheck-classifier-merged",
trust_remote_code=True
)
tokenizer = AutoTokenizer.from_pretrained("llm-semantic-router/mmbert32k-factcheck-classifier-merged")
# Inference
inputs = tokenizer("Your text here", return_tensors="pt", truncation=True, max_length=32768)
outputs = model(**inputs)
Apache 2.0
Base model
jhu-clsp/mmBERT-base