ratishsp/fineweb-edu-misinfo
Viewer • Updated • 200k • 221
A ModernBERT-base classifier trained to detect misinformation in web text, specifically content that passes educational quality filters despite being misleading or harmful. Trained on 200K documents from FineWeb-Edu annotated by Llama 4 Maverick (meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8).
This repo contains two models:
binary/)
Classifies documents as misinfo or benign.
| Precision | Recall | F1 | Support | |
|---|---|---|---|---|
| misinfo | 0.83 | 0.89 | 0.86 | 3,885 |
| benign | 0.97 | 0.95 | 0.96 | 15,663 |
| accuracy | 0.94 | 19,548 |
multiclass/)
Classifies documents into 5 misinformation categories + benign.
| Precision | Recall | F1 | Support | |
|---|---|---|---|---|
| climate_denial | 0.79 | 0.91 | 0.84 | 539 |
| health_misinfo | 0.78 | 0.90 | 0.83 | 1,014 |
| pseudoscience | 0.82 | 0.86 | 0.84 | 1,618 |
| hate_extremism | 0.65 | 0.70 | 0.67 | 226 |
| conspiracy_propaganda | 0.55 | 0.74 | 0.63 | 488 |
| benign | 0.97 | 0.94 | 0.96 | 15,663 |
| accuracy | 0.92 | 19,548 |
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
# Binary model
tokenizer = AutoTokenizer.from_pretrained("ratishsp/fineweb-edu-misinfo-classifier", subfolder="binary")
model = AutoModelForSequenceClassification.from_pretrained("ratishsp/fineweb-edu-misinfo-classifier", subfolder="binary")
text = "Your document text here..."
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=8192)
with torch.no_grad():
logits = model(**inputs).logits
prediction = torch.argmax(logits, dim=-1).item()
label = model.config.id2label[prediction]
print(label) # "misinfo" or "benign"
@misc{puduppully2026fineweb-edu-misinfo,
author = {Puduppully, Ratish},
title = {FineWeb-Edu Misinformation Classifier},
year = {2026},
publisher = {HuggingFace},
url = {https://huggingface.co/ratishsp/fineweb-edu-misinfo-classifier}
}
Base model
answerdotai/ModernBERT-base