Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Pasquale Minervini's picture
26 23 6

Pasquale Minervini

pminervini
zhaoyuhitsz's profile picture alessiodevoto's profile picture girishgupta's profile picture
·
https://www.neuralnoise.com
  • pminervini
  • pminervini
  • pasquale-minervini-phd-47a08324
  • neuralnoise.com

AI & ML interests

NLP, ML, AI

Organizations

BigScience Workshop's profile picture NLP @ University of Edinburgh's profile picture ChatArena's profile picture EdinburghNLP - Natural Language Processing Group at the University of Edinburgh's profile picture Open Life Science AI's profile picture Ping Nie's profile picture hallucinations-leaderboard's profile picture Miniml's profile picture LLMAccountability's profile picture Edinburgh Dataset Analytics Working Group's profile picture OpenBox's profile picture Poster Summarization's profile picture LEMUR Decoding's profile picture DateTimeReasoning's profile picture Inverse Scaling's profile picture ML intern explorers's profile picture

Articles 2

Article
198

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Article
38

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

View all Articles

Papers 39

arxiv:2603.10225
arxiv:2602.06948
arxiv:2511.00602
arxiv:2509.21552

models 1

pminervini/outputs

Updated Apr 6, 2024

datasets 8

pminervini/VQAv2

Viewer • Updated Jul 1, 2024 • 1.21M • 92

pminervini/NQ-Swap

Viewer • Updated Mar 1, 2024 • 4.75k • 723 • 1

pminervini/hl-fever

Viewer • Updated Jan 23, 2024 • 146k • 315

pminervini/shroom

Viewer • Updated Jan 7, 2024 • 499 • 121

pminervini/true-false

Viewer • Updated Dec 27, 2023 • 12.4k • 363 • 8

pminervini/averitec

Viewer • Updated Dec 26, 2023 • 3.57k • 528 • 1

pminervini/inverse-scaling

Viewer • Updated Dec 13, 2023 • 37.5k • 419 • 1

pminervini/HaluEval

Viewer • Updated Dec 7, 2023 • 64.5k • 7.24k • 28
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs