Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks Paper • 2204.07705 • Published Apr 16, 2022
Representation Learning for Conversational Data using Discourse Mutual Information Maximization Paper • 2112.05787 • Published Dec 4, 2021
PBEBench: A Multi-Step Programming by Examples Reasoning Benchmark inspired by Historical Linguistics Paper • 2505.23126 • Published May 29, 2025
PragWorld: A Benchmark Evaluating LLMs' Local World Model under Minimal Linguistic Alterations and Conversational Dynamics Paper • 2511.13021 • Published Nov 17, 2025
ReaComp: Compiling LLM Reasoning into Symbolic Solvers for Efficient Program Synthesis Paper • 2605.05485 • Published 17 days ago