HAICOSYSTEM: An Ecosystem for Sandboxing Safety Risks in Human-AI Interactions Paper • 2409.16427 • Published Sep 24, 2024 • 1
SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs Paper • 2410.13648 • Published Oct 17, 2024
Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning Paper • 2504.04383 • Published Apr 6, 2025
Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale Paper • 2511.05705 • Published Nov 7, 2025 • 10
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published Jan 30 • 110
Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch Paper • 2602.03183 • Published Feb 3 • 11
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models Paper • 2502.11881 • Published Feb 17, 2025
Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs Paper • 2403.05020 • Published Mar 8, 2024 • 2
Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs Paper • 2403.04801 • Published Mar 5, 2024
Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models Paper • 2407.06004 • Published Jul 8, 2024
Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models Paper • 2402.03284 • Published Feb 5, 2024
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks Paper • 1811.00783 • Published Nov 2, 2018
ProsocialDialog: A Prosocial Backbone for Conversational Agents Paper • 2205.12688 • Published May 25, 2022
SODA: Million-scale Dialogue Distillation with Social Commonsense Contextualization Paper • 2212.10465 • Published Dec 20, 2022 • 2
Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes Paper • 2109.08828 • Published Sep 18, 2021
Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness Paper • 2004.05816 • Published Apr 13, 2020
FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions Paper • 2310.15421 • Published Oct 24, 2023