World Model for Robot Learning: A Comprehensive Survey Paper • 2605.00080 • Published 15 days ago • 15
When Do Diffusion Models learn to Generate Multiple Objects? Paper • 2605.00273 • Published 15 days ago • 9
Map2World: Segment Map Conditioned Text to 3D World Generation Paper • 2605.00781 • Published 14 days ago • 25
When Do Diffusion Models learn to Generate Multiple Objects? Paper • 2605.00273 • Published 15 days ago • 9
Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models Paper • 2602.24264 • Published Feb 27 • 14
Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models Paper • 2602.24264 • Published Feb 27 • 14
Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models Paper • 2602.24264 • Published Feb 27 • 14
Enhancing Multi-Image Understanding through Delimiter Token Scaling Paper • 2602.01984 • Published Feb 2 • 5
DISCO: Diversifying Sample Condensation for Efficient Model Evaluation Paper • 2510.07959 • Published Oct 9, 2025 • 15