Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

UMD Tech+Research 23

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

vatsalag  authored a paper about 1 month ago
Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory
vatsalag  authored a paper about 1 month ago
MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos
vatsalag  authored a paper 9 months ago
Do text-free diffusion models learn discriminative visual representations?
View all activity

Vatsal Agarwal's profile picture

vatsalag 
authored 2 papers about 1 month ago

Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory

Paper • 2602.18434 • Published Feb 20

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published Mar 14 • 14
vatsalag 
authored 4 papers 9 months ago

Do text-free diffusion models learn discriminative visual representations?

Paper • 2311.17921 • Published Nov 29, 2023 • 1

Diffusion Models Beat GANs on Image Classification

Paper • 2307.08702 • Published Jul 17, 2023 • 19

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Paper • 2409.06703 • Published Sep 10, 2024 • 3

Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor

Paper • 2507.07106 • Published Jul 9, 2025 • 2
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs