UMD Tech+Research 23

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

vatsalag authored a paper about 1 month ago

Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory

vatsalag authored a paper about 1 month ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

vatsalag authored a paper 9 months ago

Do text-free diffusion models learn discriminative visual representations?

View all activity

authored 2 papers about 1 month ago

Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory

Paper • 2602.18434 • Published Feb 20

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published Mar 14 • 14

authored 4 papers 9 months ago

Do text-free diffusion models learn discriminative visual representations?

Paper • 2311.17921 • Published Nov 29, 2023 • 1

Diffusion Models Beat GANs on Image Classification

Paper • 2307.08702 • Published Jul 17, 2023 • 19

LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

Paper • 2409.06703 • Published Sep 10, 2024 • 3

Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor

Paper • 2507.07106 • Published Jul 9, 2025 • 2