Track points in a video
https://huggingface.co/papers/2501.03006
Generate edited video frames using text prompts