Training Pipeline for Vision Transformers

by MaxBolD - opened Jan 15, 2025

Jan 15, 2025

Hi @KennethTM
I'm working on a similar task with Vision Transformers but I'm struggling to set up a training pipeline.
Would you mind sharing yours? It would help me a lot to get started! Thanks in advance!

KennethTM

Owner Jan 15, 2025

I based my training on these notebooks:
https://github.com/NielsRogge/Transformers-Tutorials/blob/master/Pix2Struct/Fine_tune_Pix2Struct_on_key_value_pair_dataset_(PyTorch_Lightning).ipynb

https://github.com/huggingface/notebooks/blob/main/examples/image_captioning_pix2struct.ipynb

Hope that helps.

Kenneth

MaxBolD changed discussion status to closed Jan 15, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment