Pretrained models for the paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"
Jinrui Zhang
zjr2000
AI & ML interests
None yet
Recent Activity
upvoted an article 2 days ago
NEO-unify: Building Native Multimodal Unified Models End to End
new activity 5 days ago
zjr2000/SPES-2B: Add library_name and improve model card metadata
new activity 5 days ago
zjr2000/SPES-9B: Link model to paper and improve model card
Organizations
None yet