Technical guide

Optimizer State Offloading to CPU and NVMe

A memory-saving technique that parks heavy training tensors (optimizer states, gradients, and sometimes weights) in CPU RAM or on NVMe SSDs instead of in scarce GPU memory.

Summary

Optimizer state offloading moves heavy training tensors (optimizer states, gradients, and sometimes weights) out of scarce GPU memory and into CPU RAM or onto NVMe SSDs. This lets practitioners train models larger than their GPU memory alone would allow.

Optimizer State Offloading to CPU and NVMe is a technical building block that brings trade-offs among model quality, infrastructure cost, latency, and reliability at scale.

Deep dive

When you train a neural network with an optimizer such as Adam, every parameter carries extra baggage: two running statistics (momentum and variance), plus a full-precision copy of the weight, plus the gradient. In mixed-precision training this can add up to about 16 bytes per parameter, dwarfing the 2 bytes of the weight itself. Offloading moves that baggage off the GPU. CPU offload moves optimizer states into the more plentiful system RAM over the PCIe bus, while NVMe offload pushes them further out to fast solid-state drives. Popularized by DeepSpeed's ZeRO-Offload and ZeRO-Infinity, the technique trades some speed for capacity, letting a single GPU or a small cluster fine-tune models with billions of parameters.
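The 16-bytes-per-parameter figure can be reproduced with a little arithmetic. The sketch below assumes the common mixed-precision Adam layout (fp16 weight and gradient, fp32 master weight, momentum, and variance); the names `BYTES_PER_PARAM`, `OPTIMIZER_STATE`, and `footprint_gib` are hypothetical, and activations, buffers, and fragmentation are deliberately ignored.

```python
GIB = 1024 ** 3

# Assumed per-parameter layout for mixed-precision Adam (bytes).
BYTES_PER_PARAM = {
    "fp16_weight": 2,
    "fp16_gradient": 2,
    "fp32_master_weight": 4,
    "fp32_momentum": 4,
    "fp32_variance": 4,
}

# The tensors that optimizer-state offloading moves off the GPU.
OPTIMIZER_STATE = ("fp32_master_weight", "fp32_momentum", "fp32_variance")


def footprint_gib(n_params, offloaded=()):
    """Return (gpu_gib, host_gib) of training state, excluding activations."""
    gpu_bytes = sum(b for name, b in BYTES_PER_PARAM.items() if name not in offloaded)
    host_bytes = sum(b for name, b in BYTES_PER_PARAM.items() if name in offloaded)
    return n_params * gpu_bytes / GIB, n_params * host_bytes / GIB


if __name__ == "__main__":
    n = 1_000_000_000  # a 1-billion-parameter model, chosen only for illustration
    print("no offload        (gpu, host) GiB:", footprint_gib(n))
    print("optimizer offload (gpu, host) GiB:", footprint_gib(n, OPTIMIZER_STATE))
```

Under these assumptions the per-parameter total is 2 + 2 + 4 + 4 + 4 = 16 bytes, and offloading the three fp32 tensors leaves 4 bytes per parameter on the GPU.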

Technical insight

The key is to overlap data movement with compute. Optimizer state lives on CPU or NVMe; during the backward pass, shards are prefetched over PCIe before they are needed, and the optimizer step itself often runs on the CPU. ZeRO-Offload keeps the float32 master weights and the Adam moments on the CPU, so only the forward and backward math stays on the GPU. NVMe adds a further, larger tier, so that terabyte-scale states spill to disk while the hot shards stay in RAM.
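A minimal, standard-library-only sketch of the overlap idea, under loud assumptions: small files in a temporary directory stand in for the NVMe tier, a one-worker thread pool stands in for asynchronous prefetch, and the Adam update runs in plain Python on the CPU. All helper names (`shard_path`, `save_shard`, `load_shard`, `adam_update`, `offloaded_step`, `demo`) are hypothetical. This is not how DeepSpeed implements offloading, and there is no GPU, PCIe transfer, or fp16 copy involved.

```python
import math
import os
import tempfile
from array import array
from concurrent.futures import ThreadPoolExecutor


def shard_path(state_dir, name, k):
    return os.path.join(state_dir, f"{name}{k}.bin")


def save_shard(path, values):
    """Write one shard of float64 values to disk (stand-in for the NVMe tier)."""
    with open(path, "wb") as f:
        array("d", values).tofile(f)


def load_shard(path, n):
    """Read one shard of n float64 values back into host memory."""
    buf = array("d")
    with open(path, "rb") as f:
        buf.fromfile(f, n)
    return buf


def adam_update(w, m, v, g, step, lr=1e-3, b1=0.9, b2=0.999, eps=1e-8):
    """Textbook Adam on one shard, run on the CPU; updates w, m, v in place."""
    c1 = 1.0 - b1 ** step  # bias correction for the first moment
    c2 = 1.0 - b2 ** step  # bias correction for the second moment
    for i, gi in enumerate(g):
        m[i] = b1 * m[i] + (1.0 - b1) * gi
        v[i] = b2 * v[i] + (1.0 - b2) * gi * gi
        w[i] -= lr * (m[i] / c1) / (math.sqrt(v[i] / c2) + eps)


def offloaded_step(state_dir, grads_by_shard, shard_len, step):
    """One optimizer step over all shards; shard k+1 is read while shard k updates."""
    def fetch(k):
        return tuple(load_shard(shard_path(state_dir, name, k), shard_len)
                     for name in ("w", "m", "v"))

    n_shards = len(grads_by_shard)
    with ThreadPoolExecutor(max_workers=1) as pool:
        pending = pool.submit(fetch, 0)
        for k in range(n_shards):
            w, m, v = pending.result()               # wait for the prefetched shard
            if k + 1 < n_shards:
                pending = pool.submit(fetch, k + 1)  # start the next read early
            adam_update(w, m, v, grads_by_shard[k], step)
            for name, buf in (("w", w), ("m", m), ("v", v)):
                save_shard(shard_path(state_dir, name, k), buf)


def demo():
    """Run one step on three tiny shards and return the updated weights."""
    shard_len, n_shards = 4, 3
    with tempfile.TemporaryDirectory() as d:
        for k in range(n_shards):
            save_shard(shard_path(d, "w", k), [1.0] * shard_len)
            save_shard(shard_path(d, "m", k), [0.0] * shard_len)
            save_shard(shard_path(d, "v", k), [0.0] * shard_len)
        grads = [[0.5] * shard_len for _ in range(n_shards)]
        offloaded_step(d, grads, shard_len, step=1)
        return [list(load_shard(shard_path(d, "w", k), shard_len))
                for k in range(n_shards)]


if __name__ == "__main__":
    print(demo())
```

The design point is the ordering inside the loop: the read for the next shard is submitted before the current update starts, so I/O and arithmetic can proceed at the same time. A real system would apply the same pattern to pinned host buffers and asynchronous disk I/O.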

Understanding Optimizer State Offloading to CPU and NVMe

To build deep understanding, treat Optimizer State Offloading to CPU and NVMe as a class of work rather than a single capability: spell out the benefits you want, state your assumptions, and separate what the system can do reliably from what still needs expert judgment.

In practice, strong teams using Optimizer State Offloading to CPU and NVMe balance architecture, data, and infrastructure choices against reliability and cost. They write clear success criteria, measure them on real data and real workloads, and iterate on the failures they observe rather than on one-off benchmark wins. This is where theoretical knowledge turns into lasting capability in products, policies, and operations.

Architecture decisions shape benefits and operating costs for years afterward. At the same time, optimizing for a single benchmark can hide broader weaknesses in the system. The most robust approach combines fast experimentation with governance discipline: run pilots, capture evidence, document decisions, and keep updating safeguards as operating conditions, user expectations, and regulations evolve.

Strategic implications

Architecture decisions shape benefits and operating costs for years afterward.

Technical literacy helps teams choose what fits best rather than defaulting to whatever is newest.

Better engineering choices reduce reliability problems in operation.

In high-quality deployments, these points translate into measurable operating rules, clear ownership boundaries, and frequent review checkpoints, so that teams can build confidence as complexity grows.

The future of Optimizer State Offloading to CPU and NVMe

As models keep outgrowing GPU memory, aggressive offloading is becoming standard rather than the exception. Expect tighter integration with faster interconnects such as NVLink-C2C and CXL pooled memory, which blur the CPU-GPU boundary, along with smarter schedulers that predict which states to prefetch. Unified-memory architectures such as Grace Hopper reduce the PCIe penalty, and frameworks are pushing to make aggressive offload almost transparent, so that hobbyists can fine-tune large models on modest hardware.

Real-world applications

Fine-tuning a 13-billion-parameter LLM on a single 24 GB GPU, using DeepSpeed ZeRO-Offload to push Adam states into CPU RAM.

A small research lab training a model with a very large parameter count on a handful of GPUs by spilling optimizer states to NVMe drives with ZeRO-Infinity.

Hugging Face Accelerate configs that enable CPU offload so users can run full fine-tuning jobs that would otherwise fail with out-of-memory errors.

Cost-conscious startups renting cheaper, lower-memory cloud GPUs and offloading to attached NVMe instead of paying for higher-end 80 GB cards.

Usage patterns

Optimizer State Offloading to CPU and NVMe in practice

Fine-tuning a 13-billion-parameter LLM on a single 24 GB GPU, using DeepSpeed ZeRO-Offload to push Adam states into CPU RAM. Teams tend to get better results when they set sensible thresholds up front, keep a human escalation path for problems, and track product outcomes and errors.
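As a rough illustration of what such a setup looks like, the sketch below builds a DeepSpeed-style configuration with ZeRO stage 2 and optimizer offload to CPU. The function names and the batch-size values are assumptions picked for illustration, not recommendations; check the key names against the DeepSpeed configuration documentation for the version you use.

```python
import json


def zero_offload_config(micro_batch_size=1, grad_accum_steps=16):
    """Build a ZeRO-Offload style config: optimizer state and step on the CPU.

    The batch-size defaults are placeholders, not tuned values.
    """
    return {
        "train_micro_batch_size_per_gpu": micro_batch_size,
        "gradient_accumulation_steps": grad_accum_steps,
        "fp16": {"enabled": True},
        "zero_optimization": {
            "stage": 2,
            "offload_optimizer": {
                "device": "cpu",      # keep Adam states in host RAM
                "pin_memory": True,   # page-locked buffers for faster copies
            },
        },
    }


def to_json(config):
    """Serialize the config the way a ds_config.json file would store it."""
    return json.dumps(config, indent=2, sort_keys=True)


if __name__ == "__main__":
    print(to_json(zero_offload_config()))
```

By the usual count of 2 bytes per parameter, the fp16 weights of a 13-billion-parameter model come to roughly 26 GB, so on a 24 GB card a setup like this would likely also need parameter offload (ZeRO stage 3 with `offload_param`) or a more compact weight format. Treat that as back-of-envelope arithmetic, not a measured result.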

A small research lab training a model with a very large parameter count on a handful of GPUs by spilling optimizer states to NVMe drives with ZeRO-Infinity.
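A comparable sketch for the NVMe case follows. It assumes ZeRO stage 3, which DeepSpeed documents as the stage that supports NVMe offload, and a hypothetical mount point `/local_nvme`. The `aio` values are illustrative starting points and the function name is made up for this example.

```python
import json


def zero_infinity_nvme_config(nvme_path="/local_nvme"):
    """Build a ZeRO-Infinity style config that spills optimizer state to NVMe.

    nvme_path is a hypothetical mount point; replace it with a real fast local disk.
    """
    return {
        "fp16": {"enabled": True},
        "zero_optimization": {
            "stage": 3,
            "offload_optimizer": {
                "device": "nvme",
                "nvme_path": nvme_path,
                "pin_memory": True,
            },
        },
        # Asynchronous I/O settings; the values below are illustrative, not tuned.
        "aio": {
            "block_size": 1048576,
            "queue_depth": 8,
            "thread_count": 1,
            "single_submit": False,
            "overlap_events": True,
        },
    }


if __name__ == "__main__":
    print(json.dumps(zero_infinity_nvme_config(), indent=2))
```

Only the optimizer state is offloaded here, matching the scenario above. Parameters could be sent to the same tier with an `offload_param` section, at the price of more disk traffic per step.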

Hugging Face Accelerate configs that enable CPU offload so users can run full fine-tuning jobs that would otherwise fail with out-of-memory errors.

Cost-conscious startups renting cheaper, lower-memory cloud GPUs and offloading to attached NVMe instead of paying for higher-end 80 GB cards.

Risks and guardrails

Optimizing for a single benchmark can hide broader weaknesses in the system.

Infrastructure and maintenance costs are often underestimated.

As systems become harder to understand, safety and monitoring problems can multiply.

Implementation roadmap

1. Establish baseline latency, quality, and cost before adoption.
2. Benchmark on in-domain, real-world data.
3. Instrument monitoring for errors, drift, and user impact.
4. Prepare rollback and incident-response paths before scaling.

Treat each step as an evidence gate: if the criteria are not met, pause the rollout, close the gap, and only then expand usage.
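One way to make an evidence gate concrete is a small check that compares an offloaded run against a baseline. Everything in the sketch below is an assumption for illustration: the `RunMetrics` fields, the `evidence_gate` name, and especially the threshold defaults, which each team would replace with its own criteria.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunMetrics:
    step_time_s: float    # wall-clock seconds per optimizer step
    eval_loss: float      # quality proxy measured on in-domain data
    cost_per_hour: float  # infrastructure cost in any consistent currency


def evidence_gate(baseline, candidate, max_slowdown=2.0, max_loss_increase=0.01):
    """Return the names of failed criteria; an empty list means the gate passes.

    The default thresholds are placeholders, not recommendations.
    """
    failures = []
    if candidate.step_time_s > max_slowdown * baseline.step_time_s:
        failures.append("latency")
    if candidate.eval_loss > baseline.eval_loss + max_loss_increase:
        failures.append("quality")
    if candidate.cost_per_hour > baseline.cost_per_hour:
        failures.append("cost")
    return failures


if __name__ == "__main__":
    baseline = RunMetrics(step_time_s=1.0, eval_loss=2.00, cost_per_hour=10.0)
    offloaded = RunMetrics(step_time_s=1.6, eval_loss=2.005, cost_per_hour=4.0)
    failed = evidence_gate(baseline, offloaded)
    print("expand usage" if not failed else f"pause rollout, failed: {failed}")
```

Because offloading trades speed for capacity, the latency criterion is expressed as an allowed slowdown and not as a requirement to match the baseline.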
