Technical Guide

Optimizer State Offloading to CPU and NVMe

A memory-saving technique that parks the heavy bookkeeping of training (optimizer states, gradients, and sometimes weights) in CPU RAM or on NVMe SSDs instead of scarce GPU memory.

Overview

Offloading lets practitioners train models that are larger than their GPU memory would otherwise allow.

Optimizer state offloading to CPU and NVMe is a technical building block that affects model quality, infrastructure cost, latency, and reliability at scale.

Deep Dive

When you train a neural network with an optimizer such as Adam, every parameter carries extra baggage: two running statistics (momentum and variance), a high-precision copy of the weight, and its gradient. In mixed-precision training this can add up to roughly 16 bytes per parameter, dwarfing the 2 bytes needed for the weight itself. Offloading moves that burden off the GPU. CPU offloading streams optimizer state into ordinary system RAM over the PCIe bus, while NVMe offloading pushes it further down to fast solid-state drives. Popularized by DeepSpeed's ZeRO-Offload and ZeRO-Infinity, the approach trades raw speed for capacity, letting a single GPU or a small cluster fine-tune models with billions of parameters.
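The 16-bytes-per-parameter figure can be checked with a little arithmetic. The sketch below assumes the common fp16/fp32 layout for mixed-precision Adam and, as a simplification, treats only the fp32 master weights and the two Adam moments as offloaded. The names `BYTES`, `OFFLOADED`, `bytes_per_param`, and `gpu_gib` are illustrative and do not come from any library.

```python
# Rough per-parameter GPU memory for mixed-precision Adam training.
# Byte sizes assume the usual fp16/fp32 layout; they are not measurements.
BYTES = {
    "fp16_weight": 2,
    "fp16_gradient": 2,
    "fp32_master_weight": 4,
    "fp32_momentum": 4,
    "fp32_variance": 4,
}

# Assumption for this sketch: only these items leave the GPU when offloading.
OFFLOADED = ("fp32_master_weight", "fp32_momentum", "fp32_variance")


def bytes_per_param(offload: bool = False) -> int:
    """Bytes of GPU memory held per parameter, ignoring activations."""
    return sum(
        size
        for name, size in BYTES.items()
        if not (offload and name in OFFLOADED)
    )


def gpu_gib(num_params: float, offload: bool = False) -> float:
    """GPU memory in GiB for weights, gradients, and optimizer state."""
    return num_params * bytes_per_param(offload) / 2**30


if __name__ == "__main__":
    n = 1.5e9  # hypothetical model size
    print(f"no offload:        {gpu_gib(n):.1f} GiB")
    print(f"optimizer offload: {gpu_gib(n, offload=True):.1f} GiB")
```

Activations, temporary buffers, and framework overhead are deliberately left out, so real usage will be higher than these estimates.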

Technical Insight

The key is overlapping data movement with computation. Optimizer states live on the CPU or on NVMe; during the backward pass, partitions are prefetched over PCIe just before they are needed, and the update step itself often runs on the CPU. ZeRO-Offload keeps the float32 master weights and the Adam moments on the CPU, so only the forward and backward computation stays on the GPU. NVMe adds a further storage tier, so that terabyte-scale state can spill to disk while the hot partitions stay in RAM.
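As a concrete illustration, the sketch below builds a DeepSpeed-style configuration dictionary that places optimizer state on the CPU or on NVMe. The keys `zero_optimization`, `stage`, `offload_optimizer`, `device`, `pin_memory`, `nvme_path`, and `overlap_comm` follow the DeepSpeed configuration reference as far as I know, but the helper `make_offload_config`, the batch size, and the example path are assumptions for illustration; verify the keys against the documentation for your installed version.

```python
import json
from typing import Optional


def make_offload_config(device: str = "cpu",
                        nvme_path: Optional[str] = None) -> dict:
    """Build a minimal DeepSpeed-style config with optimizer offloading.

    Hypothetical helper: the structure mirrors DeepSpeed's JSON config,
    but the values are placeholders, not tuned settings.
    """
    if device not in ("cpu", "nvme"):
        raise ValueError("device must be 'cpu' or 'nvme'")
    if device == "nvme" and not nvme_path:
        raise ValueError("nvme_path is required when device is 'nvme'")

    offload = {"device": device, "pin_memory": True}
    if device == "nvme":
        offload["nvme_path"] = nvme_path

    return {
        "train_micro_batch_size_per_gpu": 1,  # placeholder value
        "fp16": {"enabled": True},
        "zero_optimization": {
            # NVMe offloading is documented as a ZeRO stage 3 feature;
            # CPU offloading of optimizer state also works at stage 2.
            "stage": 3 if device == "nvme" else 2,
            "offload_optimizer": offload,
            "overlap_comm": True,
        },
    }


if __name__ == "__main__":
    # "/local_nvme" is an example mount point, not a required path.
    print(json.dumps(make_offload_config("nvme", "/local_nvme"), indent=2))
```

The resulting dictionary can be written to a JSON file for the training launcher to read. Pinned host memory is enabled here because it generally speeds up transfers over PCIe, at the cost of locking that RAM.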

Mastering Optimizer State Offloading to CPU and NVMe

To build a deep understanding, treat optimizer state offloading as an operating model rather than a single feature: define the outcomes you want, make your assumptions explicit, and separate what the system can do reliably from what still requires expert judgment.

In practice, strong teams that use optimizer state offloading weigh their architecture, data, and infrastructure choices against reliability and cost. They write explicit success criteria, evaluate against real data and workflows, and iterate on observed failure patterns rather than one-off benchmark wins. This is where theoretical understanding turns into durable capability across product, policy, and operations.

Architecture decisions drive performance and operating costs for years, yet optimizing for a single benchmark can hide broader system weaknesses. The most robust approach combines speed of experimentation with disciplined governance: run pilots, capture evidence, publish decision logs, and keep updating safeguards as model behavior, user expectations, and regulatory requirements change.

Strategic Impact

Architecture decisions drive performance and operating costs for years.

Technical literacy helps teams choose the right stack, not merely the newest one.

Better engineering choices reduce reliability incidents in production.

In high-quality deployments, each of these translates into measurable operating rules, clear ownership boundaries, and repeatable review practices, so that teams scale confidence rather than ambiguity.

The Future of Optimizer State Offloading to CPU and NVMe

As models keep outgrowing GPU memory, tiered offloading is becoming the norm rather than the exception. Expect tighter integration with fast interconnects such as NVLink-C2C and with CXL memory pools that blur the CPU-GPU boundary, along with smarter schedulers that predict which state to fetch next. Unified memory architectures such as Grace Hopper reduce the PCIe penalty, and frameworks are moving toward making multi-tier offloading nearly transparent, so that hobbyists can fine-tune large models on modest hardware.

Real-World Applications

Fine-tuning a 13-billion-parameter LLM on a single 24 GB consumer GPU, using DeepSpeed ZeRO-Offload to push the Adam states to CPU RAM (at this size the fp16 weights alone come to about 26 GB, so parameters usually have to be offloaded or quantized as well).

A small research lab training a multi-billion-parameter model on a handful of GPUs by spilling optimizer states to NVMe drives with ZeRO-Infinity.

Hugging Face Accelerate configurations that enable CPU offloading, so users can run fine-tuning jobs that would otherwise fail with out-of-memory errors.

Cost-conscious startups renting cheaper, lower-memory cloud GPUs and offloading to attached NVMe instead of paying for top-end 80 GB cards.
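A rough capacity plan makes scenarios like these easier to judge. The sketch below assumes mixed-precision Adam with fp16 weights and gradients resident on the GPU and 12 bytes per parameter of fp32 state (master weights plus two moments) held on the host or on NVMe. The function `plan_offload` and its field names are hypothetical, and activations and framework overhead are ignored.

```python
GIB = 2**30


def plan_offload(num_params: float, gpu_gib: float) -> dict:
    """Estimate where training state lands under optimizer offloading.

    Assumptions (not measurements): 2 bytes/param for fp16 weights,
    2 bytes/param for fp16 gradients, 12 bytes/param of fp32 optimizer
    state kept off the GPU.
    """
    weights = num_params * 2 / GIB
    grads = num_params * 2 / GIB
    host_state = num_params * 12 / GIB
    return {
        "gpu_weights_gib": weights,
        "gpu_weights_plus_grads_gib": weights + grads,
        "offloaded_state_gib": host_state,
        # If even the fp16 weights exceed GPU memory, parameters must be
        # offloaded or quantized too; optimizer offloading is not enough.
        "weights_fit_on_gpu": weights <= gpu_gib,
    }


if __name__ == "__main__":
    for n in (1.3e9, 13e9):
        p = plan_offload(n, gpu_gib=24)
        print(
            f"{n / 1e9:.1f}B params: "
            f"GPU weights {p['gpu_weights_gib']:.1f} GiB, "
            f"offloaded state {p['offloaded_state_gib']:.1f} GiB, "
            f"weights fit: {p['weights_fit_on_gpu']}"
        )
```

The offloaded figure is also a reminder that the host needs enough RAM or NVMe capacity to hold the state that leaves the GPU.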

Usage Patterns

Optimizer State Offloading to CPU and NVMe in action

Across the applications listed above, teams generally get better results when they define quality thresholds up front, keep a human escalation path for edge cases, and track both productivity gains and the cost of errors over time.

Risks & Guardrails

Optimizing for a single benchmark can hide broader system weaknesses.

Infrastructure and maintenance costs are often underestimated.

Security and observability gaps can widen as systems become more complex.

Implementation Guide

1. Define latency, quality, and cost targets before rollout.

2. Benchmark under realistic load and data conditions.

3. Instrument monitoring for errors, drift, and user impact.

4. Prepare rollback and incident procedures before scaling.

Treat each step as an evidence gate: if the criteria are not met, pause the rollout, close the gap, and only then expand usage.
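The evidence-gate idea can be expressed as a small check that compares measured values with the targets set before rollout. This is only a sketch: the `Target` class, the `gate` function, and the metric names (`step_time_s`, `eval_accuracy`, `cost_per_run_usd`) are hypothetical and would be replaced by whatever a team actually measures.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Target:
    name: str
    limit: float
    higher_is_better: bool = False


def gate(measured: Dict[str, float], targets: List[Target]) -> List[str]:
    """Return the names of targets that failed; empty means proceed."""
    failures = []
    for t in targets:
        value = measured.get(t.name)
        if value is None:
            # Missing evidence counts as a failure, not a pass.
            failures.append(t.name)
            continue
        ok = value >= t.limit if t.higher_is_better else value <= t.limit
        if not ok:
            failures.append(t.name)
    return failures


if __name__ == "__main__":
    targets = [
        Target("step_time_s", 4.0),
        Target("eval_accuracy", 0.80, higher_is_better=True),
        Target("cost_per_run_usd", 50.0),
    ]
    measured = {"step_time_s": 5.2, "eval_accuracy": 0.83}
    failed = gate(measured, targets)
    print("pause rollout:" if failed else "proceed", failed)
```

Treating a missing measurement as a failure keeps the gate from passing by default when a benchmark was never run.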
