Overview
Kana modhi yakakurisa kuti ikwane pane imwe GPU, modhi uye pombi parallelism inopatsanura modhi yacho pachayo pamidziyo. Izvi ndizvo zvinoita kuti kudzidzisa mamodheru emitauro mikuru ane mazana emabhiriyoni emaparamita anogona kuitika.
Model uye Pipeline Parallelism inyanzvi yekuvaka inobata mhando yemhando, mutengo wezvivakwa, latency, uye kuvimbika pachiyero.
Deep Dive
Model parallelism partitions modhi imwechete kune akawanda maGPU saka hapana mudziyo mumwe unoda kubata huremu hwese. Pane zvinonaka zviviri zvikuru. Tensor (intra-layer) parallelism inotsemura math mukati mechikamu, sekucheka matrix akakura kuwanda muGPUs iyo yega yega inoverengera chikamu chekubuda. Pipeline (inter-layer) parallelism inopa akasiyana akateedzana akateedzana kune akasiyana maGPU, saka layer block 1 inogara paGPU 0, block 2 paGPU 1, zvichingodaro, ne activation inopfuudzwa kumberi semutsetse wegungano. Dambudziko nepipelini yekusaziva ndiyo 'bubble': nepo GPU 0 inoshanda pabatch yekutanga, yakadzika maGPU anogara asina basa. Pipelining inotsemura batch yega yega kuita madiki-mabhechi kuti ese matanho agare akabatikana, zvinoshamisa kuvandudza mashandisirwo.
Technical Insight
Tensor parallelism (sezviri muNVIDIA Megatron-LM) inopatsanura huremu matrices koramu- kana mutsara-huchenjeri uye inoshandisa-yese-kudzikisa kusanganisa mhedzisiro mhedzisiro, kuchengetedza kutaurirana mukati mekukurumidza NVLink node. Pipeline parallelism (GPipe, PipeDream) inokamura batch kuita madiki-mabheji anoyerera nematanho mune yakadzama hurongwa, ichinyura isina basa 'bubble' nguva. Iwo maviri anowanzo akaturikidzana, aine tensor parallelism mukati me node uye pombi parallelism mhiri node.
Mastering Model uye Pipeline Parallelism
Kana modhi yakakurisa kuti ikwane pane imwe GPU, modhi uye pombi parallelism inopatsanura modhi yacho pachayo pamidziyo. Izvi ndizvo zvinoita kuti kudzidzisa mamodheru emitauro mikuru ane mazana emabhiriyoni emaparamita anogona kuitika. Model uye Pipeline Parallelism inyanzvi yekuvaka inobata mhando yemhando, mutengo wezvivakwa, latency, uye kuvimbika pachiyero. Kuti uvake kunzwisisa kwakadzama, bata Model uye Pipeline Parallelism semuenzaniso wekushandisa, kwete chinhu chimwe chete: tsanangura zvinodiwa, kujekesa fungidziro, uye patsanura izvo zvinogona kuitwa nehurongwa hwakavimbika kubva kune zvichiri kuda kutonga kwenyanzvi.
Mukuita, zvikwata zvakasimba zvinoshandisa Model uye Pipeline Parallelism zvinogonesa zvivakwa, data, uye sarudzo dzezvivakwa zvinopesana nekuvimbika uye mutengo. Ivo vanonyora zvakajeka maitiro ebudiriro, bvunzo vachipokana ne data rechokwadi uye mafambiro ebasa, uye iterate zvichibva pane zvakacherechedzwa maitiro ekutadza kwete kuhwina-nguva imwe chete yebhenji. Apa ndipo apo kunzwisisa kwe theoretical kunoshanduka kuve kugona kwakasimba pane chigadzirwa, mutemo, uye mashandiro.
Zvisarudzo zvezvivakwa zvinotyaira kuita uye mutengo wekushandisa kwemakore. Panguva imwecheteyo, Kukwirisa imwe bhenji kunogona kuvanza yakafara system kushaya simba. Nzira yakatsiga ndeyekubatanidza kukurumidza kuyedza nekutonga: mhanyisa vatyairi vendege, tora humbowo, buritsa matanda esarudzo, uye urambe uchivandudza chengetedzo semaitiro emuenzaniso, zvinotarisirwa nemushandisi, uye zvinodikanwa zvekutonga.
Strategic Impact
Zvisarudzo zvezvivakwa zvinotyaira kuita uye mutengo wekushandisa kwemakore.
Zvisarudzo zvezvivakwa zvinotyaira kuita uye mutengo wekushandisa kwemakore. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.
Dzidzo yehunyanzvi inobatsira zvikwata kusarudza murwi wakakodzera, kwete iwo mutsva chete.
Dzidzo yehunyanzvi inobatsira zvikwata kusarudza murwi wakakodzera, kwete iwo mutsva chete. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.
Sarudzo dzeinjiniya dziri nani dzinoderedza zviitiko zvekuvimbika mukugadzira.
Sarudzo dzeinjiniya dziri nani dzinoderedza zviitiko zvekuvimbika mukugadzira. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.
Real-World Implementation
Kudzidzira GPT-maitiro emhando neNVIDIA Megatron-LM, iyo inotsemura kutarisa kwega kwega kweshanduko uye kudyisa-mberi matrices paGPUs kuburikidza ne tensor parallelism.
Kushandisa GPipe kuisa zvikamu zvakasiyana zvechiono chihombe kana modhi yemutauro pane akasiyana accelerator nepo micro-batching ichivachengeta vakabatikana.
DeepSpeed's pombi injini yekuparadzanisa mazana-mazana-mabhiriyoni-parameter modhi mumatanho munzvimbo dzakawanda.
Kubatanidza tensor parallelism mukati meimwe 8-GPU sevha ine pombi parallelism inotenderera akawanda maseva kudzidzisa modhi yakakura kwazvo pamushini mumwe.
Maitiro Ekuita
Model uye Pipeline Parallelism mukuita
Kudzidzira GPT-maitiro emhando neNVIDIA Megatron-LM, iyo inotsemura kutarisa kwega kwega kweshanduko uye kudyisa-mberi matrices paGPUs kuburikidza ne tensor parallelism.
Kudzidzira GPT-maitiro emhando neNVIDIA Megatron-LM, iyo inotsemura kutarisisa kwega kwega kweshanduko uye kudyisa-mberi matrices paGPUs kuburikidza ne tensor parallelism Matimu anowanzo kuwana mibairo iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi emupendero, uye kuteedzera zvese zvakawanikwa zvechigadzirwa uye mutengo wekukanganisa nekufamba kwenguva.
Model uye Pipeline Parallelism mukuita
Kushandisa GPipe kuisa zvikamu zvakasiyana zvechiono chihombe kana modhi yemutauro pane akasiyana accelerator nepo micro-batching ichivachengeta vakabatikana.
Kushandisa GPipe kuisa zvikamu zvakasiyana zvechiono chihombe kana modhi yemutauro pazviyereso zvakaparadzana ukuwo micro-batching ichivachengeta vakabatikana Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura mhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi emupendero, uye kuteedzera zvese zvakawanikwa zvechigadzirwa uye kukanganisa mutengo nekufamba kwenguva.
Model uye Pipeline Parallelism mukuita
DeepSpeed's pombi injini yekuparadzanisa mazana-mazana-mabhiriyoni-parameter modhi mumatanho munzvimbo dzakawanda.
DeepSpeed's pombi injini inopatsanura mazana-mazana-mabhiriyoni-parameta modhi mumatanho munzvimbo dzakawanda Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi emupendero, uye kuteedzera zvese zvakawanikwa zvechigadzirwa uye mutengo wekukanganisa nekufamba kwenguva.
Model uye Pipeline Parallelism mukuita
Kubatanidza tensor parallelism mukati meimwe 8-GPU sevha ine pombi parallelism inotenderera akawanda maseva kudzidzisa modhi yakakura kwazvo pamushini mumwe.
Kubatanidza tensor parallelism mukati meimwe 8-GPU sevha ine pombi parallelism inotora akawanda maseva kudzidzisa modhi yakakurisa zvakanyanya kumuchina mumwe Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi ekumucheto, uye kuteedzera zvese zvakawanikwa zvechigadzirwa uye kukanganisa mutengo nekufamba kwenguva.
Njodzi & Guardrails
Kugadzirisa imwe bhenji kunogona kuvanza yakafara system kushaya simba.
Infrastructure uye mari yekugadzirisa inowanzotarisirwa pasi.
Chengetedzo uye kucherechedzwa mapundu anogona kukura sezvo masisitimu anowedzera kuoma.
Implementation Roadmap
Tsanangura latency, mhando, uye mutengo zvinangwa usati waitwa.
Tsanangura latency, mhando, uye mutengo zvinangwa usati waitwa. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.
Benchmark pasi pechokwadi mutoro uye data mamiriro.
Benchmark pasi pechokwadi mutoro uye data mamiriro. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.
Chishandiso chekutarisa zvikanganiso, kudonha, uye mushandisi maitiro.
Chishandiso chekutarisa zvikanganiso, kudonha, uye mushandisi maitiro. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.
Gadzirira nzira dzekudzosera kumashure uye dzezviitiko usati wawedzera.
Gadzirira nzira dzekudzosera kumashure uye dzezviitiko usati wawedzera. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.