Technical GUIDE

Mixtral uye Sparse Models

Mixtral ndiyo Mistral AI yakavhurika musanganiswa-ye-nyanzvi modhi iyo inopa yakakura-modhi yemhando padiki-modhi yekumhanya.

Overview

Mixtral ndiyo Mistral AI yakavhurika musanganiswa-ye-nyanzvi modhi iyo inopa yakakura-modhi yemhando padiki-modhi yekumhanya. Sparse modhi senge iyo inomutsa chete chidimbu chemaparamita patokeni, yekucheka komputa pasina kupa kupa kugona.

Mixtral uye Sparse Models inyanzvi yekuvaka inobata mhando yemhando, mutengo wezvivakwa, latency, uye kuvimbika pachiyero.

Deep Dive

Mixtral 8x7B, yakaburitswa neMistral AI mukupera kwa2023, yakasimudzira nzira yekusanganisa-yenyanzvi (MoE) mumamodhi akavhurika. Iyo ine sere dzakaparadzana 'nyanzvi' yekudyisa-mberi network padanho, inosvika 47 bhiriyoni yakazara paramita, asi isingaremi router inosarudza nyanzvi mbiri chete pachiratidzo chega chega. Nekuda kweizvozvo, ingangoita bhiriyoni gumi nematatu paramita inoshanda pachiratidzo, saka fungidziro inomhanya nekukurumidza seye 13B dense modhi ichisvika pamhando inofananidzwa neyakakura kwazvo. Mixtral inofananidzwa kana kurova GPT-3.5 uye Llama 2 70B pane akawanda mabhenji uku uchikurumidza uye nekuchipa kushanda. Mistral akazoburitsa Mixtral 8x22B. Iyo modhi inopihwa pachena rezinesi pasi peApache 2.0, ichikurudzira kutorwa nekukurumidza uye kugadzirisa zvakanaka munharaunda yakavhurika-sosi.

Technical Insight

Mune imwe shoma yeMoE layer, iyo dense feed-forward block inotsiviwa neN nyanzvi network pamwe nediki network network (iyo router). Pachiratidzo chega chega, iyo router inoverengera zvibodzwa uye inotora yepamusoro-k nyanzvi (yepamusoro-2 muMixtral), ichifambisa chiratidzo kuburikidza neavo. Migumisiro yavo inoyerwa uye inopfupikiswa. Nekuti nyanzvi zhinji dzinogara dzisina basa pane tokeni, modhi inobata ma paramita mazhinji mundangariro asi ichiita kushoma komputa. Iko kutengeserana-kure: nyanzvi dzese dzinofanirwa kutakurwa muVRAM kunyangwe vamwe vachimhanya.

Mastering Mixtral uye Sparse Models

Mixtral ndiyo Mistral AI yakavhurika musanganiswa-ye-nyanzvi modhi iyo inopa yakakura-modhi yemhando padiki-modhi yekumhanya. Sparse modhi senge iyo inomutsa chete chidimbu chemaparamita patokeni, yekucheka komputa pasina kupa kupa kugona. Mixtral uye Sparse Models inyanzvi yekuvaka inobata mhando yemhando, mutengo wezvivakwa, latency, uye kuvimbika pachiyero. Kuti uvake kunzwisisa kwakadzama, bata Mixtral neSparse Models semuenzaniso wekushandisa, kwete chinhu chimwe chete: tsanangura zvinodikanwa, kujekesa fungidziro, uye patsanura izvo zvingaitwa nehurongwa zvakavimbika kubva kune zvichiri kuda kutonga kwenyanzvi.

Mukuita, zvikwata zvakasimba zvinoshandisa Mixtral uye Sparse Models zvinogonesa zvivakwa, data, uye sarudzo dzezvivakwa zvinopesana nekuvimbika uye mutengo. Ivo vanonyora zvakajeka maitiro ebudiriro, bvunzo vachipokana ne data rechokwadi uye mafambiro ebasa, uye iterate zvichibva pane zvakacherechedzwa maitiro ekutadza kwete kuhwina-nguva imwe chete yebhenji. Apa ndipo apo kunzwisisa kwe theoretical kunoshanduka kuve kugona kwakasimba pane chigadzirwa, mutemo, uye mashandiro.

Zvisarudzo zvezvivakwa zvinotyaira kuita uye mutengo wekushandisa kwemakore. Panguva imwecheteyo, Kukwirisa imwe bhenji kunogona kuvanza yakafara system kushaya simba. Nzira yakatsiga ndeyekubatanidza kukurumidza kuyedza nekutonga: mhanyisa vatyairi vendege, tora humbowo, buritsa matanda esarudzo, uye urambe uchivandudza chengetedzo semaitiro emuenzaniso, zvinotarisirwa nemushandisi, uye zvinodikanwa zvekutonga.

Strategic Impact

Zvisarudzo zvezvivakwa zvinotyaira kuita uye mutengo wekushandisa kwemakore.

Zvisarudzo zvezvivakwa zvinotyaira kuita uye mutengo wekushandisa kwemakore. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.

Dzidzo yehunyanzvi inobatsira zvikwata kusarudza murwi wakakodzera, kwete iwo mutsva chete.

Dzidzo yehunyanzvi inobatsira zvikwata kusarudza murwi wakakodzera, kwete iwo mutsva chete. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.

Sarudzo dzeinjiniya dziri nani dzinoderedza zviitiko zvekuvimbika mukugadzira.

Sarudzo dzeinjiniya dziri nani dzinoderedza zviitiko zvekuvimbika mukugadzira. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.

Ramangwana reMixtral uye Sparse Models

Sparse MoE ikozvino iri pakati pemuganhu weAI. Tarisira kuburitswa kwakavhurika kweMoE, nzira yakatsetseka nenyanzvi diki dzakawanda, uye akagovaniswa kana mahybrid nyanzvi madhizaini anovandudza kushanda zvakanaka. Sezvo mamodheru anokwira akananga kumatiririyoni ezvese paramita, sparsity ndiyo huru lever yekuchengeta fungidziro inokwanisika. Tsvagiridzo iri kubata nzvimbo dzisina kusimba dzeMoE, kuyera kuyera kune nyanzvi, ndangariro pamusoro, uye kugadzikana kwekudzidziswa, nepo Hardware uye masheki ekushandira achiwedzera kukwenenzvera kune nyanzvi nzira.

Real-World Implementation

Kushandira chatbot yemhando yepamusoro pamutengo uye nekumhanya kweiyo diki dense modhi

Kuzvitambira wega Apache-2.0 ine rezinesi modhi yezvigadzirwa zvekutengesa pasina muripo wekushandisa

Kunyatsogadzirisa maitiro emunhu paMixtral yekukodha, kupfupisa, kana mabasa emitauro yakawanda

Kumhanya kukurumidza kufungidzira pane imwechete yakawanda-GPU server uko 70B dense modhi yaizonyanya kunonoka

Maitiro Ekuita

Mixtral uye Sparse Models mukuita

Kushandira chatbot yemhando yepamusoro pamutengo uye nekumhanya kweiyo diki dense modhi.

Kushandira chatbot yemhando yepamusoro pamutengo uye nekumhanya kweiyo diki diki modhi Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi ekumucheto, uye kuteedzera zvese zvakawanikwa zvechigadzirwa uye mutengo wekukanganisa nekufamba kwenguva.

Mixtral uye Sparse Models mukuita

Kuzvitambira wega Apache-2.0 ine rezinesi modhi yezvigadzirwa zvekutengesa pasina muripo wekushandisa.

Kuzvitambira wega Apache-2.0 ine rezenisi modhi yezvigadzirwa zvekutengesa pasina muripo wekushandisa Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi ekumucheto, uye kuteedzera zvese zvakawanikwa zvechigadzirwa nemitengo yekukanganisa nekufamba kwenguva.

Mixtral uye Sparse Models mukuita

Kunyatsogadzirisa maitiro emunhu paMixtral yekukodha, kupfupisa, kana mabasa emitauro yakawanda.

Kunyatsogadzirisa maitiro emunhu paMixtral yekukodha, kupfupisa, kana mabasa emitauro yakawanda Zvikwata zvinowanzowana zvibodzwa zviri nani kana zvichitsanangudza zvikumbaridzo zvemhando yepamusoro, chengetedza nzira yekukwira kwevanhu yemakesi emupendero, uye kuteedzera zvese zvakawanikwa mukubereka uye mutengo wekukanganisa nekufamba kwenguva.

Mixtral uye Sparse Models mukuita

Kumhanya kukurumidza kufungidzira pane imwechete yakawanda-GPU server uko 70B dense modhi yaizonyanya kunonoka.

Kumhanya kukurumidza kufungidzira pane imwechete yakawanda-GPU sevha uko 70B dense modhi yaizonyanya kunonoka Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi emupendero, uye kuteedzera zvese zvakawanikwa zvechigadzirwa nemitengo yekukanganisa nekufamba kwenguva.

Njodzi & Guardrails

!

Kugadzirisa imwe bhenji kunogona kuvanza yakafara system kushaya simba.

!

Infrastructure uye mari yekugadzirisa inowanzotarisirwa pasi.

!

Chengetedzo uye kucherechedzwa mapundu anogona kukura sezvo masisitimu anowedzera kuoma.

Implementation Roadmap

1

Tsanangura latency, mhando, uye mutengo zvinangwa usati waitwa.

Tsanangura latency, mhando, uye mutengo zvinangwa usati waitwa. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.

2

Benchmark pasi pechokwadi mutoro uye data mamiriro.

Benchmark pasi pechokwadi mutoro uye data mamiriro. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.

3

Chishandiso chekutarisa zvikanganiso, kudonha, uye mushandisi maitiro.

Chishandiso chekutarisa zvikanganiso, kudonha, uye mushandisi maitiro. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.

4

Gadzirira nzira dzekudzosera kumashure uye dzezviitiko usati wawedzera.

Gadzirira nzira dzekudzosera kumashure uye dzezviitiko usati wawedzera. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.

Ramba Uchiongorora