Overview
Jamba is a large language model from AI21 Labs that combines Transformer attention layers with Mamba state-space layers (plus mixture of experts) to achieve long-context efficiency without giving up Transformer quality. It matters because it shows that hybrid architectures can beat pure Transformers on memory and throughput at long sequence lengths.
Jamba hybrid Transformer-Mamba models belong to the language-AI stack used to read, generate, classify, and transform text and speech at scale.
Deep Dive
Pure Transformers pay a quadratic cost in attention as the context grows, and their key-value (KV) cache balloons with sequence length. Pure state-space models such as Mamba scale linearly with sequence length and keep a constant-size recurrent state, but historically they have lagged on some tasks. Jamba interleaves the two: it stacks blocks in which most layers are Mamba (cheap, linear, well suited to long sequences) and a small number are standard attention (strong at precise recall and in-context reasoning). It also adds mixture-of-experts (MoE) layers to grow capacity while keeping the number of active parameters modest. The first Jamba shipped with a 256K-token context window and can fit more context on a single GPU than comparable Transformers, thanks to its much smaller KV cache.
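The memory argument above can be made concrete with a little arithmetic. The sketch below is illustrative only: the layer count, the one-in-eight attention ratio, the number of KV heads, the head dimension, and 16-bit cache values are assumptions chosen for the example, not figures stated in this article. It counts only the attention KV cache, since a Mamba layer's state does not grow with sequence length.

```python
def kv_cache_bytes(seq_len, n_attn_layers, n_kv_heads, head_dim, bytes_per_value=2):
    """Bytes needed to cache keys and values for one sequence."""
    # The factor 2 covers the separate key and value tensors in each attention layer.
    return 2 * n_attn_layers * n_kv_heads * head_dim * seq_len * bytes_per_value


def hybrid_layout(n_layers, attn_every):
    """Label each layer, placing one attention layer in every `attn_every` layers."""
    return ["attention" if i % attn_every == attn_every - 1 else "mamba"
            for i in range(n_layers)]


# Illustrative configuration (assumed for this example).
SEQ_LEN = 256 * 1024   # 256K tokens
N_LAYERS = 32
N_KV_HEADS = 8
HEAD_DIM = 128

layout = hybrid_layout(N_LAYERS, attn_every=8)
n_attn = layout.count("attention")

# Every layer caches K/V in the pure Transformer; only attention layers do in the hybrid.
pure = kv_cache_bytes(SEQ_LEN, N_LAYERS, N_KV_HEADS, HEAD_DIM)
hybrid = kv_cache_bytes(SEQ_LEN, n_attn, N_KV_HEADS, HEAD_DIM)

GIB = 1024 ** 3
print(f"attention layers in hybrid: {n_attn} of {N_LAYERS}")
print(f"pure Transformer KV cache: {pure / GIB:.0f} GiB")
print(f"hybrid KV cache:           {hybrid / GIB:.0f} GiB")
```

Under these assumptions the cache shrinks in proportion to the share of layers that still use attention, which is the effect the paragraph describes.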
Technical Insight
Mamba is a selective state-space model: instead of looking back at every previous token, it keeps a compressed recurrent state that is updated step by step along the sequence, with an input-dependent gate that decides what to keep and what to forget. Jamba places a few full-attention layers among many Mamba layers so the model retains precise long-range lookup while most of the compute and memory stays linear, and an MoE router activates only a small set of experts for each token.
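Both mechanisms in the paragraph above can be sketched in a few lines. This is a toy illustration, not Mamba's actual parameterization or Jamba's router: the scalar state, the sigmoid gate with `gate_weight` and `gate_bias`, and the choice of `k=2` experts are all assumptions made for the example.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def selective_scan(xs, gate_weight=2.0, gate_bias=0.0):
    """Toy gated recurrence whose gate depends on the current input.

    The state `h` is a single float however long `xs` is, which is the
    constant-memory property a state-space layer relies on.
    """
    h = 0.0
    outputs = []
    for x in xs:
        write = sigmoid(gate_weight * x + gate_bias)  # how much of x to store
        h = (1.0 - write) * h + write * x             # keep the rest of the old state
        outputs.append(h)
    return outputs


def top_k_route(scores, k=2):
    """Pick the k highest-scoring experts and softmax-normalize their weights."""
    chosen = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    peak = max(scores[i] for i in chosen)
    exps = {i: math.exp(scores[i] - peak) for i in chosen}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}


print(selective_scan([1.0, -3.0, 0.5]))
print(top_k_route([0.2, 1.5, -0.3, 0.9], k=2))
```

In the recurrence, an input that drives the gate toward zero leaves the state almost untouched, which is the "forget or keep" behavior described above. In the router, only the selected experts receive non-zero weight, so the remaining experts do no work for that token.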
Mastering Jamba Hybrid Transformer-Mamba Models
To build a deep understanding, treat Jamba hybrid Transformer-Mamba models as an operating model rather than a single feature: define the outcomes you want, make your assumptions explicit, and separate what the system can do reliably from what still needs expert judgment.
In practice, strong teams using Jamba design prompts, retrieval, and review loops as one coordinated system. They write clear success criteria, evaluate against real data and real workflows, and iterate on observed failure patterns instead of chasing one-off benchmark wins. This is where theoretical understanding turns into durable capability across product, policy, and operations.
Language workflows can move faster without sacrificing consistency. At the same time, hallucinated facts can silently enter reports, support workflows, or research outputs. The most robust approach pairs experimentation speed with governance discipline: run pilots, capture evidence, publish decision logs, and update safeguards continuously as model behavior, user expectations, and regulatory requirements change.
Strategic Impact
Language workflows can move faster without sacrificing consistency. In high-quality deployments, this translates into measurable operating rules, clear ownership boundaries, and repeatable review practices, so teams scale confidence rather than ambiguity.
Access expands across languages and communication styles.
Teams can spend more time on judgment while automation handles the repetitive work.
Real-World Applications
Processing 256K-token inputs, such as long legal filings or large codebases, on a single GPU that could not hold a comparable Transformer's KV cache.
Serving high-throughput, long-context chat, where Mamba's fixed-size state keeps memory low as conversations grow.
Running document analysis and retrieval-augmented generation over very large knowledge bases placed directly in the context.
Deploying an open-weights long-context LLM (Jamba was released with open weights) for research on hybrid architectures.
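Placing documents directly in the context still requires a token budget. The sketch below greedily packs documents into a fixed window. The four-characters-per-token estimate is a rough heuristic assumed for illustration and should be replaced by the model's real tokenizer, and `pack_context`, `reserve_for_output`, and the sample documents are hypothetical names and values.

```python
import math


def estimate_tokens(text, chars_per_token=4.0):
    """Rough token estimate; swap in a real tokenizer for production use."""
    return max(1, math.ceil(len(text) / chars_per_token))


def pack_context(docs, budget_tokens=256 * 1024, reserve_for_output=4096):
    """Greedily add (name, text) pairs in order until the budget is used up."""
    remaining = budget_tokens - reserve_for_output
    packed, skipped = [], []
    for name, text in docs:
        cost = estimate_tokens(text)
        if cost <= remaining:
            packed.append(name)
            remaining -= cost
        else:
            skipped.append(name)  # too large for what is left of the window
    return packed, skipped, remaining


docs = [
    ("contract.txt", "x" * 400),
    ("appendix.txt", "x" * 4000),
    ("summary.txt", "x" * 40),
]
packed, skipped, remaining = pack_context(docs, budget_tokens=1000, reserve_for_output=100)
print(packed, skipped, remaining)
```

Documents that do not fit are reported rather than silently truncated, so a caller can decide whether to summarize them, retrieve only parts of them, or drop them.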
Usage Patterns
Jamba hybrid Transformer-Mamba models in practice
Long-input processing: 256K-token inputs, such as long legal filings or large codebases, on a single GPU that could not hold a comparable Transformer's KV cache.
Long-context chat: high-throughput serving where Mamba's fixed-size state keeps memory low as conversations grow.
Document analysis: retrieval-augmented generation over very large knowledge bases placed directly in the context.
Open-weights research: using an open-weights long-context LLM (Jamba was released with open weights) to study hybrid architectures.
Across all of these patterns, teams generally get better results when they define quality thresholds up front, keep a human escalation path for edge cases, and track both productivity gains and error costs over time.
Risks & Guardrails
Hallucinated facts can silently enter reports, support workflows, or research outputs.
Prompt sensitivity can produce inconsistent outputs across similar requests.
Sensitive text data may be exposed if access controls are weak.
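Prompt sensitivity can be measured before it reaches users. One simple approach, sketched here under stated assumptions, is to send several paraphrases of the same request and check how often the answers agree. The `model` argument is any callable that maps a prompt to a string, `fake_model` is a hypothetical stand-in rather than a real API, and exact matching after normalization is a deliberately crude notion of agreement.

```python
from collections import Counter


def agreement_rate(model, prompts, normalize=lambda s: s.strip().lower()):
    """Return the most common normalized answer and the share of prompts giving it."""
    answers = [normalize(model(p)) for p in prompts]
    top_answer, top_count = Counter(answers).most_common(1)[0]
    return top_answer, top_count / len(answers)


def fake_model(prompt):
    """Hypothetical stand-in for a model call; answers depend on the wording."""
    return "Paris" if "capital" in prompt else "paris, probably"


paraphrases = [
    "What is the capital of France?",
    "France's capital city is?",
    "Name the main city of France.",
]
answer, rate = agreement_rate(fake_model, paraphrases)
print(answer, rate)
```

A low agreement rate on paraphrases that should be equivalent is a signal to tighten the prompt, add grounding, or route the request to human review.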
Getting Started Guide
Define the output format, tone, and quality standards before rollout.
Ground responses in trusted sources whenever accuracy matters.
Keep a human review checkpoint for high-stakes outputs.
Track failure patterns and retune prompts or workflows regularly.
Treat each of these steps as an evidence gate: if the criteria are not met, pause the rollout, close the gap, and only then expand usage.
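The evidence-gate idea can be written down as a small check that runs before each rollout expansion. The sketch below is a minimal illustration: the `Gate` class, the metric names, the thresholds, and the sample measurements are all hypothetical, and a missing measurement is treated as a failed gate.

```python
from dataclasses import dataclass


@dataclass
class Gate:
    name: str        # human-readable step, e.g. "grounding"
    metric: str      # key to look up in the measurements
    minimum: float   # smallest acceptable value


def evaluate_gates(gates, measurements):
    """Return ("expand" or "pause", names of the gates that failed)."""
    failed = [
        g.name for g in gates
        # A metric that was never measured counts as not meeting the criteria.
        if measurements.get(g.metric, float("-inf")) < g.minimum
    ]
    return ("pause" if failed else "expand"), failed


gates = [
    Gate("output format", "format_valid_rate", 0.99),
    Gate("grounding", "cited_answer_rate", 0.95),
    Gate("human review", "review_coverage", 1.0),
]
measurements = {
    "format_valid_rate": 0.995,
    "cited_answer_rate": 0.91,
    "review_coverage": 1.0,
}
decision, failed = evaluate_gates(gates, measurements)
print(decision, failed)
```

Returning the list of failed gates alongside the decision tells the team which gap to close before trying again.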