GUIDE teknik

YaRN ak guddaayu ëmbiit li

YaRN (Beneen RoPE extension) xarala bu am solo la ngir tàllal palanteer bi ñuy jëfandikoo ci model bi, dem fu sori ci li ñu ko tàggat.

Résumé

YaRN (Beneen RoPE extension) xarala bu am solo la ngir tàllal palanteer bi ñuy jëfandikoo ci model bi, dem fu sori ci li ñu ko tàggat. Dafay soppi position rotary ci anam wu xarañ, suko defee benn model buñu tàggat ci, wax, jetons 4K mën na jëfandikoo 32K wala lu ëpp ak fine-tuning bu néew.

YaRN ak Context Length Extension ab bloku tabax la bu am njeexital ci kalite model bi, njëgu infrastructure bi, latency bi, ak wóor ci escale bi.

Plongeur bu xóot

LLM yi gëna bari ci jamono jii dañuy kode position token yi ak RoPE (Embeddings Position Rotary), ñuy wëlbati laajte ak vecteur clé ci angle yuñ takk ci position bi. Soo feed sequence yu gëna gudd guddaayi tàggat yaram, rotation yooyu dañuy dugg ci rang yuñu gisul, model bi dafay yàqu. YaRN, bi Bowen Peng ak ay naataangoom dugal ci 2023, defar na lii ci interpolation NTK-aware buñu jëfandikoo ci fréquence bu nekk: dafay bàyyi dimension yu fréquence yu kawe (yiy jàpp diggante yu am diggante yu gàtt) yu bari te kenn laalu ko ci diggante dimension yu fréquence yu néew (yiy trapoler). YaRN itam dafay yokk tàngoor wiñ koy méngale ak coppite yi ci entropie yiy bawoo ci diggante yu gëna gudd. Resultaa bi mooy performance bu am doole ci contexte bu yàgg ginaaw biñu defaree ay done yu ndaw ak jéego yi approche naïf yi soxla.

Gis-gis xarala

RoPE dafay jox bépp dimension buñ samp fréquence buy wëréelu. Interpolaasioŋ ligneer bu yomb dafay tënk bépp fréquence ci anam wu tolloo, di yàq dimension fréquence yu kawe yiy kode ay detay yu baax ci gox bi. YaRN dafay jëfandikoo fonction rampe ngir interpoler ci dimension yu fréquence yu woyof yi (guddaayi onde yu gudd) yi, boole ci 1/sqrt (t) tàngoor wuy bàyyi xel ci scaling biy tëye softmax sharpness bi gëna mag. Bii NTK-by-parts gis-gis bi dafay yokk muy gëna néew luñuy yàq.

Mastering YaRN ak yokk guddaayi contexte

YaRN (Beneen RoPE extension) xarala bu am solo la ngir tàllal palanteer bi ñuy jëfandikoo ci model bi, dem fu sori ci li ñu ko tàggat. Dafay soppi position rotary ci anam wu xarañ, suko defee benn model buñu tàggat ci, wax, jetons 4K mën na jëfandikoo 32K wala lu ëpp ak fine-tuning bu néew. YaRN ak Context Length Extension ab bloku tabax la bu am njeexital ci kalite model bi, njëgu infrastructure bi, latency bi, ak wóor ci escale bi. Ngir tabax xam-xam bu xóot, jàppal YaRN ak Context Length Extension ni xeetu liggéey, du benn man-man: leeral njariñ yi nga bëgg, leeral xalaat yi, ba noppi tàqale li sistem bi mëna def ci anam wu wóor ak li ba leegi soxla àtteb kàngam.

Ci jëf, ekip yu am doole yiy jëfandikoo YaRN ak Context Length Extension dañuy gëna baaxal architecture, done, ak tànneefi infrastructure ci wàllu wóor ak njëg. Dañuy bind kritër yu leer ngir am ndam, natt leen ci done yu dëggu ak def liggéey, ba noppi ñu baamtu ci anamu ñàkka mëna seetlu, du ci benn yoon benchmark wins. Mooy barab bi xam-xam theorie bi di soppiku nekk kàttan buy yàgg ci produit yi, ci politik yi ak ci liggéey yi.

Dogal yi architecture di jël dañuy indi njariñ ak njëgu liggéey bi ay at ci ginaaw. Ci jamano jooju, Optimisation benn benchmark mën na nëbb ñakk kattan yu gëna yaatu ci sistem bi. Xeetu jëf bi gëna dëgër mooy boole gaawaayu jàngat ak disipline nguur: doxal pilote, jàpp firnde, siiwal dogal yi, ak wéy di yeesal kaaraange gi ci anam wi ñuy doxalee, li jëfandikukat bi di xaar, ak sàrti sàrt yi di jëm kanam.

njeextalu pexe

Dogal yi architecture di jël dañuy indi njariñ ak njëgu liggéey bi ay at ci ginaaw.

Dogal yi architecture di jël dañuy indi njariñ ak njëgu liggéey bi ay at ci ginaaw. Ci jëfandikoo yu am kalite bu kawe, loolu dañu koy tekki ci sàrti liggéey yuñ mëna natt, ay peggu boroom, ak ay xew-xewu xoolaat yu bari suko defee ekip yi mëna yokk wóolu seen bopp ci barabu yokk lu jaxasoo.

Njàngalem xarala yi dafay jàppale ekip yi ñu tànn li gën, te baña yam ci li gëna bees daal.

Njàngalem xarala yi dafay jàppale ekip yi ñu tànn li gën, te baña yam ci li gëna bees daal. Ci jëfandikoo yu am kalite bu kawe, loolu dañu koy tekki ci sàrti liggéey yuñ mëna natt, ay peggu boroom, ak ay xew-xewu xoolaat yu bari suko defee ekip yi mëna yokk wóolu seen bopp ci barabu yokk lu jaxasoo.

Tanneef yu gëna baax ci wàllu ingeñër dina wàññi jafe-jafe yi ci wàllu wóor ci liggéey bi.

Tanneef yu gëna baax ci wàllu ingeñër dina wàññi jafe-jafe yi ci wàllu wóor ci liggéey bi. Ci jëfandikoo yu am kalite bu kawe, loolu dañu koy tekki ci sàrti liggéey yuñ mëna natt, ay peggu boroom, ak ay xew-xewu xoolaat yu bari suko defee ekip yi mëna yokk wóolu seen bopp ci barabu yokk lu jaxasoo.

Ëlëgu YaRN ak yokk guddaayi contexte

Extension contexte leegi dafay nekk jëf buñ miin: model yu ubbeeku yi dañuy faral di yónnee YaRN-extended variants yuy yegg ba 128K tokens wala lu ëpp loolu. Gëstu mingi jëm ci pexe yuy yokk muy wax ak zero wala jege-zero fine-tuning, boole RoPE rescaling ak tricks motif attention, ak mën topp kalite ci palanteer bi yépp te baña yam ci njeexte yi. Xaarandi boole bu gëna dëgër ci pexe yii ci tàggat-yaram bu yàgg bi nekk native moo gën retrofitted.

Doxal ci àdduna dëgg

Yaatalal ab xeetu 4K-context bu ubbeeku ba 32K wala 128K ngir tontu laaj yu guddu ci këyit ak ab ajustement bu gàtt

May sistem yuñ yokk ngir seetlu ngir mëna jël paas yu bari yuñ boole te duñu dagg

Kodu assistant yi soxla fichier depositoire bu yaatu wala fichier yu bari ci benn prompt

Defar ab xeetu base ngir waxtaan yu gudd yu bari tur yuy dajale jaar-jaari waxtaan yu bari

Modèlu jëfandikoo

YaRN ak guddaayu ëmbiit ci jëf

Yaatalal ab xeetu 4K-context bu ubbeeku ba 32K wala 128K ngir tontu laaj yu gudd ci këyitu dokimaa ak ab ajustement bu gàtt.

Yaatalal benn xeetu 4K-context bu ubbeeku ci 32K wala 128K ngir tontu laaj yu guddu ak ay njuumte yu gàtt. Ekip yi dañuy faral di am njariñ yu gëna baax suñu joxee ay threshold yu baax ci kanam, tëye yoonu escalation nit ngir mbir yu am solo, ba noppi topp njariñu produit ak njëgu njuumte ci diir bi.

YaRN ak guddaayu ëmbiit ci jëf

May sistem yuñ yokk ci seet ñu mëna jël paas yu bari yuñ boole te duñu dagg.

May sistem yu gëna yokk ngir jël ay paas yu bari yuñ boole te duñu dagg. Ekip yi dañuy faral di am njariñ yu gëna baax suñu joxee ay threshold yu baax ci kanam, tëye yoonu escalation nit ngir jafe-jafe yi, ba noppi topp njariñu produit yi ak njëgu njuumte yi ci diir bi.

YaRN ak guddaayu ëmbiit ci jëf

Kodu assistant yi soxla fichier deposit bu yaatu wala fichier yu bari ci benn prompt.

Assistant code Powering yi soxla fichier deposit bu mag wala fichier yu bari ci benn prompt Teams yi dañuy faral di am njariñ yu gëna baax suñu joxee thresholds yu baax ci kanam, tëye yoonu escalation nit ngir jafe-jafe yi, ba noppi topp njariñu produit ak njëgu njuumte ci diir bi.

YaRN ak guddaayu ëmbiit ci jëf

Defar ab xeetu base ngir waxtaan yu gudd yu bari tur yuy dajale jaar-jaari waxtaan yu bari.

Defar ab xeetu base ngir waxtaan yu gudd yu bari yuy dajale jaar-jaar yu bari ci waxtaan. Ekip yi dañuy faral di am njariñ yu gëna baax suñu joxee ay threshold yu baax ci kanam, tëye yoonu escalation nit ngir jafe-jafe yi, ba noppi topp njariñu produit ak njëgu njuumte ci diir bu gàtt.

Risk yi ak balustrade yi

!

Optimize benn benchmark mën na nëbb ñakk kattan yu gëna yaatu ci sistem bi.

!

Njëg li ñuy fay ci infrastructure yi ak ci toppatoo dañuy faral di suufeel.

!

Bu sistem yi di gëna xawa jafee xam, jafe-jafe yi am ci wàllu kaaraange ak seetlu mën nañu gëna bari.

Roadmap ngir samp gi

1

Mandargal latency, kalite, ak njëg yi laata ngay jëfandikoo.

Mandargal latency, kalite, ak njëg yi laata ngay jëfandikoo. Japp jéego bu nekk ni buntu firnde: sudee mattul kritër yi, noppalu génne gi, tëj bërëb bi, ba noppi yokk jëfandikoo gi.

2

Benchmark ci biir sargal ak done yu dëggu.

Benchmark ci biir sargal ak done yu dëggu. Japp jéego bu nekk ni buntu firnde: sudee mattul kritër yi, noppalu génne gi, tëj bërëb bi, ba noppi yokk jëfandikoo gi.

3

Jumtukaay bi di saytu njuumte yi, derive bi ak njeextalu jëfandikukat bi.

Jumtukaay bi di saytu njuumte yi, derive bi ak njeextalu jëfandikukat bi. Japp jéego bu nekk ni buntu firnde: sudee mattul kritër yi, noppalu génne gi, tëj bërëb bi, ba noppi yokk jëfandikoo gi.

4

Waajal rollback ak yooni tontu ci jafe-jafe yi laata ngay eskale.

Waajal rollback ak yooni tontu ci jafe-jafe yi laata ngay eskale. Japp jéego bu nekk ni buntu firnde: sudee mattul kritër yi, noppalu génne gi, tëj bërëb bi, ba noppi yokk jëfandikoo gi.

Weyal di banneexu