GUIDE teknik

Tensor

Tensor Cores ay jumtukaay lañu yuñ jagleel GPU NVIDIA yu bees yi, ñuy def matrix yu bari ak dajale ci anam wu gaaw lool.

Résumé

Tensor Cores ay jumtukaay lañu yuñ jagleel GPU NVIDIA yu bees yi, ñuy def matrix yu bari ak dajale ci anam wu gaaw lool. Mooy sabab bi tax benn GPU mëna tàggat ak doxal reso neuronal yu mag yu magnitude yu gëna gaaw ci ordinatër yu bari yi.

Tensor Cores ab bloku tabax la bu am njeexital ci kalite model bi, njëgu infrastructure bi, yeexal bi, ak wóor ci eskaal bi.

Plongeur bu xóot

Dugaale ak architecture Volta ci 2017, Tensor Cores ay sircuit yuñ jagleel ñuy xayma ab matrix bu ndaw buy yokk (D = A x B + C) ci benn jëfandikoo, moo gën ñuy def bu nekk benn-benn ci cores CUDA standard. Ndax daanaka bépp couche ci reso neuronal dafay wàññeeku ci matrix yu bari, loolu méngoo ak li IA soxla ci math. GPU bu nekk dafay yokk limuy jëfandikoo: Volta defar na 4x4 FP16, ci ganaw ga Ampere, Hopper, ak Blackwell yokk nañu ay formaa yu gëna ndaw yu melni TF32, BF16, INT8, FP8, ak FP4. Lu gëna ndaw ci njub mooy limu gëna bari ci montor bu nekk, loolu dafay yokk bu baax produit bi ci tàggat yaram ak ci inference boole ci tëye njubte gi.

Gis-gis xarala

Tensor Core dafay yokk ñaari matrix yu ndaw ba noppi dajale resultaa bi ci benn jéego buñ boole, di jàppee ni benn valeur bi ñuy dugal dañu koy jëfandikoowaat ci élément yu bari yuy génn. Dafay faral di lire ay dugal ci anam wu wàññeeku (FP16, BF16, wala FP8) waaye dafay dajale xaalis biy dawal ci anam wu gëna dëggu (dafay faral di nekk FP32) ngir wàññi njuumti yi ci wërsëg. Biblioteek losisel yu melni cuBLAS ak cuDNN, ak kaadar yu melni PyTorch, dañuy boole matris yu mag yi ci blok yu ndaw yii ci saasi suko defee model yi mëna gaaw te duñu soxla kodage manuel.

xam tensor cores

Tensor Cores ay jumtukaay lañu yuñ jagleel GPU NVIDIA yu bees yi, ñuy def matrix yu bari ak dajale ci anam wu gaaw lool. Mooy sabab bi tax benn GPU mëna tàggat ak doxal reso neuronal yu mag yu magnitude yu gëna gaaw ci ordinatër yu bari yi. Tensor Cores ab bloku tabax la bu am njeexital ci kalite model bi, njëgu infrastructure bi, yeexal bi, ak wóor ci eskaal bi. Ngir tabax xam-xam bu xóot, jàppal Tensor Cores ni xeetu liggéey, du benn man-man: leeral njariñ yi nga bëgg, leeral xalaat yi, ba noppi tàqale li sistem bi mëna def ci anam wu wóor ak li ba leegi soxla àtteb kàngam.

Ci jëf, ekip yu am doole yiy jëfandikoo Tensor Cores dañuy gëna baaxal architecture, done, ak tànneefi infrastructure ci wàllu wóor ak njëg. Dañuy bind kritër yu leer ngir am ndam, natt leen ci done yu dëggu ak def liggéey, ba noppi ñu baamtu ci anamu ñàkka mëna seetlu, du ci benn yoon benchmark wins. Mooy barab bi xam-xam theorie bi di soppiku nekk kàttan buy yàgg ci produit yi, ci politik yi ak ci liggéey yi.

Dogal yi architecture di jël dañuy indi njariñ ak njëgu liggéey bi ay at ci ginaaw. Ci jamano jooju, Optimisation benn benchmark mën na nëbb ñakk kattan yu gëna yaatu ci sistem bi. Xeetu jëf bi gëna dëgër mooy boole gaawaayu jàngat ak disipline nguur: doxal pilote, jàpp firnde, siiwal dogal yi, ak wéy di yeesal kaaraange gi ci anam wi ñuy doxalee, li jëfandikukat bi di xaar, ak sàrti sàrt yi di jëm kanam.

njeextalu pexe

Dogal yi architecture di jël dañuy indi njariñ ak njëgu liggéey bi ay at ci ginaaw.

Dogal yi architecture di jël dañuy indi njariñ ak njëgu liggéey bi ay at ci ginaaw. Ci jëfandikoo yu am kalite bu kawe, loolu dañu koy tekki ci sàrti liggéey yuñ mëna natt, ay peggu boroom, ak ay xew-xewu xoolaat yu bari suko defee ekip yi mëna yokk wóolu seen bopp ci barabu yokk lu jaxasoo.

Njàngalem xarala yi dafay jàppale ekip yi ñu tànn li gën, te baña yam ci li gëna bees daal.

Njàngalem xarala yi dafay jàppale ekip yi ñu tànn li gën, te baña yam ci li gëna bees daal. Ci jëfandikoo yu am kalite bu kawe, loolu dañu koy tekki ci sàrti liggéey yuñ mëna natt, ay peggu boroom, ak ay xew-xewu xoolaat yu bari suko defee ekip yi mëna yokk wóolu seen bopp ci barabu yokk lu jaxasoo.

Tanneef yu gëna baax ci wàllu ingeñër dina wàññi jafe-jafe yi ci wàllu wóor ci liggéey bi.

Tanneef yu gëna baax ci wàllu ingeñër dina wàññi jafe-jafe yi ci wàllu wóor ci liggéey bi. Ci jëfandikoo yu am kalite bu kawe, loolu dañu koy tekki ci sàrti liggéey yuñ mëna natt, ay peggu boroom, ak ay xew-xewu xoolaat yu bari suko defee ekip yi mëna yokk wóolu seen bopp ci barabu yokk lu jaxasoo.

Ëlëgu Tensor Cores

Tensor Cores wéyam di dem ci gëna néew njub: Hopper yokk FP8 ak Blackwell dugal 4-bit FP4 ak scaling hardware-managed, lu tollu ci ñaari yoon produit bi jéego bu nekk ngir liggéey bu diis. Xaarandi ndimmbal bu gëna dëgër ngir sparsity (teggi poids nul), formaa microscaling yuy takk facteur scale ci bloc yu ndaw yu lim, ak boole bu gëna xóot ak sistemu mémoire ngir core yi mëna des. Lu model yi di gëna màgg, motëru matrix bi, du gaawaayu montor bu ñor bi, mooy des ci xeex bi gëna mag ci performance hardware IA.

Doxal ci àdduna dëgg

Taggat xeetu làkk yu mag yu melni transformatër yu nuroo ak GPT, fu ay miliyaar ciy bari matrix jéego bu nekk di dox ci kaw Tensor Cores ci BF16 wala FP8.

Doxal ay gis-gis ci jamono dëgg ngir chatbots ak defarkati nataal, jëfandikoo INT8 wala FP8 ngir mëna jàppale jëfandikukat yu bari ci GPU bu nekk.

Gaawaay NVIDIA DLSS ci jeu video, fu reso neuronal di yokk kadre yu am dayo bu néew di jëfandikoo Tensor Cores ci kadre bu nekk.

Gaawaale ordinatër yu gëstukat yi lu ci melni pliage protein (AlphaFold) ak modeli meteo yuñ defaraat ñu nekk ay liggéey yu diis ci matrix.

Modèlu jëfandikoo

Tensor Cores ci jëf

Taggat xeetu làkk yu mag yu melni transformatër yu nuroo ak GPT, fu ay miliyaar ciy bari matrix jéego bu nekk di dox ci kaw Tensor Cores ci BF16 wala FP8.

Taggat xeetu làkk yu mag yu melni transformatër yu nuroo ak GPT, fu ay miliyaar ci matrix yu bari ci jéego bu nekk di dox ci Tensor Cores ci BF16 wala FP8 Teams dañuy faral di am njariñ yu gëna baax suñu joxee threshold yu baax ci kanam, tëye yoonu escalation nit ngir jafe-jafe yi, ak topp njuréefi produit ci diir bi ak e.

Tensor Cores ci jëf

Doxal ay gis-gis ci jamono dëgg ngir chatbots ak defarkati nataal, jëfandikoo INT8 wala FP8 ngir mëna jàppale jëfandikukat yu bari ci GPU bu nekk.

Dawal inference ci jamono dëgg ngir chatbots ak generatëru nataal, jëfandikoo INT8 wala FP8 quantization ngir liggéey jëfandikukat yu bari ci GPU Teams dañuy faral di am njariñ yu gëna baax suñu joxee threshold yu baax ci kanam, tëye yoonu escalation nit ngir jafe-jafe yi, ak topp njuréefi produit ak njëgu njuumte ci diir bi.

Tensor Cores ci jëf

Gaawaay NVIDIA DLSS ci jeu video, fu reso neuronal di yokk kadre yu am dayo bu néew di jëfandikoo Tensor Cores ci kadre bu nekk.

Gaawaay NVIDIA DLSS ci jeu video, fu reso neuronal upscales kadre yu gëna néew resolusioŋ jëfandikoo Tensor Cores kadre bu nekk. Ekip yi dañuy faral di am njariñ yu gëna baax suñu joxee threshold yu baax ci kanam, tëye yoonu escalation nit ngir jafe-jafe yi, ba noppi topp njariñu produit yi ak njuumte yi.

Tensor Cores ci jëf

Gaawaale ordinatër yu gëstukat yi lu ci melni pliage protein (AlphaFold) ak modeli meteo yuñ defaraat ñu nekk ay liggéey yu diis ci matrix.

Gaawaay ordinatër gëstukat yu melni protein-folding (AlphaFold) ak xeetu meteo yuñ defaraat ni matrix-neural workloads Teams yi dañuy faral di am njariñ yu gëna baax suñu leeralee threshold yu baax ci kanam, tëye yoonu escalation nit ngir jafe-jafe yi, ak topp njuumte yi ci diiru produit ak diir bi.

Risk yi ak balustrade yi

!

Optimize benn benchmark mën na nëbb ñakk kattan yu gëna yaatu ci sistem bi.

!

Njëg li ñuy fay ci infrastructure yi ak ci toppatoo dañuy faral di suufeel.

!

Bu sistem yi di gëna xawa jafee xam, jafe-jafe yi am ci wàllu kaaraange ak seetlu mën nañu gëna bari.

Roadmap ngir samp gi

1

Mandargal latency, kalite, ak njëg yi laata ngay jëfandikoo.

Mandargal latency, kalite, ak njëg yi laata ngay jëfandikoo. Japp jéego bu nekk ni buntu firnde: sudee mattul kritër yi, noppali génne gi, tëj bërëb bi, ba noppi nga yaatal jëfandikoo gi.

2

Benchmark ci biir sargal ak done yu dëggu.

Benchmark ci biir sargal ak done yu dëggu. Japp jéego bu nekk ni buntu firnde: sudee mattul kritër yi, noppali génne gi, tëj bërëb bi, ba noppi nga yaatal jëfandikoo gi.

3

Jumtukaay bi di saytu njuumte yi, derive bi ak njeextalu jëfandikukat bi.

Jumtukaay bi di saytu njuumte yi, derive bi ak njeextalu jëfandikukat bi. Japp jéego bu nekk ni buntu firnde: sudee mattul kritër yi, noppali génne gi, tëj bërëb bi, ba noppi nga yaatal jëfandikoo gi.

4

Waajal rollback ak yooni tontu ci jafe-jafe yi laata ngay eskale.

Waajal rollback ak yooni tontu ci jafe-jafe yi laata ngay eskale. Japp jéego bu nekk ni buntu firnde: sudee mattul kritër yi, noppali génne gi, tëj bërëb bi, ba noppi nga yaatal jëfandikoo gi.

Weyal di banneexu