Uhlolojikelele
I-VQ-VAE icindezela izithombe, umsindo, noma ividiyo ibe yigridi encane yamakhodi ahlukahlukene athathwe ebhukwini lekhodi elifundiwe, esikhundleni sezinombolo eziqhubekayo. Lokhu kubhodlela okuhlukile kuvumela amamodeli alandelanayo anamandla afana nama-Transformers aphathe imidiya 'njengamathokheni', njengamagama.
I-VQ-VAE kanye ne-Discrete Latents ingeyokugeleza komsebenzi wokubona ngekhompyutha okuhumusha noma okukhiqiza imidiya ebonakalayo ukuze ihlaziywe, isebenze, futhi isungule.
I-Deep Dive
I-VQ-VAE (Vector Quantized Variational Autoencoder), eyethulwe ngu-van den Oord kanye nozakwabo e-DeepMind ngo-2017, iyi-autoencoder enesikhala sayo esicashile sihlukile. Isifaki khodi siphendula isithombe sibe igridi yamavekhtha aqhubekayo; ivekhtha ngayinye ibe isihlwithwa ekufakweni kwayo okuseduze ebhukwini lekhodi elifundiwe lokushumekiwe (i-vector quantization). I-decoder yakha kabusha isithombe kusuka kulawo makhodi abaliwe. Ngenxa yokuthi ama-latenti manje ayisilulumagama esilinganiselwe sezinkomba, imodeli ehlukile ingafunda ukusatshalaliswa kwazo futhi ikhiqize okuqukethwe okusha. Le ndlela yokupheka enezigaba ezimbili inika amandla i-DALL-E 1, i-Jukebox yomculo, kanye ne-VQGAN, enezela ukulahlekelwa okubonakalayo nokumelana nokwakhiwa kabusha okubukhali. I-VQ-VAE-2 ibeke izinqumo eziningi ukuze ikhiqize izithombe ezithembeke kakhulu.
I-Technical Insight
Isinyathelo sokulinganisa (i-argmin yokubheka umakhelwane oseduze) asinamehluko, ngakho-ke i-VQ-VAE isebenzisa isilinganiso esiqondile: ama-gradient akopishwa ngokuqondile kusukela ekufakweni kwe-decoder emuva kokukhishwayo kwesifaki khodi njengokungathi ukulinganisa ubuwena. Ukuqeqeshwa kuhlanganisa ukulahlekelwa kokwakha kabusha, ukulahlekelwa kwe-codebook kudonsela ukushumeka kokuphumayo kwesifaki khodi, kanye nokulahlekelwa ukuzibophezela okugcina isifaki khodi sizinikele kumakhodi aso akhethiwe. Ukwehluleka okuvamile ukuwa kwe-codebook, lapho kusetshenziswa khona amakhodi ambalwa.
I-Mastering VQ-VAE kanye ne-Discrete Latents
I-VQ-VAE icindezela izithombe, umsindo, noma ividiyo ibe yigridi encane yamakhodi ahlukahlukene athathwe ebhukwini lekhodi elifundiwe, esikhundleni sezinombolo eziqhubekayo. Lokhu kubhodlela okuhlukile kuvumela amamodeli alandelanayo anamandla afana nama-Transformers aphathe imidiya 'njengamathokheni', njengamagama. I-VQ-VAE kanye ne-Discrete Latents ingeyokugeleza komsebenzi wokubona ngekhompyutha okuhumusha noma okukhiqiza imidiya ebonakalayo ukuze ihlaziywe, isebenze, futhi isungule. Ukuze wakhe ukuqonda okujulile, phatha i-VQ-VAE ne-Discrete Latents njengemodeli yokusebenza, hhayi isici esisodwa: chaza imiphumela efiselekayo, ucacise ukucabanga, futhi uhlukanise lokho uhlelo olungakwenza ngokwethembeka kulokho okusadinga ukwahlulela kochwepheshe.
Empeleni, amaqembu aqinile asebenzisa i-VQ-VAE kanye ne-Discrete Latents yokunemba namaqiniso okusebenza njengekhwalithi yedatha, ukuhluka kokukhanya, nokuvumelana kwamalebula. Babhala imibandela yempumelelo ecacile, ukuhlola okuqhathaniswa nedatha engokoqobo nokugeleza komsebenzi, futhi baphindaphinde ngokusekelwe kumaphethini okuhluleka aqashiwe esikhundleni sokuwina kwebhentshimakhi yesikhathi esisodwa. Yilapho ukuqonda kwethiyori kuguquka kube amandla ahlala njalo kuwo wonke umkhiqizo, inqubomgomo, kanye nokusebenza.
I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini. Ngesikhathi esifanayo, amalungelo ezithombe kanye nemvume kungaba ubungozi bomthetho uma ukutholakala kungacacile. Indlela eqine kakhulu iwukuhlanganisa isivinini sokuhlola nesiyalo sokuphatha: qhuba abashayeli bezindiza, bamba ubufakazi, ushicilele amalogi ezinqumo, futhi ubuyekeze izivikelo ngokuqhubekayo njengoba imodeli yokuziphatha, okulindelwe ngabasebenzisi, kanye nezimfuneko zokulawula zishintsha.
I-Strategic Impact
I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini.
I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.
Amathimba aqanjiwe angakwazi ukulinganisa imiqondo ngokushesha ngezibuyekezo ezimbalwa ezenziwa mathupha.
Amathimba aqanjiwe angakwazi ukulinganisa imiqondo ngokushesha ngezibuyekezo ezimbalwa ezenziwa mathupha. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.
Imisebenzi ingasebenzisa amasiginali wesithombe nawevidiyo obekunzima ukuwenza ngaphambilini.
Imisebenzi ingasebenzisa amasiginali wesithombe nawevidiyo obekunzima ukuwenza ngaphambilini. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.
Ukuqaliswa Komhlaba Wangempela
I-DALL-E 1 isebenzise ithokheni ye-VQ-VAE ehlukene ukuze i-Transformer ikhiqize izithombe njengokulandelana kwezinkomba ze-codebook.
I-VQGAN ihlanganise i-VQ-VAE nokulahlekelwa okuphikisayo nokubonayo ukuze kukhiqizwe amathokheni esithombe acwebile, anesinqumo esiphezulu sokwenziwa kobuciko.
I-Jukebox ka-OpenAI isebenzise i-VQ-VAE kumsindo ongahluziwe, icindezela umculo ube amakhodi ahlukene emodeli ekhiqizayo.
I-VQ-VAE-2 inqwabelanise ngezinto ezigcinayo ezilandelanayo ukuze kuhlanganiswe izithombe ezihlukene, ezithembekile eziqhudelana nama-GAN enkathi yayo.
Amaphethini Okusebenzisa
I-VQ-VAE kanye ne-Discrete Latents iyasebenza
I-DALL-E 1 isebenzise ithokheni ye-VQ-VAE ehlukene ukuze i-Transformer ikhiqize izithombe njengokulandelana kwezinkomba ze-codebook.
I-DALL-E 1 isebenzise ithokheni ye-VQ-VAE eqondile ukuze i-Transformer ikwazi ukukhiqiza izithombe njengokulandelana kwezinkomba ze-codebook Amaqembu ngokuvamile athola imiphumela engcono uma echaza imingcele yekhwalithi ngaphambili, egcina indlela yokukhuphuka komuntu ngamacala asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.
I-VQ-VAE kanye ne-Discrete Latents iyasebenza
I-VQGAN ihlanganise i-VQ-VAE nokulahlekelwa okuphikisayo nokubonayo ukuze kukhiqizwe amathokheni esithombe acwebile, anesinqumo esiphezulu sokwenziwa kobuciko.
I-VQGAN ehlangene ye-VQ-VAE nokulahlekelwa okuphikisayo nokucabangayo ukuze kukhiqizwe amathokheni ezithombe ahlanzekile, anesinqumo esiphezulu Amathimba akhiqiza ubuciko ngokuvamile athola imiphumela engcono lapho echaza imingcele yekhwalithi ngaphambili, egcina indlela yokukhuphuka komuntu ngamacala asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.
I-VQ-VAE kanye ne-Discrete Latents iyasebenza
I-Jukebox ka-OpenAI isebenzise i-VQ-VAE kumsindo ongahluziwe, icindezela umculo ube amakhodi ahlukene emodeli ekhiqizayo.
I-Jukebox ka-OpenAI isebenzise i-VQ-VAE kumsindo ongahluziwe, icindezela umculo ube amakhodi ahlukene Amathimba okumodela akhiqizayo ngokuvamile athola imiphumela engcono uma echaza imingcele yekhwalithi ngaphambili, egcina indlela yokukhuphuka yomuntu yamakesi asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.
I-VQ-VAE kanye ne-Discrete Latents iyasebenza
I-VQ-VAE-2 inqwabelanise ngezinto ezigcinayo ezilandelanayo ukuze kuhlanganiswe izithombe ezihlukene, ezithembekile eziqhudelana nama-GAN enkathi yayo.
I-VQ-VAE-2 enqwabelene yalezinto ezifihliwe ezihlukanisayo ukuze kuhlanganiswe izithombe ezihlukene, ezithembekile eziqhudelana nama-GAN enkathi yayo Amaqembu ngokuvamile athola imiphumela engcono uma echaza imingcele yekhwalithi ngaphambili, egcina indlela yokukhuphuka komuntu yamakesi asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.
Izingozi & Guardrails
Amalungelo ezithombe kanye nemvume kungaba ubungozi bezomthetho uma ukuvela kungacacile.
Ukusebenza kwemodeli kungahluka kukho konke ukukhanya, izibalo zabantu, kanye nezindawo.
Okuhle okungelona iqiniso kungase kungabonakali ngaphandle uma izinga lokuzethemba liqashelwa.
Ukuqalisa Umhlahlandlela
Chaza indlela yokwamukela yokunemba, ukukhumbula, nezindleko zamaphutha.
Chaza indlela yokwamukela yokunemba, ukukhumbula, nezindleko zamaphutha. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.
Hlola ngedatha efana nezimo zangempela zokukhiqiza.
Hlola ngedatha efana nezimo zangempela zokukhiqiza. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.
Engeza isibuyekezo somuntu ukuze uthole ukuzethemba okuphansi noma izibikezelo zomthelela omkhulu.
Engeza isibuyekezo somuntu ukuze uthole ukuzethemba okuphansi noma izibikezelo zomthelela omkhulu. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.
Landelela ukukhukhuleka kwemodeli bese uqinisekisa kabusha ngemva kwezinguquko zekhamera noma zesethi yedatha.
Landelela ukukhukhuleka kwemodeli bese uqinisekisa kabusha ngemva kwezinguquko zekhamera noma zesethi yedatha. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.