I-VISual AI GUIDE

I-Zero-1-to-3 Novel View Diffusion

Uziro-1-kuya-3 uphendula isithombe esisodwa sento sibe izithombe zaleyo nto efanayo ebonwa kunoma iyiphi i-engeli entsha, kusetshenziswa imodeli yokusabalalisa efakwe esimweni sokuzungezisa kwekhamera oyicelayo.

Uhlolojikelele

Uziro-1-kuya-3 uphendula isithombe esisodwa sento sibe izithombe zaleyo nto efanayo ebonwa kunoma iyiphi i-engeli entsha, kusetshenziswa imodeli yokusabalalisa efakwe esimweni sokuzungezisa kwekhamera oyicelayo. Kubalulekile ngoba ikuvumela ukuthi wakhe kabusha ukubuka okungaguquguquki kwe-3D ngaphandle kokuskena into ezinhlangothini eziningi.

I-Zero-1-to-3 Novel View Diffusion ingeyokugeleza komsebenzi okubonwa ngekhompyutha okuhumusha noma okukhiqiza imidiya ebonakalayo ukuze ihlaziywe, isebenze, futhi isungule.

I-Deep Dive

I-Zero-1-to-3 (kusuka e-Columbia, 2023) iculela kahle i-Stable Diffusion ukuze ikwazi ukwenza ukuhlanganiswa kokubuka kwenoveli okungasho lutho ngesithombe esifakiwe esisodwa. Uyiphakela isithombe esisodwa kanye nokuguqulwa kwekhamera ehlobene (ukuzungezisa nokuhumusha okuncane), futhi imodeli ikhiqiza ukuthi into izobukeka kanjani kulowo mbono omusha. Umbono obalulekile ukuthi amamodeli amakhulu okusatshalaliswa kwe-2D, aqeqeshwe emaqoqweni amakhulu ezithombe zewebhu, athathe ngokungaguquki okubalulekile kwejometri kanye nokomzimba mayelana nendlela izinto ezibukeka ngayo ku-3D. Ngokulungisa kahle kudathasethi yezinto zokwenziwa ezinikezwe ngama-engeli amaningi ekhamera alawulwayo (kusetshenziswa i-Objaverse), imodeli ifunda ukwenza imephu lezo zangaphambili ekulawuleni ikhamera ingcaca. Ukubuka okukhiqizwayo kungabese kuphakela ukwakhiwa kabusha kwe-3D ezansi komfula.

I-Technical Insight

Izimo zemodeli kusithombe somthombo ngezindlela ezimbili: ukushumeka kwe-CLIP kuhlanganiswe nokuma kwekhamera okuhlobene (i-azimuth, ukuphakama, irediyasi) ukuze kuqondiswe ukunakwa okuphambene, kuyilapho isithombe esiluhlaza sixhunywe kusiteshi esicashile esinomsindo ukuze kugcinwe imininingwane emihle kanye nobunikazi. Ukuqeqeshwa kusebenzisa ama-triplets esithombe-isithombe-isithombe esinikezwe kusukela kuzinto ze-CAD, ngakho-ke inethiwekhi ifunda imephu elawulekayo phakathi kokushintsha kokubuka kanye noshintsho lwephikseli oluwumphumela.

I-Mastering Zero-1-to-3 Novel View Diffusion

Uziro-1-kuya-3 uphendula isithombe esisodwa sento sibe izithombe zaleyo nto efanayo ebonwa kunoma iyiphi i-engeli entsha, kusetshenziswa imodeli yokusabalalisa efakwe esimweni sokuzungezisa kwekhamera oyicelayo. Kubalulekile ngoba ikuvumela ukuthi wakhe kabusha ukubuka okungaguquguquki kwe-3D ngaphandle kokuskena into ezinhlangothini eziningi. I-Zero-1-to-3 Novel View Diffusion ingeyokugeleza komsebenzi okubonwa ngekhompyutha okuhumusha noma okukhiqiza imidiya ebonakalayo ukuze ihlaziywe, isebenze, futhi isungule. Ukuze wakhe ukuqonda okujulile, phatha i-Zero-1-to-3 Novel View Diffusion njengemodeli yokusebenza, hhayi isici esisodwa: chaza imiphumela oyifunayo, ucacise ukucabanga, futhi uhlukanise lokho isistimu engakwenza ngokwethembeka kulokho okusadinga ukwahlulela kochwepheshe.

Empeleni, amaqembu aqinile asebenzisa ukunemba kwebhalansi ye-Zero-1-to-3 Novel View Diffusion namaqiniso okusebenza njengekhwalithi yedatha, ukuhluka kokukhanya, nokuvumelana kwamalebula. Babhala imibandela yempumelelo ecacile, ukuhlola okuqhathaniswa nedatha engokoqobo nokugeleza komsebenzi, futhi baphindaphinde ngokusekelwe kumaphethini okuhluleka aqashiwe esikhundleni sokuwina kwebhentshimakhi yesikhathi esisodwa. Yilapho ukuqonda kwethiyori kuguquka kube amandla ahlala njalo kuwo wonke umkhiqizo, inqubomgomo, kanye nokusebenza.

I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini. Ngesikhathi esifanayo, amalungelo ezithombe kanye nemvume kungaba ubungozi bomthetho uma ukutholakala kungacacile. Indlela eqine kakhulu iwukuhlanganisa isivinini sokuhlola nesiyalo sokuphatha: qhuba abashayeli bezindiza, bamba ubufakazi, ushicilele amalogi ezinqumo, futhi ubuyekeze izivikelo ngokuqhubekayo njengoba imodeli yokuziphatha, okulindelwe ngabasebenzisi, kanye nezimfuneko zokulawula zishintsha.

I-Strategic Impact

I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini.

I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.

Amathimba aqanjiwe angakwazi ukulinganisa imiqondo ngokushesha ngezibuyekezo ezimbalwa ezenziwa mathupha.

Amathimba aqanjiwe angakwazi ukulinganisa imiqondo ngokushesha ngezibuyekezo ezimbalwa ezenziwa mathupha. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.

Imisebenzi ingasebenzisa amasiginali wesithombe nawevidiyo obekunzima ukuwenza ngaphambilini.

Imisebenzi ingasebenzisa amasiginali wesithombe nawevidiyo obekunzima ukuwenza ngaphambilini. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.

Ikusasa Lokuhlukaniswa kweNoveli ka-Zero-1-to-3

I-Zero-1-to-3 ithole igagasi lamapayipi wesithombe kuya ku-3D. Abalandelayo abanjengo-Zero123-XL, i-SyncDreamer, kanye ne-One-2-3-45 basunduzela ekuhambisaneni kokubukwa okuningi kanye nokuphuma ngokushesha, okuthembeke kakhulu kwe-3D mesh, kuyilapho ukuhlanganiswa ne-Gaussian Splatting kanye namamodeli amakhulu okwakha kabusha kuncipha isikhathi sokukhiqiza kusuka kumaminithi kuya kumasekhondi. Lindela ukufana kokubuka okuqinile, ukulungiswa okuphezulu, kanye nomhlaba wangempela (hhayi nje into yokwenziwa) njengoba lawa mamodeli okusakaza alawulekayo okubuka evuthwa abe amathuluzi ajwayelekile okudala okuqukethwe.

Ukuqaliswa Komhlaba Wangempela

Ikhiqiza ukubukwa okuphendukayo kwesithombe somkhiqizo owodwa ukuze ukufakwa kuhlu kwe-e-commerce kubonise into kusuka kuzo zonke izinhlangothi

I-Bootstrapping ye-3D mesh eyenziwe ngomumo yento kusuka kusifinyezo esisodwa socingo esingajwayelekile sokuhlola kuqala kwe-AR

Ukudala ubuciko obuyisethenjwa obunama-engeli amaningi obufanayo bomlingisi noma i-prop yabaculi bomqondo wegeyimu nefilimu

Ukondla ukubukwa kwenoveli ehlanganisiwe ku-NeRF noma ukwakhiwa kabusha kwe-Gaussian Splatting ukugcwalisa ijometri engabonakali

Amaphethini Okusebenzisa

I-Zero-1-to-3 Novel View Diffusion in practice

Ikhiqiza ukubukwa okuphendukayo kwesithombe somkhiqizo owodwa ukuze ukufakwa kuhlu kwe-e-commerce kubonise into kusuka kuzo zonke izinhlangothi.

Ukukhiqiza ukubukwa okuphendukayo kwesithombe somkhiqizo owodwa ukuze ukufakwa kuhlu kwe-e-commerce kubonise into evela kuzo zonke izinhlangothi Amaqembu ngokuvamile athola imiphumela engcono uma echaza imingcele yekhwalithi ngaphambili, egcina indlela yokukhuphuka yabantu yamakesi asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.

I-Zero-1-to-3 Novel View Diffusion in practice

I-Bootstrapping ye-3D mesh ebhaliwe yento kusuka kusifinyezo esisodwa sefoni esingajwayelekile sokuhlola kuqala kwe-AR.

Ukwenza i-bootstrapping ye-3D ye-texture mesh yento kusuka kusifinyezo esisodwa sefoni esivamile sokubuka kuqala kwe-AR Amaqembu ngokuvamile athola imiphumela engcono uma echaza ikhwalithi ephezulu ngaphambili, egcina indlela yokukhuphuka yomuntu yamakesi asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.

I-Zero-1-to-3 Novel View Diffusion in practice

Ukudala ubuciko obuyisethenjwa obunama-engeli amaningi obufanayo bomlingisi noma i-prop yabaculi bomqondo wegeyimu nefilimu.

Ukudala ubuciko obuyireferensi obunama-engeli amaningi womlingiswa noma i-prop yabaculi bomqondo wegeyimu nefilimu Amaqembu ngokuvamile athola imiphumela engcono uma echaza izilinganiso zekhwalithi ngaphambili, agcina indlela yokukhuphuka yabantu yamakesi asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.

I-Zero-1-to-3 Novel View Diffusion in practice

Ukondla ukubukwa kwenoveli ehlanganisiwe ku-NeRF noma ukwakhiwa kabusha kwe-Gaussian Splatting ukugcwalisa ijometri engabonakali.

Ukondla ukubukwa kwenoveli edidiyelwe ekwakhiweni kabusha kwe-NeRF noma kwe-Gaussian Splatting ukuze kugcwalise amathimba ejiyomethri angabonakali ngokuvamile athola imiphumela engcono lapho echaza imingcele yekhwalithi ngaphambili, egcina indlela yokukhuphuka yomuntu yamacala asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.

Izingozi & Guardrails

!

Amalungelo ezithombe kanye nemvume kungaba ubungozi bezomthetho uma ukuvela kungacacile.

!

Ukusebenza kwemodeli kungahluka kukho konke ukukhanya, izibalo zabantu, kanye nezindawo.

!

Okuhle okungelona iqiniso kungase kungabonakali ngaphandle uma izinga lokuzethemba liqashelwa.

Ukuqalisa Umhlahlandlela

1

Chaza indlela yokwamukela yokunemba, ukukhumbula, nezindleko zamaphutha.

Chaza indlela yokwamukela yokunemba, ukukhumbula, nezindleko zamaphutha. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.

2

Hlola ngedatha efana nezimo zangempela zokukhiqiza.

Hlola ngedatha efana nezimo zangempela zokukhiqiza. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.

3

Engeza isibuyekezo somuntu ukuze uthole ukuzethemba okuphansi noma izibikezelo zomthelela omkhulu.

Engeza isibuyekezo somuntu ukuze uthole ukuzethemba okuphansi noma izibikezelo zomthelela omkhulu. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.

4

Landelela ukukhukhuleka kwemodeli bese uqinisekisa kabusha ngemva kwezinguquko zekhamera noma zesethi yedatha.

Landelela ukukhukhuleka kwemodeli bese uqinisekisa kabusha ngemva kwezinguquko zekhamera noma zesethi yedatha. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.

Qhubeka Uhlole