I-VISual AI GUIDE

I-Make-A-Video Text-to-Video

I-Make-A-Video iwuhlelo luka-Meta luka-2022 olushintsha ukwaziswa kombhalo ube isiqeshana sevidiyo esifushane ngaphandle kokuqeqeshwa ngamapheya anelebula yombhalo-vidiyo.

Uhlolojikelele

I-Make-A-Video iwuhlelo luka-Meta luka-2022 olushintsha ukwaziswa kombhalo ube isiqeshana sevidiyo esifushane ngaphandle kokuqeqeshwa ngamapheya anelebula yombhalo-vidiyo. Kubalulekile ngoba kubonise ukuthi ulwazi olubonakalayo ngaphakathi kwamamodeli wombhalo uye esithombeni 'lungafundiswa' ukunyakaza kusetshenziswa ividiyo engenamalebula kuphela.

I-Make-A-Video Text-to-Video ingeyokugeleza komsebenzi okubonwa ngekhompyutha okuhumusha noma okukhiqiza imidiya ebonakalayo ukuze ihlaziywe, isebenze, futhi isungule.

I-Deep Dive

I-Make-A-Video, imenyezelwe yi-Meta AI ngo-Septhemba 2022, ikhiqiza imizuzwana embalwa yevidiyo emshweni ofana 'nenja egqoke ikepisi leqhawe elindiza esibhakabhakeni.' Iqhinga layo eliyinhloko ukuhlukanisa ukubukeka kusuka ekunyakazeni: imodeli yombhalo uye esithombeni (eyakhelwe esikhaleni sesithombe esihlanganyelwe sesitayela se-CLIP) ifunda ukuthi izinto zibukeka kanjani ezigidini zezithombe ezinamagama-ncazo, kuyilapho izendlalelo ze-spatiotemporal ezihlukene zifunda ukuthi izinto zihamba kanjani kuvidiyo engenamalebula iyodwa. Lokhu kubeka eceleni ukushoda kwamapheya evidiyo yombhalo wekhwalithi ephezulu. Imodeli eyisisekelo ikhiqiza iziqeshana ezinokulungiswa okuphansi, ezinesilinganiso esiphansi, bese amanethiwekhi azinikele ahlanganisa ozimele abengeziwe kanye nokulungiswa kwendawo ephezulu. Umphumela ubuhambisana ngokumangalisayo ngesikhathi sawo, nakuba iziqeshana zazimfushane, zifiphele, futhi zijwayele ukucwayiza futhi zinyakaze.

I-Technical Insight

I-Make-A-Video inweba ukuguqulwa kwesizukulwane sesithombe se-2D nokunaka ku-3D ngokungeza izendlalelo zesikhashana-mbumbulu. Izisindo zendawo eziqeqeshelwe kusengaphambili ziyaqandiswa noma zicushwe kahle kuyilapho izendlalelo zesikhashana ezintsha zifunda ukunyakaza kuvidiyo eluhlaza, ngakho awekho amalebula evidiyo yombhalo adingekayo. Inethiwekhi yokuhumusha kozimele ibe isigxilisa umugqa wesikhathi kanye namamojula okusabalalisa okucacayo okuphezulu aphakamisa imininingwane yendawo, aguqule uhlaka olumahhadla-16, olunokukhanya okuphansi lube isiqeshana esibushelelezi, esibukhali epayipini elijikijelayo.

Ukwenza I-Make-A-Video Umbhalo-kuya-Ividiyo

I-Make-A-Video iwuhlelo luka-Meta luka-2022 olushintsha ukwaziswa kombhalo ube isiqeshana sevidiyo esifushane ngaphandle kokuqeqeshwa ngamapheya anelebula yombhalo-vidiyo. Kubalulekile ngoba kubonise ukuthi ulwazi olubonakalayo ngaphakathi kwamamodeli wombhalo uye esithombeni 'lungafundiswa' ukunyakaza kusetshenziswa ividiyo engenamalebula kuphela. I-Make-A-Video Text-to-Video ingeyokugeleza komsebenzi okubonwa ngekhompyutha okuhumusha noma okukhiqiza imidiya ebonakalayo ukuze ihlaziywe, isebenze, futhi isungule. Ukuze wakhe ukuqonda okujulile, phatha i-Make-A-Video Text-to-Video njengemodeli yokusebenza, hhayi isici esisodwa: chaza imiphumela oyifunayo, ucacise ukucabanga, futhi uhlukanise lokho isistimu engakwenza ngokwethembeka kulokho okusadinga ukwahlulela kochwepheshe.

Empeleni, amaqembu aqinile asebenzisa ukunemba kwebhalansi ye-Make-A-Video Text-to-Video namaqiniso okusebenza njengekhwalithi yedatha, ukuhluka kokukhanya, nokuvumelana kwamalebula. Babhala imibandela yempumelelo ecacile, ukuhlola okuqhathaniswa nedatha engokoqobo nokugeleza komsebenzi, futhi baphindaphinde ngokusekelwe kumaphethini okuhluleka aqashiwe esikhundleni sokuwina kwebhentshimakhi yesikhathi esisodwa. Yilapho ukuqonda kwethiyori kuguquka kube amandla ahlala njalo kuwo wonke umkhiqizo, inqubomgomo, kanye nokusebenza.

I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini. Ngesikhathi esifanayo, amalungelo ezithombe kanye nemvume kungaba ubungozi bomthetho uma ukutholakala kungacacile. Indlela eqine kakhulu iwukuhlanganisa isivinini sokuhlola nesiyalo sokuphatha: qhuba abashayeli bezindiza, bamba ubufakazi, ushicilele amalogi ezinqumo, futhi ubuyekeze izivikelo ngokuqhubekayo njengoba imodeli yokuziphatha, okulindelwe ngabasebenzisi, kanye nezimfuneko zokulawula zishintsha.

I-Strategic Impact

I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini.

I-Visual AI ingakwazi ukuhlola, ukutholwa, nokumaka imisebenzi esikalini. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.

Amathimba aqanjiwe angakwazi ukulinganisa imiqondo ngokushesha ngezibuyekezo ezimbalwa ezenziwa mathupha.

Amathimba aqanjiwe angakwazi ukulinganisa imiqondo ngokushesha ngezibuyekezo ezimbalwa ezenziwa mathupha. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.

Imisebenzi ingasebenzisa amasiginali wesithombe nawevidiyo obekunzima ukuwenza ngaphambilini.

Imisebenzi ingasebenzisa amasiginali wesithombe nawevidiyo obekunzima ukuwenza ngaphambilini. Ekusetshenzisweni kwekhwalithi ephezulu, lokhu kuhunyushwa emithethweni yokusebenza elinganisekayo, imingcele yobunikazi, nemikhuba yokubuyekeza ephindelelayo ukuze amaqembu akwazi ukukala ukuzethemba esikhundleni sokukala ukungaqondakali.

Ikusasa le-Make-A-Video Text-to-Video

Isithombe se-Make-A-Video's image-prior-plus-unlabel-motion recipe sikhiqize wonke amagagasi ombhalo ukuya kuvidiyo. Inzalo yayo igcizelela iziqeshana ezinde, ezinokulungiswa okuphezulu, ezinzile okwesikhashana ezinokunyakaza kwekhamera elawulekayo nomsindo. Lindela umqondo oyinhloko, ukusebenzisa kabusha ulwazi olukhulu lwesithombe nokufunda ukunyakaza okushibhile, ukuqhubeka ngisho noma izakhiwo zishintshela ekusakazeni okucashile okusekelwe ku-transformer kanye namamodeli ahlanganisiwe amukela isimo sesithombe noma sevidiyo ukuze sihlelwe futhi siqhubeke.

Ukuqaliswa Komhlaba Wangempela

Ukugqwayiza umusho owodwa ochazayo ube isiqeshana esifushane sokuthunyelwe kwenkundla yezokuxhumana

Ukuletha umqondo omile njengokuthi 'ibhere lika-teddy lipenda isithombe' njengomfanekiso onyakazayo

Ukuhunyushwa phakathi kwezithombe ezimbili ezimile ezihlinzekwe ngabasebenzisi ukudala ividiyo yoguquko ebushelelezi

Ikhiqiza okusalungiswa okunyakazayo okusheshayo kwezigcawu ezicatshangelwayo zokuqoshwa kwendaba ngaphambi kwanoma yikuphi ukuqoshwa

Amaphethini Okusebenzisa

I-Make-A-Video Text-to-Video in practice

Ukugqwayiza umusho owodwa ochazayo ube isiqeshana esifushane sokuthunyelwe kwenkundla yezokuxhumana.

Ukugqwayiza umusho owodwa ochazayo ube isiqeshana esifushane sokuthunyelwe kwenkundla yezokuxhumana Amaqembu ngokuvamile athola imiphumela engcono uma echaza ikhwalithi ephezulu ngaphambili, egcina indlela yokukhuphuka yabantu yamakesi asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.

I-Make-A-Video Text-to-Video in practice

Ukuletha umqondo omile ofana 'nebhere elipenda umdwebo' njengomfanekiso onyakazayo.

Ukuletha umqondo omile njengokuthi 'ibhere elipenda umdwebo' liphile njengomfanekiso onyakazayo Amathimba ngokuvamile athola imiphumela engcono uma echaza izinga eliphezulu ngaphambili, egcina indlela yokukhuphuka yomuntu yamakesi asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.

I-Make-A-Video Text-to-Video in practice

Ukuhunyushwa phakathi kwezithombe ezimbili ezimile ezihlinzekwe ngabasebenzisi ukudala ividiyo yoguquko ebushelelezi.

Ukuhunyushwa phakathi kwezithombe ezimbili ezimile ezihlinzekwe ngabasebenzisi ukudala ividiyo yoshintsho olushelelayo Amathimba ngokuvamile athola imiphumela engcono uma echaza izinga eliphezulu ngaphambili, egcina indlela yokukhuphuka yabantu yamakesi asemaphethelweni, futhi elandelela kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.

I-Make-A-Video Text-to-Video in practice

Ikhiqiza okusalungiswa okunyakazayo okusheshayo kwezigcawu ezicatshangelwayo zokuqoshwa kwendaba ngaphambi kwanoma yikuphi ukuqoshwa.

Ukukhiqiza uhlaka olunyakazayo olusheshayo lwezigcawu ezicatshangelwayo zokuqoshwa kwezindaba ngaphambi kwanoma yimaphi Amathimba aqoshwayo ngokuvamile athola imiphumela engcono uma echaza ikhwalithi ephezulu ngaphambili, egcina indlela yokukhuphuka yabantu yamakesi asemaphethelweni, futhi alandelele kokubili izinzuzo zokukhiqiza nezindleko zamaphutha ngokuhamba kwesikhathi.

Izingozi & Guardrails

!

Amalungelo ezithombe kanye nemvume kungaba ubungozi bezomthetho uma ukuvela kungacacile.

!

Ukusebenza kwemodeli kungahluka kukho konke ukukhanya, izibalo zabantu, kanye nezindawo.

!

Okuhle okungelona iqiniso kungase kungabonakali ngaphandle uma izinga lokuzethemba liqashelwa.

Ukuqalisa Umhlahlandlela

1

Chaza indlela yokwamukela yokunemba, ukukhumbula, nezindleko zamaphutha.

Chaza indlela yokwamukela yokunemba, ukukhumbula, nezindleko zamaphutha. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.

2

Hlola ngedatha efana nezimo zangempela zokukhiqiza.

Hlola ngedatha efana nezimo zangempela zokukhiqiza. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.

3

Engeza isibuyekezo somuntu ukuze uthole ukuzethemba okuphansi noma izibikezelo zomthelela omkhulu.

Engeza isibuyekezo somuntu ukuze uthole ukuzethemba okuphansi noma izibikezelo zomthelela omkhulu. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.

4

Landelela ukukhukhuleka kwemodeli bese uqinisekisa kabusha ngemva kwezinguquko zekhamera noma zesethi yedatha.

Landelela ukukhukhuleka kwemodeli bese uqinisekisa kabusha ngemva kwezinguquko zekhamera noma zesethi yedatha. Phatha isinyathelo ngasinye njengesango lobufakazi: uma imibandela ingafinyelelwa, misa ukukhishwa, vala igebe, bese unweba ukusetshenziswa.

Qhubeka Uhlole