Overview
Monocular depth estimation predicts how far away each pixel is from a single ordinary photo, with no stereo camera, lidar, or depth sensor required. It lets a single camera perceive 3D structure from a flat 2D image.
Monocular Depth Estimation is a computer-vision workflow that interprets or generates visual media for analysis, operations, and creative work.
Deep Dive
Humans can judge depth with one eye using cues such as perspective, relative size, texture gradients, shading, and occlusion. Monocular depth estimation teaches a neural network the same trick: feed in a single RGB image and it outputs a depth value for every pixel. Because a 2D image is inherently ambiguous about absolute scale, the task is ill-posed: many 3D scenes can project to the same image. Networks learn statistical priors from large datasets to resolve this. Training comes in two flavors: supervised, using ground-truth depth from lidar or RGB-D sensors, and self-supervised, which learns depth from video or stereo pairs by enforcing that the predicted depth correctly warps one view into another. Recent foundation models such as MiDaS and Depth Anything generalize remarkably well to unseen scenes.
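The claim that many 3D scenes project to the same image can be checked with a pinhole camera model. The sketch below is illustrative only: the focal length, principal point, and the `project` and `scale_scene` helpers are assumptions made for this example, not part of any particular library.

```python
import math

def project(point, focal=500.0, cx=320.0, cy=240.0):
    """Project a camera-frame 3D point (X, Y, Z) to pixel coordinates (u, v)."""
    x, y, z = point
    if z <= 0:
        raise ValueError("point must be in front of the camera (Z > 0)")
    return (focal * x / z + cx, focal * y / z + cy)

def scale_scene(points, s):
    """Scale every point (and therefore every depth) by the same factor s."""
    return [(s * x, s * y, s * z) for x, y, z in points]

# A small hypothetical scene and a copy that is three times larger and farther away.
scene = [(0.5, -0.2, 2.0), (-1.0, 0.4, 4.0), (0.0, 0.0, 1.5)]
big_scene = scale_scene(scene, 3.0)

pixels = [project(p) for p in scene]
big_pixels = [project(p) for p in big_scene]

# Both scenes land on the same pixels, so the image alone cannot reveal the scale.
same_image = all(
    math.isclose(a, b)
    for p, q in zip(pixels, big_pixels)
    for a, b in zip(p, q)
)
print(same_image)
```

Because the scale factor cancels in X/Z and Y/Z, a network has to rely on learned priors (typical object sizes, camera heights) to pick one plausible scale.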
Technical Insight
Self-supervised methods use geometry in place of labels. Given two views (a stereo pair or consecutive video frames), a predicted depth map, and the camera motion, the model warps one image to reconstruct the other; the pixel-level reconstruction error becomes the training signal. This 'view-synthesis' loss means depth can be learned from raw, unlabeled video. A key caveat is scale ambiguity: monocular depth is usually correct only up to an unknown scale factor unless it is calibrated against a known reference or trained with metric supervision.
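A minimal sketch of the view-synthesis idea, assuming the simplest case of a rectified stereo pair, where a left-image pixel at column x appears in the right image at column x - d with disparity d = focal * baseline / depth. It works on a single image row in plain Python; the function names and toy values are hypothetical, and real training code would use a differentiable sampler over full images.

```python
def sample_linear(row, x):
    """Linearly interpolate a 1D intensity row at fractional position x (clamped to the row)."""
    x = min(max(x, 0.0), len(row) - 1.0)
    x0 = int(x)
    x1 = min(x0 + 1, len(row) - 1)
    w = x - x0
    return (1.0 - w) * row[x0] + w * row[x1]

def reconstruct_left(right_row, depth_row, focal, baseline):
    """Rebuild the left row by sampling the right row at x - disparity."""
    return [
        sample_linear(right_row, x - focal * baseline / z)
        for x, z in enumerate(depth_row)
    ]

def photometric_l1(target_row, rebuilt_row):
    """Mean absolute intensity difference: the reconstruction error used as the loss."""
    return sum(abs(a - b) for a, b in zip(target_row, rebuilt_row)) / len(target_row)

# Toy data: an intensity ramp seen by the right camera, scene at a constant depth of 25.
focal, baseline = 100.0, 0.5
right_row = [10.0 * i for i in range(10)]
true_depth = [25.0] * 10    # disparity of 2 pixels
wrong_depth = [50.0] * 10   # disparity of 1 pixel

# The "observed" left row is simulated here from the true depth.
left_row = reconstruct_left(right_row, true_depth, focal, baseline)

loss_true = photometric_l1(left_row, reconstruct_left(right_row, true_depth, focal, baseline))
loss_wrong = photometric_l1(left_row, reconstruct_left(right_row, wrong_depth, focal, baseline))
print(loss_true, loss_wrong)
```

The correct depth reproduces the left row exactly, while the wrong depth leaves a residual, which is the signal a network would minimize. No depth labels are involved at any point.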
Mastering Monocular Depth Estimation
To build a deeper understanding, treat Monocular Depth Estimation as an applied system rather than a single artifact: define the requirements, make the assumptions explicit, and separate what an automated system can do reliably from what still needs expert judgment.
In practice, strong teams measure Monocular Depth Estimation against operational factors such as data quality, lighting variation, and label consistency. They write down explicit success criteria, test against real data and real workflows, and iterate on observed failure modes rather than on one-off benchmark wins. This is where theoretical understanding turns into durable capability across product, policy, and operations.
Visual AI can automate inspection, detection, and labeling tasks at scale. At the same time, image rights and consent can become legal risks when provenance is unclear. A sound approach pairs rapid experimentation with governance: run pilots, collect evidence, publish decision logs, and keep updating safeguards as model behavior, user expectations, and regulatory requirements change.
Strategic Impact
Visual AI can automate inspection, detection, and labeling tasks at scale.
Creative teams can prototype ideas faster with fewer manual iterations.
Applications can use image and video signals that were previously hard to process.
In high-quality deployments, each of these translates into measurable operating rules, clear ownership boundaries, and repeatable review practices, so that teams scale trust rather than ambiguity.
Real-World Implementation
Smartphone portrait mode simulates background blur (bokeh) by estimating subject-versus-background distance.
Augmented reality apps place virtual objects so that they sit correctly behind real-world furniture.
Drones and low-cost robots avoid obstacles using a single forward-facing camera.
2D photos and films are converted to 3D by estimating per-pixel depth for stereoscopic display.
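The augmented reality case comes down to a per-pixel depth test: a virtual object is drawn only where it is closer to the camera than the estimated scene. The sketch below assumes the scene depth and the object depth are already in the same units (which, given scale ambiguity, needs calibration or a metric model). The function name and the tiny one-row "image" are hypothetical.

```python
def composite_with_occlusion(scene_rgb, scene_depth, obj_rgb, obj_depth):
    """Overlay a virtual object on a scene, hiding it behind closer real surfaces.

    obj_depth holds None wherever the object does not cover the pixel.
    """
    out = []
    for s_row, sz_row, o_row, oz_row in zip(scene_rgb, scene_depth, obj_rgb, obj_depth):
        out_row = []
        for s_px, s_z, o_px, o_z in zip(s_row, sz_row, o_row, oz_row):
            visible = o_z is not None and o_z < s_z
            out_row.append(o_px if visible else s_px)
        out.append(out_row)
    return out

# One row, three pixels: a sofa at 1 m, then a wall at 3 m.
scene_rgb = [["sofa", "wall", "wall"]]
scene_depth = [[1.0, 3.0, 3.0]]

# A virtual lamp placed 2 m away, covering the first two pixels only.
obj_rgb = [["lamp", "lamp", None]]
obj_depth = [[2.0, 2.0, None]]

result = composite_with_occlusion(scene_rgb, scene_depth, obj_rgb, obj_depth)
print(result)
```

The lamp is hidden behind the sofa in the first pixel, drawn in front of the wall in the second, and absent in the third. Errors in estimated depth near object boundaries show up directly as occlusion artifacts, which is why edge quality matters for this use case.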
Implementation Practices
Monocular Depth Estimation in practice
The same guidance applies to each use case above, from portrait-mode bokeh and AR occlusion to obstacle avoidance and 2D-to-3D conversion: teams usually get better results when they define quality criteria up front, keep a human escalation path for edge cases, and track both product gains and the cost of errors over time.
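For the 2D-to-3D conversion case, per-pixel depth is typically turned into disparity and used to synthesize a second view. The sketch below is one simple way to do that under stated assumptions: a hypothetical virtual right camera with a chosen focal length and baseline, nearest-pixel forward warping on a single row, and holes left as `None` rather than inpainted.

```python
def depth_to_disparity(depth_row, focal_px, baseline):
    """Disparity in pixels for a virtual stereo pair: d = focal * baseline / depth."""
    return [focal_px * baseline / z for z in depth_row]

def render_right_view(left_row, disparity_row):
    """Forward-warp a left-view row into the right view (pixel x moves to x - d).

    When two pixels land on the same target, the nearer one (larger disparity) wins.
    Targets that receive no pixel stay None (disocclusion holes).
    """
    width = len(left_row)
    right = [None] * width
    best = [-1.0] * width
    for x, (value, d) in enumerate(zip(left_row, disparity_row)):
        target = round(x - d)
        if 0 <= target < width and d > best[target]:
            right[target] = value
            best[target] = d
    return right

# Toy row: pixel "c" belongs to a near object (depth 2), the rest to the background (depth 4).
left_row = ["a", "b", "c", "d", "e"]
depth_row = [4.0, 4.0, 2.0, 4.0, 4.0]

disparity_row = depth_to_disparity(depth_row, focal_px=8.0, baseline=0.5)
right_row = render_right_view(left_row, disparity_row)
print(disparity_row, right_row)
```

The near pixel shifts farther than the background and overwrites it, and the gaps it leaves behind are the regions a production pipeline would have to fill in.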
Risks & Guardrails
Image rights and consent can become legal risks when provenance is unclear.
Model performance can vary across lighting conditions, demographics, and environments.
False positives can go unnoticed unless confidence thresholds are monitored.
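One way to act on confidence thresholds is a simple gate that flags a prediction for human review when too much of it is low-confidence. This sketch assumes a per-pixel confidence map in [0, 1] is available (many depth models do not provide one, so it may have to come from a separate uncertainty estimate). The function name and both threshold values are hypothetical.

```python
def needs_human_review(confidence_map, min_confidence=0.6, max_low_fraction=0.2):
    """Return True when the share of low-confidence pixels exceeds the allowed fraction."""
    values = [c for row in confidence_map for c in row]
    if not values:
        raise ValueError("confidence_map must not be empty")
    low = sum(1 for c in values if c < min_confidence)
    return low / len(values) > max_low_fraction

confident_frame = [[0.90, 0.80], [0.70, 0.95]]
uncertain_frame = [[0.90, 0.30], [0.20, 0.95]]

print(needs_human_review(confident_frame))
print(needs_human_review(uncertain_frame))
```

Logging how often the gate fires, per camera and per lighting condition, also gives an early view of the performance variation listed above.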
Implementation Roadmap
Define acceptance criteria for accuracy, recall, and the cost of errors.
Test with data that matches real production conditions.
Add human review for low-confidence or high-impact predictions.
Track model drift and revalidate after camera or dataset changes.
Treat each step as an evidence gate: if the criteria are not met, pause the rollout, close the gap, and only then expand usage.
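An evidence gate for accuracy can be sketched with two metrics commonly reported for depth estimation: absolute relative error and the fraction of pixels whose prediction is within a factor of 1.25 of ground truth. Because monocular predictions are often correct only up to scale, the sketch first aligns them with median scaling. The acceptance thresholds and function names are hypothetical and would need to be set per application.

```python
from statistics import median

def median_scale(pred, gt):
    """Rescale predictions so their median matches the ground-truth median."""
    s = median(gt) / median(pred)
    return [s * p for p in pred]

def abs_rel(pred, gt):
    """Mean of |pred - gt| / gt over all valid pixels."""
    return sum(abs(p - g) / g for p, g in zip(pred, gt)) / len(gt)

def delta1(pred, gt, threshold=1.25):
    """Fraction of pixels where max(pred/gt, gt/pred) is below the threshold."""
    hits = sum(1 for p, g in zip(pred, gt) if max(p / g, g / p) < threshold)
    return hits / len(gt)

def passes_gate(pred, gt, max_abs_rel=0.10, min_delta1=0.90):
    """Evidence gate: both metrics must meet their (hypothetical) thresholds."""
    aligned = median_scale(pred, gt)
    return abs_rel(aligned, gt) <= max_abs_rel and delta1(aligned, gt) >= min_delta1

# Flattened depth values in meters for a handful of validation pixels.
gt = [1.0, 2.0, 4.0, 8.0]
scaled_pred = [0.5, 1.0, 2.0, 4.0]   # correct up to an unknown scale
flat_pred = [1.0, 1.0, 1.0, 1.0]     # ignores scene structure

print(passes_gate(scaled_pred, gt))
print(passes_gate(flat_pred, gt))
```

Re-running the same gate on a fixed validation set after every camera or dataset change is one straightforward way to detect drift. If the application needs metric depth, the median-scaling step should be removed so that scale errors count against the model.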