Visual AI GUIDE

Monocular Depth Estimation

Monocular kudzika fungidziro inofanotaura kuti pixel yega yega iri kure sei kubva kune imwechete yakajairika foto - hapana stereo kamera, lidar, kana kudzika sensor inodiwa.

Overview

Monocular kudzika fungidziro inofanotaura kuti pixel yega yega iri kure sei kubva kune imwechete yakajairika foto - hapana stereo kamera, lidar, kana kudzika sensor inodiwa. Iyo inobvumira imwe kamera kuona 3D chimiro kubva paflat 2D mufananidzo.

Monocular Depth Estimation ndeyekombuta-kuona mafambiro anodudzira kana kugadzira midhiya yekuona yekuongorora, mashandiro, uye kugadzira.

Deep Dive

Vanhu vanogona kutonga kudzika kubva kune rimwe ziso vachishandisa cues semaonero, saizi yehukama, magadzirirwo emagetsi, shading, uye occlusion. Monocular deep estimation inodzidzisa neural network the same trick: dyisa mune imwechete RGB mufananidzo uye inoburitsa kudzika kukosha kwepixel yega yega. Nekuti mufananidzo we 2D haunzwisisike pamusoro pechiyero chakakwana, basa racho rakaoma - mazhinji mapikicha e 3D anogona purojekiti kumufananidzo mumwe chete. Manetiweki anodzidza manhamba ekutanga kubva kumaseti makuru ekugadzirisa izvi. Kudzidzira kunouya mumhando mbiri: inotariswa, ichishandisa pasi-chokwadi kudzika kubva kune lidar kana RGB-D sensors, uye yekuzvitarisira, iyo inodzidza zvakadzama kubva pavhidhiyo kana stereo peya nekusimbisa kuti kudzika kwakafanotaurwa kunoramba nenzira kwayo maonero kune imwe. Zvichangoburwa nheyo modhi seMiDaS uye Kudzika Chero Chinhu chinowanzoitika zvinoshamisa pane zvisingaonekwe.

Technical Insight

Nzira dzekuzvitarisira dzinoshandisa geometry pachinzvimbo chemavara. Tichipihwa maonero maviri (stereo kana anoteedzana mavhidhiyo mafuremu) uye yakafanotaurwa kudzika mepu pamwe nekufamba kwekamera, modhi inochinjisa mufananidzo mumwe kuti uvakezve mumwe; iyo pixel-level rekuvaka kukanganisa inova chiratidzo chekudzidzisa. Uku kurasikirwa kwe 'kuona-synthesis' kunoreva kuti hudzamu hunogona kudzidzwa kubva muvhidhiyo yakasvibira, isina kunyorwa. Chinhu chakakosha chinodzikisira kusajeka kwechiyero: kudzika kwemonocular kunowanzo kururamisa chete kusvika kune isingazivikanwe yawandisa kunze kwekunge yakaenzaniswa nereferensi inozivikanwa kana metric supervision.

Mastering Monocular Depth Estimation

Monocular kudzika fungidziro inofanotaura kuti pixel yega yega iri kure sei kubva kune imwechete yakajairika foto - hapana stereo kamera, lidar, kana kudzika sensor inodiwa. Iyo inobvumira imwe kamera kuona 3D chimiro kubva paflat 2D mufananidzo. Monocular Depth Estimation ndeyekombuta-kuona mafambiro anodudzira kana kugadzira midhiya yekuona yekuongorora, mashandiro, uye kugadzira. Kuti uvake kunzwisisa kwakadzama, bata Monocular Depth Estimation semuenzaniso wekushandisa, kwete chinhu chimwe chete: tsanangura zvinodiwa, kujekesa fungidziro, uye patsanura izvo zvinogona kuitwa nehurongwa hwakavimbika kubva kune zvichiri kuda kutonga kwenyanzvi.

Mukuita, zvikwata zvakasimba zvinoshandisa Monocular Depth Estimation chiyero chechokwadi nemashandiro anoita semhando yedata, kusiyana kwemwenje, uye kuenderana kwemazita. Ivo vanonyora zvakajeka maitiro ebudiriro, bvunzo vachipokana ne data rechokwadi uye mafambiro ebasa, uye iterate zvichibva pane zvakacherechedzwa maitiro ekutadza kwete kuhwina-nguva imwe chete yebhenji. Apa ndipo apo kunzwisisa kwe theoretical kunoshanduka kuve kugona kwakasimba pane chigadzirwa, mutemo, uye mashandiro.

Visual AI inogona kuita otomatiki yekuongorora, yekuona, uye yekumaka mabasa pachiyero. Panguva imwecheteyo, kodzero dzeMufananidzo uye kubvumirwa kunogona kuve njodzi dzepamutemo kana hunhu husina kujeka. Nzira yakatsiga ndeyekubatanidza kukurumidza kuyedza nekutonga: mhanyisa vatyairi vendege, tora humbowo, buritsa matanda esarudzo, uye urambe uchivandudza chengetedzo semaitiro emuenzaniso, zvinotarisirwa nemushandisi, uye zvinodikanwa zvekutonga.

Strategic Impact

Visual AI inogona kuita otomatiki yekuongorora, yekuona, uye yekumaka mabasa pachiyero.

Visual AI inogona kuita otomatiki yekuongorora, yekuona, uye yekumaka mabasa pachiyero. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.

Zvikwata zvekugadzira zvinogona prototype pfungwa nekukurumidza nekudzokororwa kwemaoko mashoma.

Zvikwata zvekugadzira zvinogona prototype pfungwa nekukurumidza nekudzokororwa kwemaoko mashoma. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.

Mashandisirwo anogona kushandisa masaini emifananidzo nemavhidhiyo ayo aimbove akaoma kugadzirisa.

Mashandisirwo anogona kushandisa masaini emifananidzo nemavhidhiyo ayo aimbove akaoma kugadzirisa. Mukutumirwa kwemhando yepamusoro, izvi zvinoshandurirwa kuita mitemo inoyerwa yekushanda, miganhu yevaridzi, uye tsika dzekudzokorora dzinodzokororwa kuitira kuti zvikwata zvikwire kuvimba pane kukwidza kusajeka.

Ramangwana reMonocular Depth Estimation

Generalist kudzika kwenheyo modhi akadzidziswa pamamirioni emifananidzo yakasanganiswa ari kusundira kune yakavimbika, metric (yechokwadi-chiyero) kudzika mune chero chiitiko, kunyangwe isina kumboonekwa mukudzidziswa. Tarisira kusanganiswa kwakasimba nekuyerera kwemaziso uye SLAM yekuvakazve kwechiitiko che 3D, modhi dzakareruka dzinomhanya pamafoni nemahedhifoni, uye kusimba kwakasimba kwe zero-shot. Izvi zvichaita kuti kupfuma kwenzvimbo yekuona kuchipe uye kuve kwese kwese, kuwanikwa kubva kune chero kamera imwe chete kwete kudhura kudzika-inonzwa rigs.

Real-World Implementation

Smartphone portrait modhi inoteedzera kusviba kwemashure (bokeh) nekufungidzira chinhambwe-chinopesana neshure.

Augmented reality apps inoisa chaiwo zvinhu kuti vagare zvakanaka kuseri kwefenicha yepasirese

Drones uye marobhoti akaderera-anodzivirira zvipingamupinyi achishandisa imwe chete yekumberi-yakatarisana kamera

Shandura mafoto nemafirimu e2D kuita 3D nekudzika papixel kudzika kwestereoscopic kuratidzwa.

Maitiro Ekuita

Monocular Depth Estimation mukuita

Smartphone portrait modhi inoteedzera kusviba kwemashure (bokeh) nekufungidzira chinhambwe-nekumashure-chinhambwe.

Smartphone portrait modhi inoteedzera kumashure blur (bokeh) nekufungidzira chidzidzo-chinopesana-yekumashure kure Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi ekumucheto, uye kuteedzera zvese zvakawanikwa zvechigadzirwa uye mutengo wekukanganisa nekufamba kwenguva.

Monocular Depth Estimation mukuita

Augmented reality apps inoisa chaiwo zvinhu kuti vagare zvakanaka kuseri kwefenicha yepasirese.

Augmented reality apps inoisa zvinhu chaizvo kuitira kuti vagare zvakanaka kuseri kwefenicha yepasirese Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi ekumucheto, uye kuteedzera zvese zvakawanikwa zvechigadzirwa uye mutengo wekukanganisa nekufamba kwenguva.

Monocular Depth Estimation mukuita

Drones uye marobhoti akaderera-anodzivirira zvipingamupinyi achishandisa imwe chete yekumberi-yakatarisana kamera.

Drones uye marobhoti akaderera-anodzivirira zvipingamupinyi achishandisa imwe chete kumberi-yakatarisana kamera Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi emupendero, uye kuteedzera zvese zvakawanikwa zvechigadzirwa nemitengo yekukanganisa nekufamba kwenguva.

Monocular Depth Estimation mukuita

Kushandura mafoto nemafirimu e2D kuita 3D nekudzika papixel kudzika kwestereoscopic kuratidza.

Kushandura mapikicha e2D nemafirimu kuita 3D nekudzika papixel kudzika kwestereoscopic kuratidza Matimu anowanzo kuwana mhedzisiro iri nani kana achinge atsanangura emhando yepamusoro kumberi, chengetedza nzira yekukwira kwevanhu yemakesi emupendero, uye kuteedzera zvese zvakawanikwa zvechigadzirwa nemitengo yekukanganisa nekufamba kwenguva.

Njodzi & Guardrails

!

Kodzero dzemifananidzo uye kubvumirwa kunogona kuve njodzi dzepamutemo kana provenance isina kujeka.

!

Kuita kwemuenzaniso kunogona kusiyanisa kupenya, huwandu hwevanhu, uye nharaunda.

!

Manyepo enhema anogona kusacherechedzwa kunze kwekunge zvikumbaridzo zvekuvimba zvikatariswa.

Implementation Roadmap

1

Tsanangura maitiro ekugamuchirwa echokwadi, kurangarira, uye mutengo wekukanganisa.

Tsanangura maitiro ekugamuchirwa echokwadi, kurangarira, uye mutengo wekukanganisa. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.

2

Edzai nedata rinoenderana nemamiriro chaiwo ekugadzira.

Edzai nedata rinoenderana nemamiriro chaiwo ekugadzira. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.

3

Wedzera ongororo yemunhu kune yakaderera-kusavimbika kana yakakwirira-inokanganisa kufanotaura.

Wedzera ongororo yemunhu kune yakaderera-kusavimbika kana yakakwirira-inokanganisa kufanotaura. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.

4

Tevera modhi kudonha uye simbisa mushure mekuchinja kwekamera kana dataset.

Tevera modhi kudonha uye simbisa mushure mekuchinja kwekamera kana dataset. Bata nhanho yega yega segedhi rehumbowo: kana maitiro asina kusangana, imbomira kuburitsa, vhara gaka, uye wobva wawedzera kushandiswa.

Ramba Uchiongorora