Overview
Vision-Language-Action (VLA) models are large neural networks that take camera images together with a written instruction and directly output robot motor commands. They matter because they bring the common-sense knowledge of foundation models to physical machines, letting a single model control a robot across many tasks instead of hand-programming every behavior.
Vision-Language-Action Models for Robotics belongs to the computer vision family of AI tasks, which interpret or generate visual media for analysis, operations, and creative work.
Deep Dive
A VLA model fuses three streams: vision (camera frames), language (a goal such as 'put the cup in the sink'), and action (joint angles, gripper open/close, or end-effector velocities). Google DeepMind's RT-2 was a landmark: it took a vision-language model pretrained on web images and text, then fine-tuned it on robot trajectories, so that the same network that can answer 'what fruit is this?' also emits tokenized actions as text. Open models such as OpenVLA (7B parameters) and Physical Intelligence's pi-0 have followed. Crucially, these models show 'emergent' transfer: web knowledge (recognizing a brand logo, understanding 'smallest') carries over into manipulation, so the robot generalizes to objects and instructions it never saw during robot training.
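The three streams can be made concrete with a small interface sketch. Everything here is illustrative: `Observation`, `Action`, `dummy_policy`, and `run_episode` are hypothetical names, and the keyword-matching policy only stands in for a trained network so that the control loop itself can be run.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass
class Observation:
    image: List[List[int]]   # placeholder camera frame (rows of pixel values)
    instruction: str         # natural-language goal


@dataclass
class Action:
    joint_deltas: Sequence[float]  # change in each joint angle
    gripper_open: bool             # True = open, False = closed


# A policy maps (vision + language) to an action.
Policy = Callable[[Observation], Action]


def dummy_policy(obs: Observation) -> Action:
    """Hypothetical stand-in for a trained VLA: ignores the image and
    closes the gripper whenever the instruction mentions 'pick'."""
    wants_grasp = "pick" in obs.instruction.lower()
    return Action(joint_deltas=[0.0] * 7, gripper_open=not wants_grasp)


def run_episode(policy: Policy,
                get_frame: Callable[[], List[List[int]]],
                instruction: str,
                steps: int) -> List[Action]:
    """Closed-loop control: read a fresh frame, query the policy, repeat."""
    actions = []
    for _ in range(steps):
        obs = Observation(image=get_frame(), instruction=instruction)
        actions.append(policy(obs))
    return actions


actions = run_episode(dummy_policy, lambda: [[0]], "pick up the cup", steps=3)
```

In a real system the policy call would run model inference and each returned action would be sent to the robot controller before the next frame is read.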
Technical Insight
Most VLAs discretize continuous actions into tokens so that a transformer can predict them autoregressively, just like words. RT-2 maps each action dimension to one of 256 bins and emits the bin indices as text tokens. Newer designs such as pi-0 attach a diffusion or flow-matching 'action expert' to a pretrained vision-language backbone, producing smooth, high-frequency action chunks (for example, at 50 Hz) instead of one discrete step at a time, which helps with dexterous manipulation.
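The binning idea can be sketched in a few lines. This is a minimal illustration that assumes uniform bins and known per-dimension ranges (`low`, `high`); the function names are hypothetical, and the text above does not specify the exact ranges or vocabulary mapping RT-2 uses.

```python
from typing import List, Sequence

NUM_BINS = 256  # bins per action dimension, as described for RT-2


def action_to_tokens(action: Sequence[float],
                     low: Sequence[float],
                     high: Sequence[float],
                     num_bins: int = NUM_BINS) -> List[int]:
    """Map each continuous action dimension to a bin index in [0, num_bins - 1].
    Assumes uniform bins over [low, high]; out-of-range values are clipped."""
    tokens = []
    for value, lo, hi in zip(action, low, high):
        clipped = min(max(value, lo), hi)
        frac = (clipped - lo) / (hi - lo)
        tokens.append(min(int(frac * num_bins), num_bins - 1))
    return tokens


def tokens_to_action(tokens: Sequence[int],
                     low: Sequence[float],
                     high: Sequence[float],
                     num_bins: int = NUM_BINS) -> List[float]:
    """Invert the mapping by returning the center of each bin."""
    return [lo + (t + 0.5) / num_bins * (hi - lo)
            for t, lo, hi in zip(tokens, low, high)]


low, high = [-1.0, -1.0, 0.0], [1.0, 1.0, 1.0]
tokens = action_to_tokens([-1.0, 0.0, 1.0], low, high)   # → [0, 128, 255]
recovered = tokens_to_action(tokens, low, high)
```

The round trip loses at most one bin width per dimension, which is the precision cost of treating actions as a vocabulary.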
Mastering Vision-Language-Action Models for Robotics
To build a deeper understanding, treat Vision-Language-Action Models for Robotics as an operating model rather than a single feature: define the desired outcome, make assumptions explicit, and separate what the system can be relied on to do from what still requires expert judgment.
In practice, strong teams balance Vision-Language-Action Models for Robotics against operational realities such as data quality, lighting variation, and label consistency. They write down explicit success criteria, test on real data and workflows, and iterate on observed failure patterns rather than one-off wins. This is where theoretical understanding turns into durable capability across products, policies, and operations.
Visual AI can automate inspection, detection, and tagging tasks at scale. At the same time, image rights and consent can become a legal risk if they are not clarified. The most effective approach combines fast experimentation with governance discipline: run pilots, capture evidence, publish decision logs, and keep updating safeguards as model behavior, user expectations, and regulatory requirements evolve.
Strategic Impact
Visual AI can automate inspection, detection, and tagging tasks at scale.
Creative teams can generate concepts faster with less manual revision.
Operations can use image and video signals that were previously hard to process.
In high-quality deployments, these benefits translate into measurable operating standards, clear ownership boundaries, and regular review cycles, so that teams build confidence instead of accumulating doubt.
Real-World Applications
RT-2 controlling a Google robot to 'move the banana to the number 3', using knowledge of numbers learned from web text rather than from robot demos.
OpenVLA, an open-source 7B model, fine-tuned by labs to perform tabletop pick-and-place on low-cost arms.
Physical Intelligence's pi-0 folding laundry and clearing tables by chaining many low-level skills from a single instruction.
A warehouse arm told to 'pick up the most fragile item' and inferring which item that is from its visual appearance.
Implementation Patterns
Vision-Language-Action Models for Robotics in practice
In deployments like the ones described above, teams typically get the best results when they define quality metrics up front, keep a human escalation path for edge cases, and track production success and error rates over time.
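Tracking success rates over time and escalating to a human can be sketched with a rolling window. This is only an illustration: `RollingSuccessTracker`, the window size, and the 75% floor are assumptions chosen for the example, not values from any of the systems named here.

```python
from collections import deque
from typing import Optional


class RollingSuccessTracker:
    """Keeps the most recent task outcomes and flags when the success
    rate over a full window falls below a chosen floor."""

    def __init__(self, window: int, min_success_rate: float) -> None:
        self.outcomes = deque(maxlen=window)  # old outcomes drop off automatically
        self.min_success_rate = min_success_rate

    def record(self, success: bool) -> None:
        self.outcomes.append(success)

    def success_rate(self) -> Optional[float]:
        if not self.outcomes:
            return None  # nothing recorded yet
        return sum(self.outcomes) / len(self.outcomes)

    def needs_escalation(self) -> bool:
        # Only judge once the window is full, to avoid reacting to noise.
        if len(self.outcomes) < self.outcomes.maxlen:
            return False
        return self.success_rate() < self.min_success_rate


tracker = RollingSuccessTracker(window=4, min_success_rate=0.75)
for outcome in [True, True, False, False]:
    tracker.record(outcome)
escalate_now = tracker.needs_escalation()  # 2 of 4 succeeded, below the floor
```

When `needs_escalation()` returns True, the task would be handed to a human operator or the rollout paused, depending on the team's escalation path.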
Risks & Safeguards
Image rights and consent can become a legal risk if they are not clarified.
Model performance can vary widely across lighting, demographics, and environments.
False positives may go unnoticed unless confidence thresholds are monitored.
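One way to monitor confidence thresholds is to measure, on labeled validation data, what share of accepted predictions turn out to be wrong at each candidate threshold. The sketch below assumes predictions arrive as `(confidence, is_correct)` pairs; the function name and the sample data are hypothetical.

```python
from typing import List, Tuple

Prediction = Tuple[float, bool]  # (model confidence, whether it was correct)


def false_discovery_rate(threshold: float,
                         predictions: List[Prediction]) -> float:
    """Share of predictions accepted at `threshold` that are wrong.
    Returns 0.0 when nothing is accepted."""
    accepted = [ok for conf, ok in predictions if conf >= threshold]
    if not accepted:
        return 0.0
    return accepted.count(False) / len(accepted)


# Hypothetical labeled validation set.
validation = [(0.95, True), (0.90, True), (0.80, False),
              (0.60, True), (0.55, False), (0.30, False)]

# Sweep thresholds to see how many false positives each one lets through.
sweep = {t: false_discovery_rate(t, validation) for t in (0.5, 0.85)}
```

Re-running the sweep on fresh data after a lighting or camera change shows whether a threshold that used to be safe is now letting false positives through.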
Roadmap
Define acceptance criteria for precision, recall, and the cost of errors.
Test with data that matches real production conditions.
Add human review for low-confidence or high-impact predictions.
Track model drift and revalidate after camera or dataset changes.
Treat each step as an evidence gate: if the criteria are not met, pause the rollout, close the gap, and only then expand usage.
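An evidence gate of this kind can be written as a small check over evaluation counts. This is a sketch under stated assumptions: `Criteria`, `evaluate_gate`, the per-error costs, and the thresholds are all hypothetical and would need to be set by the team that owns the deployment.

```python
from dataclasses import dataclass
from typing import Dict, Union


@dataclass
class Criteria:
    min_precision: float
    min_recall: float
    max_error_cost: float  # budget for the weighted cost of all errors


def evaluate_gate(tp: int, fp: int, fn: int,
                  cost_fp: float, cost_fn: float,
                  criteria: Criteria) -> Dict[str, Union[float, bool]]:
    """Compute precision, recall, and total error cost from confusion
    counts, then compare them against the acceptance criteria."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    error_cost = fp * cost_fp + fn * cost_fn
    passed = (precision >= criteria.min_precision
              and recall >= criteria.min_recall
              and error_cost <= criteria.max_error_cost)
    return {"precision": precision, "recall": recall,
            "error_cost": error_cost, "passed": passed}


# Hypothetical evaluation run: 90 true positives, 10 false positives, 5 misses.
result = evaluate_gate(tp=90, fp=10, fn=5, cost_fp=1.0, cost_fn=5.0,
                       criteria=Criteria(0.85, 0.90, 40.0))
```

If `result["passed"]` is False, the rollout pauses until the gap is closed; only a passing gate justifies expanding usage.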