Language AI Guide

Process Supervision for Math Reasoning

Overview

Process supervision rewards a model for each correct step in a chain of reasoning, not just the final answer. In math, where a single error invalidates everything that follows, scoring the work itself produces more reliable solutions.

Process Supervision for Math Reasoning is part of the language-AI toolkit used to read, generate, classify, and transform text and speech at scale.

Deep Dive

Most reward models score only the final answer (outcome supervision). This lets a model "get lucky": it reaches the right number through flawed steps whose errors happen to cancel out. Process supervision instead trains a Process Reward Model (PRM) on human or AI labels that mark each intermediate step as correct, incorrect, or neutral. OpenAI's 2023 paper "Let's Verify Step by Step" released PRM800K, roughly 800,000 step-level labels on MATH problems, and showed a process-supervised verifier that solved 78% of a test subset, outperforming an outcome-only baseline. A PRM is used to score many sampled solutions and pick the chain with the best step-level scores. It also gives interpretable feedback: you can see exactly where the reasoning broke.
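The "get lucky" failure can be made concrete with a toy example. The sketch below is illustrative only: `Step`, `outcome_reward`, and `process_labels` are hypothetical names, and an exact arithmetic check stands in for the human or AI labelers that a real PRM would be trained on.

```python
import operator
from typing import List, NamedTuple

OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}

class Step(NamedTuple):
    a: int
    op: str
    b: int
    claimed: int  # the result the model wrote down for this step

def outcome_reward(steps: List[Step], expected: int) -> float:
    """Outcome supervision: 1.0 if the final number matches, else 0.0."""
    return 1.0 if steps[-1].claimed == expected else 0.0

def process_labels(steps: List[Step]) -> List[bool]:
    """Process supervision: one correctness label per step."""
    return [OPS[s.op](s.a, s.b) == s.claimed for s in steps]

# Problem: (2 + 3) * 2 = 10.
# "lucky" makes two errors that cancel; "sound" gets every step right.
lucky = [Step(2, "+", 3, 6), Step(6, "*", 2, 10)]
sound = [Step(2, "+", 3, 5), Step(5, "*", 2, 10)]

for name, chain in [("lucky", lucky), ("sound", sound)]:
    print(name, outcome_reward(chain, expected=10), process_labels(chain))
```

Both chains earn the same outcome reward, but only the step-level labels separate the sound derivation from the lucky one.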

Technical Insight

At test time the model samples many candidate solutions. The PRM scores each step, and the overall score of a solution is typically the product (or the minimum) of the per-step correctness probabilities. "Best-of-N" then selects the highest-scoring chain. Because credit is assigned locally, the training signal is denser and less noisy than a single end-of-sequence reward, which reduces reward hacking, where flawed steps happen to yield correct answers.
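The selection rule described above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: `aggregate`, `best_of_n`, and the lookup-table `fake_prm` are hypothetical stand-ins, and the per-step probabilities are made-up numbers rather than outputs of a trained PRM.

```python
import math
from typing import Callable, Sequence, Tuple

Solution = Tuple[str, ...]  # a chain of reasoning steps

def aggregate(step_probs: Sequence[float], how: str = "product") -> float:
    """Collapse per-step correctness probabilities into one solution score."""
    if not step_probs:
        raise ValueError("a solution needs at least one scored step")
    if how == "product":
        return math.prod(step_probs)
    if how == "min":
        return min(step_probs)
    raise ValueError(f"unknown aggregation: {how}")

def best_of_n(
    candidates: Sequence[Solution],
    score_steps: Callable[[Solution], Sequence[float]],
    how: str = "product",
) -> Solution:
    """Return the candidate whose aggregated step score is highest."""
    return max(candidates, key=lambda c: aggregate(score_steps(c), how))

# Made-up step probabilities standing in for a trained PRM (assumption).
long_chain: Solution = ("s1", "s2", "s3", "s4")
shaky_chain: Solution = ("t1", "t2")
short_chain: Solution = ("u1",)
SCORES = {
    long_chain: [0.9, 0.9, 0.9, 0.9],
    shaky_chain: [0.99, 0.6],
    short_chain: [0.7],
}

def fake_prm(solution: Solution) -> Sequence[float]:
    return SCORES[solution]

candidates = [long_chain, shaky_chain, short_chain]
print(best_of_n(candidates, fake_prm, how="product"))
print(best_of_n(candidates, fake_prm, how="min"))
```

With these numbers the two aggregations disagree: the product penalizes the four-step chain for its length, while the minimum only looks at each chain's weakest step. Which behavior is preferable depends on the task.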

Mastering Process Supervision for Math Reasoning

To build a deeper understanding, treat Process Supervision for Math Reasoning as an operating model rather than a single feature: define the desired outcome, make assumptions explicit, and separate what the system can do reliably from what still requires expert judgment.

In practice, strong teams using Process Supervision for Math Reasoning treat design, retrieval, and review loops as one connected system. They write explicit success criteria, test against real data and workflows, and iterate on observed failure patterns rather than one-off wins. This is where theoretical understanding turns into durable capability across products, policy, and operations.

Language workflows can move faster without sacrificing consistency. At the same time, hallucinated content can quietly slip into reports, support flows, or research artifacts. The most effective approach pairs rapid experimentation with governance discipline: run pilots, capture evidence, publish decision logs, and keep safeguards up to date as model behavior, user expectations, and regulatory requirements evolve.

Strategic Impact

Language workflows can move faster without sacrificing consistency.

In effective deployments, this translates into measurable operating standards, clear ownership boundaries, and a regular review cadence, so teams build confidence instead of accumulating doubt.

It broadens access across languages and communication styles.

Teams can spend more time on judgment while automation handles the repetition.

The Future of Process Supervision for Math Reasoning

Hand-labeling steps is expensive, so research is shifting toward automated process supervision: using Monte Carlo rollouts (Math-Shepherd) to estimate the value of each step without human labels, or having a model supply weak step-level judgments. Expect PRMs to drive more efficient reinforcement learning, not just reranking, and to spread beyond math into code, scientific proofs, and multi-step planning, where step-level correctness matters just as much.
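The rollout idea can be sketched as follows, in the spirit of Math-Shepherd rather than as a reproduction of it. For each prefix of a solution, a completer finishes the problem several times, and the step's value is the fraction of completions that reach the known correct answer. The names `mc_step_values`, `hard_labels`, and `toy_rollout` are hypothetical, and the toy completer is deterministic purely so the example is reproducible; a real completer would be a stochastic language model.

```python
from typing import Callable, List, Sequence

def mc_step_values(
    steps: Sequence[str],
    rollout: Callable[[Sequence[str]], str],
    is_correct: Callable[[str], bool],
    n_rollouts: int = 8,
) -> List[float]:
    """Soft label per step: share of rollouts from that prefix that succeed."""
    values = []
    for i in range(1, len(steps) + 1):
        prefix = steps[:i]
        hits = sum(is_correct(rollout(prefix)) for _ in range(n_rollouts))
        values.append(hits / n_rollouts)
    return values

def hard_labels(values: Sequence[float]) -> List[bool]:
    """Hard label per step: True if at least one rollout succeeded."""
    return [v > 0 for v in values]

# Toy completer (assumption): once a prefix contains a bad step, every
# completion ends in the wrong answer; otherwise it reaches "42".
def toy_rollout(prefix: Sequence[str]) -> str:
    return "0" if any(s.startswith("BAD") for s in prefix) else "42"

steps = ["x = 6 * 7", "BAD: x = 48", "answer = x"]
values = mc_step_values(steps, toy_rollout, lambda ans: ans == "42", n_rollouts=4)
print(values, hard_labels(values))
```

No human labeled any step here: the only supervision is the final answer, yet the value drops at the step where the chain stops being recoverable.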

Real-World Applications

OpenAI's PRM800K dataset: 800K human step-level labels used to train verifiers on the MATH benchmark

Math-Shepherd: automatically labels step-level correctness via Monte Carlo rollouts to avoid costly human annotation

Best-of-N reranking: generate 256 solutions and select the one the PRM scores highest across its steps

Tutoring tools that highlight the exact line in a student's work where the error first appears
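The tutoring use case reduces to finding the first step the PRM distrusts. A minimal sketch, assuming the PRM has already produced one correctness probability per line; `first_flagged_step`, the 0.5 threshold, and the example scores are all illustrative assumptions.

```python
from typing import Optional, Sequence

def first_flagged_step(
    step_probs: Sequence[float], threshold: float = 0.5
) -> Optional[int]:
    """Index of the first step scored below the threshold, or None."""
    for i, p in enumerate(step_probs):
        if p < threshold:
            return i
    return None

student_work = [
    "3x + 6 = 21",
    "3x = 15",
    "x = 6",        # slip: 15 / 3 is 5
    "check: 3 * 6 + 6 = 21",
]
prm_scores = [0.98, 0.95, 0.12, 0.08]  # made-up PRM outputs

idx = first_flagged_step(prm_scores)
if idx is not None:
    print(f"First suspicious line ({idx + 1}): {student_work[idx]}")
```

Flagging only the first low-scoring line matters because every later line inherits the error and would be flagged too.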

Implementation Patterns

Process Supervision for Math Reasoning in practice

Across the applications above (verifiers trained on PRM800K, Math-Shepherd's automatic labeling, Best-of-N reranking, and tutoring tools), teams typically get better results when they define quality metrics up front, keep a human escalation path for edge cases, and track throughput gains and error costs over time.

Risks & Safeguards

Hallucinated content can quietly slip into reports, support flows, or research artifacts.

Prompt sensitivity can produce inconsistent results across similar requests.

Sensitive text data can be exposed if access controls are weak.

Roadmap

1

Define the output format, tone, and quality metrics before rollout.

Treat each step as an evidence gate: if the criteria are not met, pause the rollout, close the gap, and only then expand usage.

2

Ground answers in trusted sources whenever accuracy matters.

3

Keep a human review checkpoint for high-stakes outputs.

4

Track failure patterns and revise prompts or workflows regularly.
