Ubuyobozi bwa Audio AI

Igikoresho cyo Kumenyekanisha Kaldi

Kaldi nigitabo cyubusa, gifungura-isoko-ibikoresho byahindutse urubuga rwubushakashatsi bwubaka sisitemu yo kumenya imvugo.

Incamake

Kaldi nigitabo cyubusa, gifungura-isoko-ibikoresho byahindutse urubuga rwubushakashatsi bwubaka sisitemu yo kumenya imvugo. Nibyingenzi kuko mumyaka hafi icumi aribwo buryo bwo gushingira kumirimo ya ASR yamasomo ninganda.

Kaldi Speech Recognition Toolkit yicaye mubikorwa byamajwi-AI bihindura imvugo, umuziki, nijwi ryitumanaho, kugerwaho, no gukora itangazamakuru.

Kwibira cyane

Kaldi, yasohotse mu 2011 kandi iyobowe na Daniel Povey, yanditswe muri C ++ hamwe na resept zifatanije hamwe na bash na Perl. Yubatswe kumuyoboro wa kera wa ASR: gukuramo ibintu bya acoustic (MFCCs cyangwa filterbanks), amajwi ya foneme yerekana hamwe na Gaussian Mixture Models cyangwa, nyuma, imiyoboro yimbitse y’imitsi, kandi igahuza icyitegererezo cya acoustic, imvugo yamagambo, hamwe nururimi rwindimi mubishushanyo bimwe byashakishwa. Ihitamo ryayo rya tekiniki ryakoreshaga transducers iremereye (WFSTs) ivuye mubitabo bya OpenFST kugirango ihimbe ubumenyi bwose mubishushanyo mbonera. Kaldi yohereje 'resept' kuri dataseti zisanzwe nka Switchboard, Librispeech, na Wall Street Journal, bituma abashakashatsi berekana ibisubizo bigezweho. Byahindutse ishyirwa mubikorwa sisitemu nshya yashizweho.

Ubushishozi

Amayeri yibanze ya Kaldi ni uguhimba WFST enye mu gishushanyo kimwe cyitwa HCLG: H ikarita ya neural-net cyangwa GMM ivuga kuri terefone zishingiye ku miterere, C ikora imiterere ya fonetike (triphones), L ni imvugo yerekana ikarita ya terefone ku magambo, naho G ni urugero rwururimi. Kugwiza izo transducers no guhitamo ibisubizo bitanga igishushanyo kimwe decoder ishakisha hamwe na algorithm ya Viterbi yacishijwe bugufi, ihindura amajwi kumajwi muburyo bukurikirana ijambo.

Kumenya Kaldi Imvugo Yerekana Igikoresho

Kaldi nigitabo cyubusa, gifungura-isoko-ibikoresho byahindutse urubuga rwubushakashatsi bwubaka sisitemu yo kumenya imvugo. Nibyingenzi kuko mumyaka hafi icumi aribwo buryo bwo gushingira kumirimo ya ASR yamasomo ninganda. Kaldi Speech Recognition Toolkit yicaye mubikorwa byamajwi-AI bihindura imvugo, umuziki, nijwi ryitumanaho, kugerwaho, no gukora itangazamakuru. Kugirango wubake byimbitse, fata Kaldi Speech Recognition Toolkit nkicyitegererezo gikora, ntabwo ari ikintu kimwe: gusobanura ibyagezweho, gusobanura ibitekerezo, no gutandukanya ibyo sisitemu ishobora gukora byizewe nibisaba guca imanza zinzobere.

Mubimenyerezo, amakipe akomeye akoresha Kaldi Speech Recognition Toolkit ifata ubuziranenge, ubukererwe, no kwemererwa nkibice byingenzi byingamba zo kohereza. Bandika ibipimo ngenderwaho byerekana intsinzi, bagerageza kurwanya amakuru afatika hamwe nakazi keza, kandi bagasubiramo bashingiye kubikorwa byagaragaye ko batsinzwe aho gutsinda inshuro imwe. Aha niho imyumvire yubumenyi ihinduka mubushobozi burambye kubicuruzwa, politiki, nibikorwa.

Itezimbere kugerwaho binyuze mu kwandukura, kuvuga, no guhuza amajwi. Mugihe kimwe, gukoresha nabi amajwi no kwigira ibyago byiyongera mugihe uruhushya rubuze. Uburyo bukomeye cyane ni uguhuza umuvuduko wikigereranyo hamwe na disipuline yimiyoborere: kuyobora abaderevu, gufata ibimenyetso, gutangaza ibyemezo byicyemezo, no gukomeza kuvugurura uburyo bwo kwirinda nkimyitwarire yicyitegererezo, ibyo abakoresha bategereje, nibisabwa n'amategeko bigenda bihinduka.

Ingaruka z'Ingamba

Itezimbere kugerwaho binyuze mu kwandukura, kuvuga, no guhuza amajwi.

Itezimbere kugerwaho binyuze mu kwandukura, kuvuga, no guhuza amajwi. Mubikorwa byujuje ubuziranenge, ibi bihindurwa mumategeko agenga imikorere, imipaka nyirubwite, hamwe n'imihango yo gusubiramo kenshi kugirango amakipe ashobore kwigirira ikizere aho gupima ibidasobanutse.

Amatsinda yibitangazamakuru arashobora kohereza amajwi yihuse hamwe na bije nto.

Amatsinda yibitangazamakuru arashobora kohereza amajwi yihuse hamwe na bije nto. Mubikorwa byujuje ubuziranenge, ibi bihindurwa mumategeko agenga imikorere, imipaka nyirubwite, hamwe n'imihango yo gusubiramo kenshi kugirango amakipe ashobore kwigirira ikizere aho gupima ibidasobanutse.

Sisitemu ireba abakiriya irashobora gutunganya imikoranire ivugwa murwego runini.

Sisitemu ireba abakiriya irashobora gutunganya imikoranire ivugwa murwego runini. Mubikorwa byujuje ubuziranenge, ibi bihindurwa mumategeko agenga imikorere, imipaka nyirubwite, hamwe n'imihango yo gusubiramo kenshi kugirango amakipe ashobore kwigirira ikizere aho gupima ibidasobanutse.

Ejo hazaza ha Kaldi Imvugo yo Kumenyekanisha

Kaldi ya Hybrid ya HMM-DNN yasimbujwe ahanini na moderi yimitsi iherezo-iherezo yerekana ikarita yerekana amajwi kumyandiko. Umushinga uzasimbura Daniel Povey, k2 (hamwe na ecosystem ya Icefall na Lhotse), yongeye gutekereza ibitekerezo bya WFST ya Kaldi muri PyTorch hamwe na automatique itandukanye ya leta. Witege ko Kaldi ubwayo izakomeza kuba amateka nigikoresho cyo kwigisha, mugihe ababakomokaho bahuza decoding ya classique yubatswe hamwe na moderi igezweho kandi ishingiye kuri acoustic.

Gushyira mu bikorwa Isi

Laboratwari yamasomo yerekana ibipimo ngenderwaho bya Librispeech na Switchboard kugirango yemeze ubushakashatsi bushya bwo kwerekana imiterere ya acoustic

Kubaka amajwi yigenga ya sisitemu ya sisitemu yo hasi cyangwa indimi nkeya ukoresheje Kaldi resept

Guhatira guhuza amajwi inyandiko-mvugo ku bumenyi bw'indimi, kurema dataset, hamwe na subtitle igihe

Imbaraga zo gushakisha amajwi hakiri kare hamwe nigitekerezo gisubira inyuma munganda mbere yuko impera zanyuma zirangira

Uburyo bwo Gushyira mu bikorwa

Kaldi Speech Recognition Toolkit mubikorwa

Laboratwari yamasomo yerekana ibipimo ngenderwaho bya Librispeech na Switchboard kugirango yemeze ubushakashatsi bushya bwo kwerekana imiterere ya acoustic.

Laboratwari yamasomo yerekana ibipimo ngenderwaho bya Librispeech na Switchboard kugirango yemeze ubushakashatsi bushya bwo kwerekana imiterere ya acoustic Amakipe ubusanzwe abona ibisubizo byiza iyo asobanuye ibipimo ngenderwaho byimbere, agakomeza inzira yo kuzamura abantu kubibazo, kandi akurikirana inyungu zibyara umusaruro hamwe nibiciro byamakosa mugihe runaka.

Kaldi Speech Recognition Toolkit mubikorwa

Kubaka amajwi yigenga ya sisitemu ya sisitemu yo hasi cyangwa indimi nkeya ukoresheje Kaldi resept.

Kubaka sisitemu yo gutondekanya amajwi ya sisitemu yo gukoresha amikoro make cyangwa indimi nkeya ukoresheje ibisobanuro bya Kaldi Amakipe ubusanzwe abona ibisubizo byiza iyo asobanuye ibipimo byujuje ubuziranenge imbere, agakomeza inzira yo kuzamura abantu kubibazo, kandi akurikirana inyungu zibyara umusaruro hamwe nibiciro byamakosa mugihe.

Kaldi Speech Recognition Toolkit mubikorwa

Guhatira guhuza amajwi inyandiko-mvugo ku bumenyi bw'indimi, kurema dataset, hamwe na subtitle igihe.

Guhuza amajwi ku gahato ku nyandiko-mvugo y’ubumenyi bw’indimi, guhanga dataset, hamwe na subtitle igihe Amakipe akunze kubona ibisubizo byiza iyo asobanuye ibipimo ngenderwaho byimbere, agakomeza inzira yo kuzamuka kwabantu kubibazo byimbitse, kandi agakurikirana inyungu zibyara umusaruro nibiciro byamakosa mugihe runaka.

Kaldi Speech Recognition Toolkit mubikorwa

Imbaraga zo gushakisha amajwi hakiri kare hamwe nigitekerezo gisubira inyuma munganda mbere yuko impera zanyuma zirangira.

Imbaraga zishakisha amajwi hakiri kare hamwe nigitekerezo cyinyuma mu nganda mbere yuko impera zanyuma kugeza ku ndunduro zikuze Amakipe ubusanzwe abona ibisubizo byiza iyo asobanuye ibipimo byujuje ubuziranenge imbere, agakomeza inzira yo kuzamura abantu kubibazo, kandi akurikirana inyungu zibyara umusaruro hamwe nibiciro byamakosa mugihe runaka.

Ingaruka & Kurinda

!

Gukoresha nabi amajwi no kwigira ibyago byiyongera mugihe uruhushya rubuze.

!

Ukuri kurashobora kugabanuka hejuru yimvugo, imvugo, cyangwa urusaku rwibidukikije.

!

Amajwi yubukorikori arashobora kwibeshya kumvugo yukuri nta kirango gisobanutse.

Igishushanyo mbonera

1

Shaka uruhushya rusobanutse rwo gufata amajwi, gukoroniza, no gukoresha.

Shaka uruhushya rusobanutse rwo gufata amajwi, gukoroniza, no gukoresha. Fata buri ntambwe nk irembo ryibimenyetso: niba ibipimo bitujujwe, hagarika kuzenguruka, funga icyuho, hanyuma noneho wagure imikoreshereze.

2

Ikizamini cyiza mubiganiro bitandukanye hamwe nuburyo bwimbere.

Ikizamini cyiza mubiganiro bitandukanye hamwe nuburyo bwimbere. Fata buri ntambwe nk irembo ryibimenyetso: niba ibipimo bitujujwe, hagarika kuzenguruka, funga icyuho, hanyuma noneho wagure imikoreshereze.

3

Sobanura igihe umuntu agomba gusuzuma cyangwa kwemeza ibisubizo.

Sobanura igihe umuntu agomba gusuzuma cyangwa kwemeza ibisubizo. Fata buri ntambwe nk irembo ryibimenyetso: niba ibipimo bitujujwe, hagarika kuzenguruka, funga icyuho, hanyuma noneho wagure imikoreshereze.

4

Andika amajwi yubukorikori kandi ugumane inyandiko zerekana kubazwa.

Andika amajwi yubukorikori kandi ugumane inyandiko zerekana kubazwa. Fata buri ntambwe nk irembo ryibimenyetso: niba ibipimo bitujujwe, hagarika kuzenguruka, funga icyuho, hanyuma noneho wagure imikoreshereze.

Komeza Ubushakashatsi