Training of speaker-clustered discriminative acoustic models for use in real-time recognizers

Vaněk, Jan; Psutka, Josef V.; Zelinka, Jan; Trmal, Jan

Full metadata record

DC pole	Hodnota	Jazyk
dc.contributor.author	Vaněk, Jan
dc.contributor.author	Psutka, Josef V.
dc.contributor.author	Zelinka, Jan
dc.contributor.author	Trmal, Jan
dc.date.accessioned	2015-12-10T13:20:32Z
dc.date.available	2015-12-10T13:20:32Z
dc.date.issued	2010
dc.identifier.citation	VANĚK, Jan; PSUTKA, Josef V.; ZELINKA, Jan; TRMAL, Jan. Training of speaker-clustered discriminative acoustic models for use in real-time recognizers. In: Speech processing. Prague: Institute of photonics and electronics AS CR , 2010, p. 152-158. ISBN 978-80-86269-21-4.	en
dc.identifier.isbn	978-80-86269-21-4
dc.identifier.uri	http://hdl.handle.net/11025/16957
dc.description.abstract	Je dobře známo, že akustické modely, založené na informaci o pohlaví řečníka, jsou více akusticky homogenní, a proto dosahují lepších výsledků rozpoznávání než jeden univerzální akustický model v případě, že je pohlaví řečníka úspěšně detekováno, nebo předem známo. Řečníci ovšem nemusí být rozděleni jen do dvou skupin. V tomto článku je popsán algoritmus, který je schopen vytvořit větší množství shluků řečníků. Dále se tento článek zabývá problémem vhodného použití těchto modelů v systémech rozpoznávání řeči pracujících v reálném čase, kde informace od detektoru správného shluku řečníků je často zpožděná nebo nesprávná. Dále jsou ještě v článku diskutovány různé přístupy k začlenění diskriminativních metod při trénování těchto akustických modelů.	cs
dc.format	7 s.	cs
dc.format.mimetype	application/pdf
dc.language.iso	en	en
dc.publisher	Institute of photonics and electronics AS CR	en
dc.rights	© Jan Vaněk - Josef V. Psutka - Jan Zelinka - Jan Trmal	cs
dc.subject	model shlukování řečníků	cs
dc.subject	akustické modelování	cs
dc.subject	automatické rozpoznávání řeči	cs
dc.title	Training of speaker-clustered discriminative acoustic models for use in real-time recognizers	en
dc.title.alternative	Trénování diskriminativních akustických modelů založených na shlucích řečníků pro rozpoznávání řeči pracujícím v reálném čase	cs
dc.type	článek	cs
dc.type	article	en
dc.rights.access	openAccess	en
dc.type.version	publishedVersion	en
dc.description.abstract-translated	It is well known that gender-dependent (male/female) acoustic models are more acoustically homogeneous and therefore give better recognition performance than single gender-independent model in the case where the gender is successfully detected or a priory known. Speakers do not need to be split to two groups only. An algorithm to make higher number of speaker clusters is described in this paper. Further, the paper deals with a problem how to use these gender-based or speaker-clustered acoustic models in a real-time LVCSR where information from an automatic cluster detector is often delayed or incorrect. Moreover, various ways, how to incorporate discriminative training methods into training of the speaker-clustered acoustic models, are discussed in the paper.	en
dc.subject.translated	speaker-clustered model	en
dc.subject.translated	acoustics modeling	en
dc.subject.translated	automatic speech recognition	en
dc.type.status	Peer-reviewed	en
Vyskytuje se v kolekcích:	Články / Articles (KIV) Články / Articles (KKY)

Soubory připojené k záznamu:

Soubor	Popis	Velikost	Formát
VanekJan_2010_Trainingof.pdf	Plný text	184,32 kB	Adobe PDF	Zobrazit/otevřít

Zobrazit minimální záznam Zobrazit statistiky

Použijte tento identifikátor k citaci nebo jako odkaz na tento záznam: http://hdl.handle.net/11025/16957

Všechny záznamy v DSpace jsou chráněny autorskými právy, všechna práva vyhrazena.

hledání

navigace