Full metadata record
DC pole | Hodnota | Jazyk |
---|---|---|
dc.contributor.author | Psutka, Josef V. | |
dc.contributor.author | Vaněk, Jan | |
dc.contributor.author | Psutka, Josef | |
dc.date.accessioned | 2016-01-06T13:29:22Z | |
dc.date.available | 2016-01-06T13:29:22Z | |
dc.date.issued | 2011 | |
dc.identifier.citation | PSUTKA, Josef V.; VANĚK, Jan; PSUTKA, Josef. Speaker-clustered acoustic models evaluated on GPU for on-line subtitling of parliament meetings. In: Text, speech and dialogue. Berlin: Springer, 2011, p. 284-290. (Lectures notes in computer science; 6836). ISBN 978-3-642-23537-5. | en |
dc.identifier.isbn | 978-3-642-23537-5 | |
dc.identifier.uri | http://www.kky.zcu.cz/cs/publications/JosefVPsutka_2011_Speaker-clustered | |
dc.identifier.uri | http://hdl.handle.net/11025/17133 | |
dc.format | 7 s. | cs |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | en |
dc.publisher | Springer | en |
dc.relation.ispartofseries | Lecture notes in computer science; 6836 | en |
dc.rights | © Josef V. Psutka - Jan Vaněk - Josef Psutka | cs |
dc.subject | akustické modelování | cs |
dc.subject | GPU | cs |
dc.title | Speaker-clustered acoustic models evaluated on GPU for on-line subtitling of parliament meetings | en |
dc.type | článek | cs |
dc.type | article | en |
dc.rights.access | openAccess | en |
dc.type.version | publishedVersion | en |
dc.description.abstract-translated | This paper describes the effort with building speaker-clustered acoustic models as a part of the real-time LVCSR system that is used more than one year by the Czech TV for automatic subtitling of parliament meetings broadcasted on the channel ČT24. Speaker-clustered acoustic models are more acoustically homogeneous and therefore give better recognition performance than single gender-independent model or even gender-dependent models. Frequent changes of speakers and a direct connection of the LVCSR system to the audio channel require an automatic switching/fusion of models as quickly as possible. An important part of the solution is real time likelihood evaluations of all clustered acoustic models, taking advantage of a fast GPU(Graphic Processing Unit). The proposed method achieved a WER reduction to the baseline gender-independent model over 2.34% relatively with more than 2M Gaussian mixtures evaluated in real-time. | en |
dc.subject.translated | acoustic models | en |
dc.subject.translated | GPU | en |
dc.identifier.doi | 10.1007/978-3-642-23538-2_36 | |
dc.type.status | Peer-reviewed | en |
Vyskytuje se v kolekcích: | Články / Articles (KIV) Články / Articles (KKY) |
Soubory připojené k záznamu:
Soubor | Popis | Velikost | Formát | |
---|---|---|---|---|
JosefVPsutka_2011_Speaker-clustered.pdf | Plný text | 158,08 kB | Adobe PDF | Zobrazit/otevřít |
Použijte tento identifikátor k citaci nebo jako odkaz na tento záznam:
http://hdl.handle.net/11025/17133
Všechny záznamy v DSpace jsou chráněny autorskými právy, všechna práva vyhrazena.