Diarization of the Language Consulting Center Telephone Calls

Zajíc, Zbyněk; Psutka, Josef; Zajícová, Lucie; Müller, Luděk; Salajka, Petr

Title:	Diarization of the Language Consulting Center Telephone Calls
Other Titles:	Diarizace telefonních hovorů Jazykové poradny Ústavu pro jazyk český
Authors:	Zajíc, Zbyněk Psutka, Josef Zajícová, Lucie Müller, Luděk Salajka, Petr
Citation:	ZAJÍC, Z., PSUTKA, J., ZAJÍCOVÁ, L., MÜLLER, L., SALAJKA, P. Diarization of the Language Consulting Center Telephone Calls. In: Speech and Computer, 21st International Conference, SPECOM 2019, Istanbul, turkey, August 20-25,2019, Proceedings. Cham: Springer, 2019. s. 549-558. ISBN 978-3-030-26060-6 , ISSN 0302-9743.
Issue Date:	2019
Publisher:	Springer
Document type:	konferenční příspěvek conferenceObject
URI:	2-s2.0-85071471006 http://hdl.handle.net/11025/36622
ISBN:	978-3-030-26060-6
ISSN:	0302-9743
Keywords:	Diarizace, x-vektor, automatické rozpoznávání řeči, modelování pomocí gaussovských směsí
Keywords in different language:	Diarization, x-vector, Automatic speech recognition, GMM
Abstract:	V tomto článku popisujeme diarizaci archivu Jazykové poradny vznikajícím v rámci projektu "Zpřístupnění dotazů jazykové poradny v lingvisticky strukturované databázi". Jedna část tohoto archivu je nahraná pouze v mono kvalit, naším úkolem je proto rozdělit data pomocí diarizace. Náš přístup využívá informace o identitě jazykového poradce získané z přepisu jeho představení na začátku každého z hovorů. Protože naše data jsou jedinenčná, pro porovnání uvádíme také výsledky dostupného systému diarizace Kaldi.
Abstract in different language:	In this paper, we describe a diarization of the archive data from the project “Access to a Linguistically Structured Database of Enquiries from the Language Consulting Center”. This project is attempting to provide improved access to the large archives of the Czech language of mainly telephone conversations collected continuously by The Language Consulting Center. One part of this archives contains mono recordings, where the data of the client and the language counsellor are mixed in one channel. In our proposed approach to a diarization, we used the information about the identity of the language counsellor acquired from the text transcription on the beginning of the conversation. For the initial stage of the diarization, our system based on clustering the x-vectors was adopted. The resegmentation step is used for refining the boundaries of speaker changes by the pre-trained Gaussian mixture model of the counsellor. Because of the uniqueness of our data, we compared our results with the Kaldi diarization as the baseline system.
Rights:	Plný text není přístupný. © Springer
Appears in Collections:	Konferenční příspěvky / Conference Papers (KKY) OBD

Files in This Item:

File	Size	Format
Zajic2019_Chapter_DiarizationOfTheLanguageConsul.pdf	420,62 kB	Adobe PDF	View/Open Request a copy

Show full item record

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/36622

search

navigation