ASR 2018-19

Lecture 18 - Speaker diarization

This is the first of two lectures on speaker recognition. This lecture concerns speaker diarization, the task of determining "who spoke when": a recording is split into segments, where each segment corresponds to the speech of a single speaker. Unlike the settings we have previously considered, speaker diarization assumes there are multiple speakers in a recording. A good description of a current approach to speaker diarization is the ICASSP 2017 paper by Garcia-Romero et al., "Speaker diarization using deep neural network embeddings".
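To make the "who spoke when" output concrete, here is a minimal Python sketch that collapses per-frame speaker labels into (start, end, speaker) segments. It is only an illustration of the output format, not the method of the cited paper: the function name `labels_to_segments`, the `frame_shift` parameter, the use of `None` for non-speech frames, and the toy labels are all assumptions made for this example.

```python
from itertools import groupby


def labels_to_segments(frame_labels, frame_shift=0.01):
    """Collapse per-frame speaker labels into (start, end, speaker) segments.

    frame_labels: one (hypothetical) speaker label per frame; None = non-speech.
    frame_shift:  assumed time between successive frames, in seconds.
    """
    segments = []
    index = 0  # index of the first frame in the current run
    for speaker, run in groupby(frame_labels):
        length = len(list(run))
        start = index * frame_shift
        end = (index + length) * frame_shift
        if speaker is not None:  # skip non-speech runs
            segments.append((round(start, 3), round(end, 3), speaker))
        index += length
    return segments


# Toy recording: speaker A, a pause, speaker B, then speaker A again.
frame_labels = ["A"] * 3 + [None] * 2 + ["B"] * 4 + ["A"] * 1
segments = labels_to_segments(frame_labels, frame_shift=0.5)
for start, end, speaker in segments:
    print(f"{start:5.1f} {end:5.1f}  {speaker}")
```

Each resulting segment contains the speech of one speaker, and the same speaker label can recur in several segments. In a real system the frame labels would come from segmentation and clustering of the audio, which this sketch does not attempt.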

Speaker diarization


Copyright (c) University of Edinburgh 2015-2019
The ASR course material is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License
This page maintained by Steve Renals.
Last updated: 2019/04/26 17:02:41UTC