TWiki> CSTR Web>Listen (06 Mar 2017, Main.clai)EditAttach

-- Main.s0566164 - 24 October 2008

Listen speech recognition meeting

Please note: this wiki page is no longer being updated. Please see the new wiki page:

https://www.wiki.ed.ac.uk/display/CSTR/Listen

Weekly CSTR speech recognition meeting: every Tuesday at 11am in the instrumented meeting room (3.7). All welcome.

All talks will be short and informal, and discussion is encouraged. The idea is for people giving a talk to send round slides and/or relevant references in advance to encourage everyone to contribute.

NB. If editing the programme for this semester or attaching new papers, edit this page instead. This avoids having to modify the Wiki at the start of each semester, and keeps all attachments in a sensible place!

2016-17Semester1
11.10.16Planning meeting
18.10.16Active and semi-supervised learning & optimising evaluation (Peter)
25.10.16

WaveNet: A Generative Model for Raw Audio (Blog post with examples). Background reading: Pixel Recurrent Neural Networks, Conditional Image Generation with PixelCNN Decoders, and Video Pixel Networks. (Steve)

01.11.16
Domain adversarial MTL - Practical paper from IS: Adversarial Multi-task Learning of Deep Neural Networks for Robust Speech Recognition and a background paper (JMLR): Domain-Adversarial Training of Neural Networks (Joachim)
08.11.16

Open-Domain Audio-Visual Speech Recognition: A Deep Learning Approach & Multimodal deep learning (Ondrej)

15.11.16
A DNN-HMM Approach to Story Segmentation and Dialogue Session Segmentation by Embedding-Enhanced TextTiling (Emiru)
22.11.16
Grouping context-dependent targets & phonetic context embeddings (Peter)
29.11.16
A review of Interspeech CNN/LSTM papers. Advances in Very Deep Convolutional Neural Networks for LVCSR & The IBM 2016 English Conversational Telephone Speech Recognition System Link to presentation (Joanna)
06.12.16
How neural network depth compensates for the conditional independence assumption (Joachim)
13.12.16
No meeting (SLT)
TBC
Speaker separation with deep clustering (Steve)
TBC
A mathematical theory of deep CNNs for feature extraction (Gustav)
TBC
Phone synchronous decoding with CTC lattice (Ondrej)
TBC
Generative adversarial networks review (Joanna and Joachim)
TBC
Review of Interspeech low-resource/semi-supervised papers

Future plans

NB. no meeting on 2 Mar 2015.

Previous sessions

Topic attachments
I Attachment Action Size Date WhoSorted ascending Comment
pdfpdf IS08_Probabilistic_Phone_Mapping_Model.pdf manage 196.4 K 03 Feb 2011 - 17:18 Main.llu  
pdfpdf Super-human_multi-talker_speech_recognition.pdf manage 2177.2 K 29 Oct 2009 - 13:30 Main.llu  
pdfpdf icassp10_tandem_low_resource.pdf manage 72.1 K 04 Feb 2011 - 10:11 Main.llu  
pdfpdf Direct_Construction_of_Compact_Context-Dependency_Transducers_From_Data.pdf manage 121.6 K 05 Nov 2010 - 10:38 RavichanderVipperla Computer Speech and Language best paper award
pdfpdf main.pdf manage 309.9 K 04 Jun 2009 - 08:46 Main.s0565860 Multilingual recognition week 2
pdfpdf 120509_1.pdf manage 206.1 K 12 May 2009 - 17:23 Main.s0566164  
pdfpdf 120509_2.pdf manage 117.0 K 12 May 2009 - 17:23 Main.s0566164  
pdfpdf 120509_bg.pdf manage 823.7 K 12 May 2009 - 17:23 Main.s0566164  
pdfpdf 2002-droppo-icassp.pdf manage 75.3 K 21 Jan 2009 - 17:14 Main.s0566164  
pdfpdf gales_ASRU07.pdf manage 517.4 K 10 Feb 2009 - 11:59 Main.s0566164  
pdfpdf giuliagarau_eurospeech05.pdf manage 85.7 K 17 Feb 2009 - 17:26 Main.s0566164  
pdfpdf icassp08_ubm.pdf manage 126.2 K 20 May 2009 - 11:31 Main.s0566164  
pdfpdf jasa2006.pdf manage 244.1 K 26 Feb 2009 - 14:26 Main.s0566164  
pdfpdf liao05euro.pdf manage 128.2 K 21 Jan 2009 - 17:15 Main.s0566164  
pdfpdf liao_SPEECOM.pdf manage 646.0 K 10 Feb 2009 - 11:58 Main.s0566164  
pdfpdf liao_tr499.pdf manage 766.8 K 28 Jan 2009 - 10:47 Main.s0566164  
pdfpdf listen0509_1.pdf manage 205.9 K 04 May 2009 - 17:00 Main.s0566164  
pdfpdf listen0509_2.pdf manage 362.7 K 04 May 2009 - 17:00 Main.s0566164  
pdfpdf lm_stc.pdf manage 204.3 K 30 Jun 2009 - 11:12 Main.s0566164  
pdfpdf mod_mpe.pdf manage 230.7 K 30 Jun 2009 - 11:12 Main.s0566164  
pdfpdf talk.pdf manage 75.4 K 26 Apr 2010 - 14:57 Main.s0566164 gaelic slides
pdfpdf vtln_csl09.pdf manage 284.9 K 23 Feb 2009 - 12:37 Main.s0566164  
pdfpdf vtln_is01.pdf manage 77.3 K 17 Feb 2009 - 17:27 Main.s0566164  
pdfpdf Yao2012_dnnadapt.pdf manage 188.8 K 16 Mar 2013 - 15:34 Main.s1136550 MSFT paper on adaptation from SLT
pdfpdf icml08.pdf manage 494.6 K 05 Mar 2009 - 12:38 SongfangHuang  
pdfpdf wang2008mispronunciationDetection1.pdf manage 167.5 K 17 Nov 2009 - 17:39 Main.v1rgosan Lexicon-based mispronunciation detection (1)
pdfpdf wang2008mispronunciationDetection2.pdf manage 231.1 K 17 Nov 2009 - 17:39 Main.v1rgosan Lexicon-based mispronunciation detection (2)
Topic revision: r115 - 06 Mar 2017 - 18:21:12 - Main.clai
 
This site is powered by the TWiki collaboration platformCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback
This Wiki uses Cookies