
Speak! speech synthesis meeting.

The purpose of these regular informal meetings is to discuss and share progress relating to speech synthesis (audio and visual) research - within CSTR specifically as well as in the field generally. Talks are intended to be short and informal, with an emphasis on discussion, interaction and feedback. Relevant references should be sent round in advance to encourage everyone to contribute.

Everybody with an interest in speech synthesis (audio and visual) research is welcome.

  • Meetings will typically be held in the Instrumented Meeting Room ("IMR" - room 3.07), on Level 3 of the Informatics Forum building (though this may occasionally vary).
  • At present, the standard time for these meetings is Thursdays at 2-3pm.


Speak! synthesis meetings schedule for 2016/2017

22.09.16 No meeting
29.09.16 Semester 1 Planning Meeting
06.10.16 Felipe - A hybrid harmonics-and-bursts modelling approach to speech synthesis (pp. 224–229)
13.10.16 Gustav - Temporal modeling in neural network based statistical parametric speech synthesis (pp. 113–118)
20.10.16 Rasmus - Prediction of Emotions from Text using Sentiment Analysis for Expressive Speech Synthesis
27.10.16 Sam - WaveNet: A Generative Model for Raw Audio (Blog post with examples). Background reading: Pixel Recurrent Neural Networks, Conditional Image Generation with PixelCNN Decoders, and Video Pixel Networks.
03.11.16 Cassia - [TTS quality prediction] paper1 paper2 (pp. 139–145)
10.11.16 Srikanth - End to end modelling using neural attention
Junichi - [ongoing work presentation] Waveform generation based on phase reconstruction from FFT spectral amplitudes

17.11.16 Simon - Multi-output RNN-LSTM for multiple speaker speech synthesis with alpha-interpolation model (pp. 119–124)
24.11.16 Catherine - some papers on experimental design: The garden of forking paths, and Beyond power calculations: Assessing Type S (sign) and Type M (magnitude) errors.
01.12.16 [canceled] most people are away due to Tom Merritt's visit
08.12.16 Oliver - Enhance the word vector with prosodic information for the recurrent neural network based TTS system
09.12.16 CSTR Christmas Lunch and ILCC Christmas Party
Christmas Break  

Suggestions for future presentations

  • Hojo et al. (2016) An Investigation of DNN-Based Speech Synthesis Using Speaker Codes. Interspeech 2016. [Cassia]

  • Papers on articulatory synthesis [Korin]

  • your suggestion here

Speak! meeting schedules

Topic attachments
| Attachment | Size | Date | Who | Comment |
| Brognaux_IS14.pdf | 266.8 K | 25 Sep 2014 - 14:07 | Main.mwester | pronunciation variation in TTS #2 |
| Kolluru_IS04.pdf | 270.2 K | 25 Sep 2014 - 14:05 | Main.mwester | pronunciation variation TTS #1 |
| Lecumberri_IS14.pdf | 434.8 K | 25 Sep 2014 - 14:08 | Main.mwester | pronunciation variation in TTS #3 |
| collobert-2011.pdf | 726.8 K | 11 Jun 2012 - 09:09 | Main.s0676515 | |
Topic revision: r38 - 21 Sep 2016 - 10:14:31 - Main.cvbotinh