ETH Zurich

Postdoctoral Researcher in Text-to-Speech Synthesis (Computer Science)

Save as a favourite

About the employer

ETH Zurich is well known for its excellent education, ground-breaking fundamental research and for implementing its results directly into practice.

Visit the employer page

Postdoctoral Researcher in Text-to-Speech Synthesis (Computer Science)

The Media Technology Center (MTC) at ETH Zurich connects research and innovative industry applications to build the media technologies of tomorrow. To achieve this, the center provides a unique arrangement that allows a lively knowledge exchange between industry partners, experts, and students to work on the digital transformation of media, covering salient disciplines such as media technology, digital journalism, and design. To join our interdisciplinary team, we offer a position as a Postdoctoral Researcher in Computer Science.

We are looking for a talented candidate to join the MTC as a postdoctoral researcher. The candidate will work on multiple applied research projects (80%), as defined by the center and our media and industry partner, in the areas of Computer Vision, Computer Graphics, Natural Language Processing, Machine Learning, Video- and audio processing. The candidate may also pursue his or her own research (20%) with students at ETH Zurich.

Project background

The MTC is  exploring how speech synthesis methods can allow for new possibilities to create engaging audio content. Previously, we have built the first voice assistant that can understand and speak in the local Swiss dialects (low-resource text-to-speech). A new project at MTC aims to expand on this work and explore technologies that allow for controllable, adaptive, and expressive text-to-speech synthesis. While automatic text-to-speech systems have dramatically improved over the past years, current technologies have issues that make them hard to apply for applications in the media industry, especially for radio and video broadcasts. They often lack expressiveness in the produced speech, and the original speaker is often re-identifiable based on the generated speech.

Job description

Initially, you will drive the research for our project on expressive text-to-speech synthesis. For this, you will be developing new algorithms and systems, and adapting the state-of-the-art to the specific challenges in the industry. In dialogue with our partners, you will develop novel methods with direct real-world applications. You will be working closely with research engineers who will implement your research into software prototypes. You will supervise 1-3 Bachelor's and Master's students per semester and interns supporting the projects. You will benefit from a lively exchange of interdisciplinary projects at the intersection of machine learning, computer graphics and vision, and natural language processing.

The initial employment contract will be for 12 months. Further contract extensions on an annual basis will be decided during the initial employment.

We offer

ETH Zurich is a family-friendly employer with excellent working conditions. You can look forward to an exciting working environment, cultural diversity and attractive offers and benefits.

Your profile

  • Must hold Ph.D. in Computer Science or related fields (e.g., Electrical Engineering, Computational Science)
  • Proven track record in speech synthesis or related fields (e.g., publications in venues such as ICASSP, ASRU, ICML, NIPS)
  • Interest in working on applied research projects at the intersection between academia and industry.
  • Proficiency in written and verbal communication in English.
  • Experience in supervising Bachelor and Master students
  • Speaking one of the national languages of Switzerland is a plus: German, French, Italian, Romansh.

About ETH Zürich

ETH Zurich is one of the world’s leading universities specialising in science and technology. We are renowned for our excellent education, cutting-edge fundamental research and direct transfer of new knowledge into society. Over 30,000 people from more than 120 countries find our university to be a place that promotes independent thinking and an environment that inspires excellence. Located in the heart of Europe, yet forging connections all over the world, we work together to develop solutions for the global challenges of today and tomorrow.

Curious? So are we.

We look forward to receiving your online application with the following documents:

  • Cover letter: a short (one page or less) statement of your research interests and your motivation for joining the MTC
  • CV and publication list
  • University degree(s)

Please note that we exclusively accept applications submitted through our online application portal. Applications via e-mail or postal services will not be considered.

Further information about the Media Technology Center can be found on our website, For more information about the position, contact Dr. Fabio Zünd by e-mail (no applications).

Job details

Postdoctoral Researcher in Text-to-Speech Synthesis (Computer Science)
Rämistrasse 101 Zurich, Switzerland
Application deadline
Job type
Save as a favourite

More jobs from this employer

About the employer

ETH Zurich is well known for its excellent education, ground-breaking fundamental research and for implementing its results directly into practice.

Visit the employer page

Relevant stories

Solar Fuels: Harnessing Energy From the Sun Dutch Institute for Fundamental Energy Research (DIFFER) 4 min read
Printing the Solar Cells of the Future King Abdullah University of Science and Technology (KAUST) 5 min read
“You Can Think Big Here” Tampere University 4 min read
TU/e Enables Surgeons to See the Invisible Eindhoven University of Technology 5 min read
More stories