• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer

My TechDecisions

  • Best of Tech Decisions
  • Topics
    • Video
    • Audio
    • Mobility
    • Unified Communications
    • IT Infrastructure
    • Network Security
    • Physical Security
    • Facility
    • Compliance
  • RFP Resources
  • Resources
  • Podcasts
  • Project of the Week
  • About Us
    SEARCH
Audio, Unified Communications

Google AI’s Translatotron Can Translate A Speaker’s Voice — With the Same Characteristics

New direct speech-to-speech translation tool, Google AI Translatotron, still in development but could soon change the way we communicate across languages.

May 29, 2019 Adam Forziati 1 Comment

Google AI Translatotron,

Google AI recently announced Translatotron, an experimental new direct speech-to-speech translation tool that Google says is capable of “faster inference speed, naturally avoiding compounding errors between recognition and translation… [and retaining] the voice of the original speaker after translation…”

Google AI Translatotron “is based on a sequence-to-sequence network which takes source spectrograms as input and generates spectrograms of the translated content in the target language,” the development team says.

How Google AI Translatotron Works: Simplified

Here is a visual recreation — taken from Google AI’s announcement — of how the technology works:

Preserving the Sound of the Original Speaker

“By incorporating a speaker encoder network, Translatotron is also able to retain the original speaker’s vocal characteristics in the translated speech, which makes the translated speech sound more natural and less jarring… The speaker encoder is pretrained on the speaker verification task, learning to encode speaker characteristics from a short example utterance. Conditioning the spectrogram decoder on this encoding makes it possible to synthesize speech with similar speaker characteristics, even though the content is in a different language.”

What does this mean in practice? Let’s listen to find out.

The audio clips below, taken from the Google AI announcement, show the Google AI Translatotron transferring the original Spanish speaker’s voice into a translation in English.

Spanish Source: 

https://mytechdecisions.com/wp-content/uploads/2019/05/10148907792880119076.wav

Reference Translation in English:

https://mytechdecisions.com/wp-content/uploads/2019/05/10148907792880119076-1.wav

Google AI Translatotron Translation in Original Speaker’s Voice:

https://mytechdecisions.com/wp-content/uploads/2019/05/10148907792880119076-2.wav

 

What this Means for Collaboration

Google AI claims the Translatotron is possibly the first end-to-end model direct speech-to-speech translation tool that can directly translate speech from language into similar-sounding speech in a different language.

If the technology is further developed, this could effectively break down the language barrier in a more instantaneous, seamless way for teams working across cultural or international borders. It could also allow for quicker client relations and a reduced translation service cost.

 

If you enjoyed this article and want to receive more valuable industry content like this, click here to sign up for our digital newsletters!

Tagged With: Artificial Intelligence, Collaboration

Related Content:

  • Engaging virtual meeting with diverse participants discussing creative ideas in a bright office space during daylight hours Diversified Survey: Workplace AV Tech is Falling Short,…
  • women using Yealink WH64 Hybrid wireless headset Hybrid Work Trend Arises: The Impact on DECT…
  • Yealink banner WH64 Hybrid Wirless Headset Yealink Introduces WH64 Hybrid DECT & Bluetooth Wireless…
  • Children using smartboard in classroom | Interactive learning with modern technology PPDS & DisplayNote Introduce Philips ScreenShare for Wireless…

Free downloadable guide you may like:

  • Practical Design Guide for Office SpacesPractical Design Guide for Office Spaces

    Recent Gartner research shows that workers prefer to return to the office for in-person meetings for relevant milestones, as well as for face-to-face time with co-workers. When designing the office spaces — and meeting spaces in particular — enabling that connection between co-workers is crucial. But introducing the right collaboration technology in meeting spaces can […]

Reader Interactions

Trackbacks

  1. Making the NAS Algorithm More Accessible: This MIT Research Could Be a Boon to Machine Learning - My TechDecisions says:
    May 31, 2019 at 10:22 am

    […] Related: Google AI’s Translatotron Can Translate A Speaker’s Voice — With the Same Characteristics […]

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Latest Downloads

Practical Design Guide for Office Spaces
Practical Design Guide for Office Spaces

Recent Gartner research shows that workers prefer to return to the office for in-person meetings for relevant milestones, as well as for face-to-fa...

New Camera Can Transform Your Live Production Workflow
New Camera System Can Transform Your Live Production Workflow

Sony's HXC-FZ90 studio camera system combines flexibility and exceptional image quality with entry-level pricing.

Creating Great User Experience and Ultimate Flexibility with Clickshare

Working and collaborating in any office environment today should be meaningful, as workers today go to office for very specific reasons. When desig...

View All Downloads

Would you like your latest project featured on TechDecisions as Project of the Week?

Apply Today!

More from Our Sister Publications

Get the latest news about AV integrators and Security installers from our sister publications:

Commercial IntegratorSecurity Sales

AV-iQ

Footer

TechDecisions

  • Home
  • Welcome to TechDecisions
  • Contact Us
  • Comment Guidelines
  • RSS Feeds
  • Twitter
  • Facebook
  • Linkedin

Free Technology Guides

FREE Downloadable resources from TechDecisions provide timely insight into the issues that IT, A/V, and Security end-users, managers, and decision makers are facing in commercial, corporate, education, institutional, and other vertical markets

View all Guides
TD Project of the Week

Get your latest project featured on TechDecisions Project of the Week. Submit your work once and it will be eligible for all upcoming weeks.

Enter Today!
Emerald Logo
ABOUTCAREERSAUTHORIZED SERVICE PROVIDERSYour Privacy ChoicesTERMS OF USEPRIVACY POLICY

© 2025 Emerald X, LLC. All rights reserved.