Voice recognition is the ability of a machine or program to identify words and phrases in spoken language and convert them to a machine-readable format.
Designing a machine that mimics human behavior, especially the ability to speak and to respond to speech, has intrigued engineers and scientists for centuries. Speech technologies have undergone a dramatic transformation: from early speaking machines built with resonance tubes, to Alexander Graham Bell's first recording devices, to the Dictaphone, to the first voice synthesizer, the Voder (Voice Operating Demonstrator), and on to today's smart virtual assistants like Apple's Siri and Amazon's Alexa. Thanks to advances in AI, voice recognition technology is gaining popularity. According to a recent U.S. Cellular survey, 36% of smartphone owners use a virtual assistant daily and 30% use smart home technology daily. This connectivity is expected to grow, with the number of connected devices and sensors predicted to rise 200% to 46 billion by 2021.
The idea is to transform recorded audio into a sequence of words, as an alternative to typing on a keyboard. From helping people with physical disabilities to transcribing interviews, learning a new language, or opening a file via voice commands, speech recognition finds use in a wide range of applications. Voice recognition systems make interacting with technology easier by enabling hands-free requests.
From 1952 to today
The earliest voice recognition technologies could only comprehend digits. The Audrey system, built by Bell Labs in 1952 and considered the first speech recognition device, recognised only the ten digits, spoken by a single voice. It was followed by the Shoebox machine, developed by IBM in 1962, which could recognise 16 English words: the 10 digits and 6 arithmetic commands.
The U.S. Department of Defense made great contributions to the development of speech recognition systems. From 1971 to 1976, it funded the DARPA SUR (Speech Understanding Research) program, which led to the development of Harpy at Carnegie Mellon, a system that could comprehend 1,011 words. At around the same time, the first commercial speech recognition company, Threshold Technology, was founded, and Bell Labs introduced a system that could interpret multiple people's voices. In 1978, Texas Instruments introduced the Speak & Spell, a milestone in speech development because its speech chip produced more human-like digital synthesis. The development of the hidden Markov model, which used statistics to estimate the probability of unknown sounds, proved to be a major breakthrough; the technology even entered the home, in the form of Worlds of Wonder's Julie doll.
Faster microprocessors
Thanks to the introduction of faster microprocessors, the world's first speech recognition software for consumers, Dragon Dictate, was released in 1990. In 1992, Apple also demonstrated a real-time continuous speech recognition system that could recognise as many as 20,000 words, and in 1997 Dragon NaturallySpeaking became the first continuous dictation software for consumers, meaning one no longer had to pause between words.
Smart assistants
By 2001, speech recognition development had hit a plateau; then, in 2008, Google emerged with its Google Voice Search application for the iPhone. In 2010, Google introduced personalized recognition on Android devices, recording different users' voice queries to build an enhanced speech model trained on some 230 billion English words. Apple's Siri, which also relied on cloud computing, followed on the iPhone 4S in 2011.
The Breakthrough
A Stanford study found that speech recognition is now about three times as fast as typing on a mobile phone, and the word error rate, once 8.5%, has dropped to 4.9%. These technological advances have given rise to multiple applications, including transcription assistant tools such as Happy Scribe.
Little Known Facts About Speech Recognition Technology
Technically speaking, speech recognition goes way back to 1877 when Thomas Edison invented the phonograph, the first device to record and reproduce sound.
When it comes to speech recognition, accuracy is measured by a Word Error Rate calculation, which tracks how often a word is transcribed incorrectly.
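In practice, Word Error Rate is the word-level edit distance between a reference transcript and the recogniser's output (substitutions, deletions, and insertions), divided by the number of reference words. A minimal sketch of that calculation (the `wer` helper below is illustrative, not any particular product's implementation):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word Error Rate: word-level edit distance / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    # d[i][j] = min edits to turn the first i reference words
    # into the first j hypothesis words
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # i deletions
    for j in range(len(hyp) + 1):
        d[0][j] = j  # j insertions
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            substitution = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            deletion = d[i - 1][j] + 1
            insertion = d[i][j - 1] + 1
            d[i][j] = min(substitution, deletion, insertion)
    return d[len(ref)][len(hyp)] / len(ref)

# One substitution ("sat" -> "sit") and one deletion ("the"): 2 errors / 6 words
print(wer("the cat sat on the mat", "the cat sit on mat"))  # ≈ 0.333
```

A 4.9% error rate, in these terms, means roughly one word in twenty is transcribed incorrectly.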