Text to speech in digital television

Text to speech in digital television refers to digital television products that use speech synthesis (computer-generated speech that “talks” to the end user) to provide access for blind or partially sighted people. By combining a digital television receiver (a television, set-top box, personal video recorder, or other type of receiver) with a speech synthesis engine, such products let blind and partially sighted people access information that is normally displayed visually, so that they can operate the receiver’s menus and electronic program guides.

User need

Using an audiovisual medium causes problems for certain people with disabilities, notably individuals with sight or hearing loss. These problems can be split into interface accessibility barriers and impediments to using the content itself. Text-to-speech features in television products help address interface accessibility barriers for blind and partially sighted people who may be unable to use the standard visual interface, or even special features such as large fonts, magnifiers, and adjustable color schemes.

Digital television products are often more complicated than their analog predecessors.[1] Using a digital TV depends on being able to navigate many menus, to see on-screen program information, and to browse electronic program guides or on-screen content listings to find out what is available to watch.

Policy makers across the world have recognized the importance of access to (digital) television. Recital 64 of the EU’s Audiovisual Media Services Directive (AVMS)[2] states: "The right of persons with a disability and of the elderly to participate and be integrated in the social and cultural life of the Community is inextricably linked to the provision of accessible audiovisual media services." The initial report of a European Commission study, "Measuring progress of eAccessibility in Europe", refers to television as one of a set of fields "that are now essential elements of social and economic life." The United Nations Convention on the Rights of Persons with Disabilities[3] makes specific reference to television access in Article 30(1) ("Participation in cultural life, recreation, leisure and sport"): "States Parties recognize the right of persons with disabilities to take part on an equal basis with others in cultural life, and shall take all appropriate measures to ensure that persons with disabilities: [...] b. Enjoy access to television programmes, films, theatre and other cultural activities, in accessible formats."

History

Text-to-speech software has been widely available for desktop computers since the 1990s, and the steady increases in CPU and memory capabilities described by Moore’s Law have made it more feasible to include text-to-speech in software and hardware products. In the wake of these trends, text-to-speech is finding its way into everyday consumer electronics.[4] In addition to text-to-speech solutions for computers, there are now talking watches and clocks, calendars, thermometers, kitchen aids, and many other products. Talking books and GPS navigation systems have become widely used as well.[5]

Organizations representing blind and partially sighted people are long-standing supporters of text-to-speech technology in consumer electronics. In the UK, the Royal National Institute of Blind People (RNIB) has been advocating for speaking radio and television products since the early part of the 21st century and has supported manufacturers in creating such solutions.[6]

The Digital TV Group, the UK industry association for digital TV, first discussed the topic in 2007 and in 2009 brought the industry together to write a technical specification for text to speech in the horizontal market. This collaboration formed part of the UK Government BERR Usability Action Plan.[7] When complete, the plan was submitted to DigitalEurope for ETSI standardization and also published as a white paper. It was later incorporated into the U-Book (UK Digital TV Usability and Accessibility Guidelines, including text to speech).[8]

In 2010, two talking products for digital television came onto the market in the UK. The Sky Talker is an add-on for the Sky set-top box. It provides talking features for program and channel information and playback control, and is operated through the standard Sky remote control. In the same year, the Smart Talk Freeview (terrestrial digital broadcasting) set-top box was launched in the UK. This is a Goodmans-branded Freeview set-top box, developed by a partnership between Harvard International Ltd and the RNIB. It was the first complete talking solution for digital television in the UK: it speaks the electronic program guide and the menus, and provides spoken assistance during setup.

In Japan, both Panasonic and Mitsubishi Electric have been producing talking television and Blu-ray products since 2010. According to information compiled by the Japanese blindness organization Lighthouse for the Blind, there are some 70 products from Mitsubishi and Panasonic with talking features.[9]

Around 2011, in Spain, a talking Linux-based set-top box using the free Festival text-to-speech engine was distributed free of charge to blind and visually impaired people by the Ministry of Industry, Tourism and Trade. This product is no longer available.

In 2012, Panasonic launched its Voice Guidance solution on the UK market.[10] Voice Guidance is a set of talking features for its Viera range from 2012 onward. It announces on-screen information on the most important menus and supports reminders, recordings, and playback functions. It is available for Freesat and Freeview receivers. In creating its solution, Panasonic took into account advice from RNIB experts.[11]

Also in 2012, TVonics, a former UK digital video recorder maker, launched its talking PVR solution: a twin-tuner Freeview HD recorder based on the Ivona TTS engine, which is widely lauded by disability groups for its high-quality voice. The TVonics solution was essentially a software addition to its existing platform and could be deployed as a software upgrade to customers of existing products. TVonics went into administration in June 2012.[12] The RNIB acquired the core DVR IP, including the text-to-speech system. The TVonics brand was bought by Peterborough-based Pulse-Eight.

List of possible text-to-speech enabled features

Interaction with interactive services and widgets.

Implementation guidance and standardization

An early effort to capture the user requirements and define a functional specification was undertaken by the Digital TV Group (DTG) in the UK, which published a White Paper on the subject. This White Paper has since been incorporated into the UK Digital TV Usability and Accessibility Guidelines[13] (known as the U-Book). The same White Paper was also used as the basis for a discussion on text-to-speech for television between disability user groups and DigitalEurope, a European industry body for manufacturers of consumer equipment. The DigitalEurope work stream led to the International Electrotechnical Commission (IEC) setting up a project group (IEC 62731) to create an International Standard for text-to-speech in digital television. The first edition of the standard, IEC 62731:2013, was officially published as an International Standard in January 2013.[14] The Standard does not dictate implementation; it provides a functional description of how a text-to-speech enabled television product should behave and what it should speak.

Notes and References

  1. Danker, Daniel (2 Mar 2012). "Me and My TV - How Can we Connect?" (PDF). BBC Internet Blog. Retrieved 2013-02-17.
  2. Directive 2007/65/EC (11 Dec 2007), amending Council Directive 89/552/EEC on the coordination of certain provisions laid down by law, regulation or administrative action in Member States concerning the pursuit of television broadcasting activities. 32007L0065.
  3. United Nations (2006). "Convention on the Rights of Persons with Disabilities". United Nations. Retrieved 2013-02-17.
  4. "What is Text-to-Speech (TTS), and How Does It Work?". Media Whale. 2021-11-08. Retrieved 2022-05-23.
  5. RNIB. "Top ten talking products". RNIB. Retrieved 2013-02-17.
  6. RNIB (6 September 2012). "Are you really listening?". RNIB. Retrieved 2013-02-17.
  7. "Usability Action Plan".
  8. "UK Digital TV Usability and Accessibility Guidelines, including Text to Speech".
  9. NipponLighthouse. "日本ライトハウス情報文化センター - 音声読み上げ機能付き地デジテレビ 品番リスト" [A list of models with digital TV and text-to-speech support] (in Japanese). http://www.iccb.jp/%E5%9C%B0%E3%83%87%E3%82%B8/%E9%9F%B3%E5%A3%B0%E8%AA%AD%E3%81%BF%E4%B8%8A%E3%81%92%E6%A9%9F%E8%83%BD%E4%BB%98%E3%81%8D%E5%9C%B0%E3%83%87%E3%82%B8%E3%83%86%E3%83%AC%E3%83%93%E3%80%80%E5%93%81%E7%95%AA%E3%83%AA%E3%82%B9%E3%83%88/ . Retrieved 2013-02-17.
  10. Panasonic (27 March 2012). "Panasonic Launches Range of Talking TVs". Panasonic. Retrieved 2013-02-17.
  11. RNIB (10 July 2012). "Panasonic television with Voice Guidance". RNIB. Retrieved 2013-02-17. Quote: "With advice from RNIB experts."
  12. Whitfield, Nigel (27 June 2012). "Administrator eyes DVR firesale after TVonics collapse - Freeview HD recorder firm founders". The Register. Retrieved 2013-02-17.
  13. "Books and White Papers" (PDF). UK Digital TV Usability and Accessibility Guidelines, including Text to Speech. Digital TV Group. September 2011. Retrieved 2013-02-17.
  14. International Electrotechnical Commission (29 Jan 2013). "IEC 62731 ed1.0: Text-to-speech for television - General requirements". International Electrotechnical Commission. Retrieved 2013-02-17.