Microsoft Translator Explained

Microsoft Translator
Url:
Type:Machine translation
Registration:Optional
Language:see below
Owner:Microsoft
Current Status:Active

Microsoft Translator or Bing Translator is a multilingual machine translation cloud service provided by Microsoft. Microsoft Translator is a part of Microsoft Cognitive Services[1] and integrated across multiple consumer, developer, and enterprise products, including Bing, Microsoft Office, SharePoint, Microsoft Edge, Microsoft Lync, Yammer, Skype Translator, Visual Studio, and Microsoft Translator apps for Windows, Windows Phone, iPhone and Apple Watch, and Android phone and Android Wear.

Microsoft Translator also offers text and speech translation through cloud services for businesses. Service for text translation via the Translator Text API ranges from a free tier supporting two million characters per month to paid tiers supporting billions of characters per month.[2] Speech translation via Microsoft Speech services is offered based on the time of the audio stream.

The service supports text translation between many languages and language varieties. It also supports several speech translation systems that currently power the Microsoft Translator live conversation feature, Skype Translator, and Skype for Windows Desktop, and the Microsoft Translator Apps for iOS and Android.[3]

Development

History

The first version of Microsoft's machine translation system was developed between 1999 and 2000 within Microsoft Research. This system was based on semantic predicate-argument structures known as logical forms (LF) and was spun from the grammar correction feature developed for Microsoft Word. This system was eventually used to translate the entire Microsoft Knowledge Base into Spanish, French, German, and Japanese.[4]

Microsoft's approach to machine translation, like most modern machine translation systems, is "data driven":[5] rather than relying on writing explicit rules to translate natural language, algorithms are trained to understand and interpret translated parallel texts, allowing them to automatically learn how to translate new natural language text. Microsoft's experience with the LF system led directly to a treelet translation system that simplified the LF to dependency trees and eventually to an order template model, significantly improving in speed and enabling the incorporation of new target languages.

The consumer-facing translation site known as Bing Translator (previously known as Windows Live translator) was launched in 2007 and provides free text and website translations on the web. Text is translated directly within the Bing Translator webpage while websites are translated through the Bilingual Viewer tools.

In 2011, the service was extended to include numerous Microsoft Translator products through a cloud-based application programming interface, which supports products available to both consumer and enterprise users. An additional speech translation capability was introduced in March 2016.[6]

In May 2018, an update to the API was introduced. This new version offered neural machine translation as the default method of translating. In addition to translation, the new version features transliteration and a bilingual dictionary to look up words to find alternative translations and view examples in sentences.[7]

Speech translation was integrated into Microsoft Speech services in September 2018, providing end-to-end speech, speech-to-text, and text-to-speech translation.

Translation methodology and research

Microsoft Translator uses machine translation to create instantaneous translations from one natural language to another. This system is based on four distinct areas of computer learning research seen below.[5]

Type of learning Impact on translation
Neural networks try to mimic how the brain works to translate between languages. At a high level, neural network translation works in two stages. First, a first stage models the word that needs to be translated based on the context of this word (and its possible translations) within the full sentence. Second, the neural network translates this word model (not the word itself but the model the neural networks built of it), within the context of the sentence, into the other language.[8] Neural machine translation is the default translation method for the Microsoft Translator API.
Syntax-based SMT
Syntax-based translation is based on the idea of translating syntactic units, rather than a word or string of words. Microsoft has used syntax-based SMT to translate much of its computer-related texts from English into multiple target languages. Ongoing research in this area has produced improvements in word inflections and word ordering.
Phrase-based SMT In phrase-based SMT, the machine learns correspondence between languages from parallel text without the aid of linguist knowledge. This produces better translations in less time than other systems.
SMT systems rely on existing translated data to learn how to automatically translate from one language to another. To train the systems, identifying word correspondences (or word alignments) is crucial. Microsoft has developed work in both discriminative[9] and generative[10] approaches to word alignment, resulting in faster algorithms and higher quality and translations.
Language modeling Language modeling uses n-gram models to construct comprehensible translations in the target language. This ensures that the output translation is fluent and readable.

Accuracy

The quality of Microsoft Translator's machine translation outputs are evaluated using a method called the BLEU score.[11]

BLEU (Bilingual Evaluation Understudy) is an algorithm for evaluating the quality of text which has been machine-translated from one natural language to another. Quality is considered to be the correspondence between a machine's output and that of a human. BLEU was one of the first metrics to achieve a high correlation with human judgments of quality, and remains one of the most popular automated and inexpensive metrics.

Because machine translation is based on statistical algorithms rather than human translators, the automatic translations it produces are not always entirely accurate. Microsoft Translator has introduced various feedback features, such as the Collaborative Translation Framework, into its products to allow users to suggest alternative translations. These alternative translations are then integrated into the Microsoft Translator algorithms to improve future translations.

In November 2016, Microsoft Translator introduced translation using deep neural networks in nine of its highest-traffic languages, including all of its speech languages and Japanese. Neural networks provide better translation than industry standard statistical machine learning.[12]

Core products

Microsoft Translator is a cloud-based API that is integrated into numerous Microsoft products and services.[13] The Translator API can be used on its own and can be customized for use in a pre-publishing or post-publishing environment. The API, which is available through subscription, is free for lower translation volumes, and is charged according to a tiered payment system for volumes exceeding two million characters per month.[2] The remaining core products are available for free.

Microsoft Translator cloud translation

The Microsoft Translator is a cloud-based automatic translation service that can be used to build applications, websites, and tools requiring multi-language support.

Custom Translator

Custom Translator is a feature of the Microsoft Translator services that allows enterprises, app developers, and language service providers to build neural translation systems that understand the terminology used in their own business and industry. The customized translations can then be delivered into existing applications, workflows and websites using a normal call to the Microsoft Translator API. Custom Translator can be used when translating text with the Microsoft Translator Text API and when translating speech with Microsoft Speech services.[14]

Live feature

A personal universal translator that enables up to 500 people to have live, multi-device, multi-language, in-person translated conversations.[15] This feature is currently free and available in the Microsoft Translator apps (Android, iOS or Windows) and from the Microsoft Translator website.

Microsoft Translator Hub

The Microsoft Translator Hub allows enterprises and language service providers to build their own translation systems that understand business- and industry-specific terminology.[16] The Hub can also be used in conjunction with the CTF, allowing administrators to approve CTF results and add them directly to the Hub. The Microsoft Translator Hub is only available for statistical machine translation and cannot be used with the newest version of the Microsoft Translator API.

The Hub has also been used for language preservation, allowing communities to create their own language translation systems for language and cultural preservation.[17] The Hub has been used to create translation systems for languages such as Hmong, Mayan, Nepali, and Welsh.

Multilingual App Toolkit (MAT)

The Multilingual App Toolkit (MAT) is an integrated Visual Studio tool, which allows developers to streamline localization workflows of their Windows, Windows Phone and desktop apps.[18] MAT improves localization of file management, translation support, and editing tools.

Bing Microsoft Translator web app

Microsoft Translator
Type:Neural machine translation
Registration:Optional
Language:See below
Owner:Microsoft
Launch Date: (as Live Search Translator)
Current Status:Active

Bing Microsoft Translator (previously Live Search Translator, Windows Live Translator, and Bing Translator)[19] is a user-facing translation portal provided by Microsoft as part of its Bing services to translate texts or entire web pages into different languages. All translation pairs are powered by the Microsoft Translator, a neural machine translation platform and web service, developed by Microsoft Research, as its backend translation software. Two transliteration pairs (between simplified Chinese and traditional Chinese) are provided by Microsoft's Windows International team.[20]

Bing Translator can translate phrases entered by the user or acquire a link to a web page and translate it entirely. When translating an entire web page, or when the user selects "Translate this page" in Bing search results, the Bilingual Viewer is shown, which allows users to browse the original web page text and translation in parallel, supported by synchronized highlights, scrolling, and navigation.[21] Four Bilingual Viewer layouts are available:

Bing Translator integrates with several other Microsoft products. The following is a table of products into which Bing Translator could be integrated:

scope=col Integrates intoscope=col Means of integration
scope=row Bing Instant AnswersAlready integrated
scope=row Internet ExplorerAn Accelerator for Internet Explorer 8 or higher

Supported products

Through its core product offerings, Microsoft Translator supports the translation features of many Microsoft products at the consumer and enterprise levels. These products fall broadly into three categories[22] —communication products, Microsoft Office, and apps.

Communication

Microsoft Office

Apps

Deprecated products

Collaborative Translation Framework (CTF)

The Collaborative Translation Framework (CTF) is an extension of the Microsoft Translator API that allows post-publishing improvement of translated text.[23] By using the CTF, readers have the ability to suggest alternative translations to those provided by the API, or vote on previously offered alternatives. This information is then delivered to the API to improve future translations.

Translator Web Widget

The Translator Web Widget is a translation tool that can be added to web pages by pasting a predefined snippet of JavaScript code into the page.[24] The web widget is offered for free by Microsoft, and supports both pre-publishing customized translations using the Translator Hub, and post-publishing improvements using the Collaborative Translation Framework.

Supported languages

, Microsoft Translator supports 134 languages and language varieties.[3] [25] The list of supported languages is available at the Microsoft Translator website and can also be retrieved programmatically through the cloud services.[26]

  1. Afrikaans
  2. Albanian
  3. Amharic
  4. Arabic
  5. Armenian
  6. Assamese
  7. Azerbaijani
  8. Bangla
  9. Bashkir
  10. Basque
  11. Bhojpuri
  12. Bodo
  13. Bosnian
  14. Bulgarian
  15. Cantonese (Traditional)
  16. Catalan
  17. Chhattisgarhi
  18. Chinese (Literary)
  19. Chinese (Simplified)
  20. Chinese (Traditional)
  21. Croatian
  22. Czech
  23. Danish
  24. Dari
  25. Divehi
  26. Dogri
  27. Dutch
  28. English
  29. English (United Kingdom)
  30. Estonian
  31. Faroese
  32. Fijian
  33. Filipino
  34. Finnish
  35. French (France)
  36. French (Canada)
  37. Galician
  38. Ganda
  39. Georgian
  40. German
  41. Greek
  42. Gujarati
  43. Haitian Creole
  44. Hausa
  45. Hebrew
  46. Hindi
  47. Hmong Daw
  48. Hungarian
  49. Icelandic
  50. Igbo
  51. Indonesian
  52. Inuinnaqtun
  53. Inuktitut
  54. Inuktitut (Latin)
  55. Irish
  56. Italian
  57. Japanese
  58. Kannada
  59. Kashmiri
  60. Kazakh
  61. Khmer
  62. Kinyarwanda
  63. Klingon (Latin)
  64. Konkani
  65. Korean
  66. Kurdish (Central)
  67. Kurdish (Northern)
  68. Kyrgyz
  69. Lao
  70. Latvian
  71. Lingala
  72. Lithuanian
  73. Lower Sorbian
  74. Macedonian
  75. Maithili
  76. Malagasy
  77. Malay
  78. Malayalam
  79. Maltese
  80. Marathi
  81. Mongolian (Cyrillic)
  82. Mongolian (Traditional)
  83. Myanmar (Burmese)
  84. Māori
  85. Nepali
  86. Norwegian
  87. Nyanja
  88. Odia
  89. Pashto
  90. Persian
  91. Polish
  92. Portuguese (Brazil)
  93. Portuguese (Portugal)
  94. Punjabi (Gurmukhi)
  95. Querétaro Otomi
  96. Romanian
  97. Rundi
  98. Russian
  99. Samoan
  100. Serbian (Cyrillic)
  101. Serbian (Latin)
  102. Sesotho
  103. Sesotho sa Leboa
  104. Setswana
  105. Shona
  106. Sindhi
  107. Sinhala
  108. Slovak
  109. Slovenian
  110. Somali
  111. Spanish
  112. Swahili
  113. Swedish
  114. Tahitian
  115. Tamil
  116. Tatar
  117. Telugu
  118. Thai
  119. Tibetan
  120. Tigrinya
  121. Tongan
  122. Turkish
  123. Turkmen
  124. Ukrainian
  125. Upper Sorbian
  126. Urdu
  127. Uyghur
  128. Uzbek (Latin)
  129. Vietnamese
  130. Welsh
  131. Xhosa
  132. Yoruba
  133. Yucatec Maya
  134. Zulu

Languages formerly in development and beta version

, Microsoft Translator has a beta version only time of depends in the 100+ languages there are available in Microsoft.

    1. Abkhaz Acehnese (Latin Script)
      1. Adyghe Afar Ainu
  1. Arabic (Egyptian)
  2. Asturian
  3. Avar
    1. Aymara Balinese
    2. Bambara Baoulé
  4. Batak Karo
  5. Batak Toba
    1. Belarusian Bemba
  6. Betawi
  7. Bikol
  8. Buryat
  9. Cebuano
  10. Chamorro
  11. Cocama
  12. Corsican
  13. Crimean Tatar (Cyrillic)
  14. Crimean Tatar (Latin)
  15. Cuyonon
  16. Deccani
  17. Dinka
  18. Dombe
  19. Dzongkha
  20. Emerillon
  21. Esperanto
  22. Ewe
  23. Fang
  24. Fon
  25. Frisian
  26. Friulian
  27. Fulani
    1. Gagauz Ghomala'
  28. Guadeloupean Creole
  29. Haryanvi
  30. Herero
  31. Hiligaynon
  32. Hunsrik
  33. Iban
  34. Ilocano
    1. Ingush Jamaican Patois
  35. Javanese (Traditional)
  36. Javanese (Indonesian script)
  37. Jingpo
  38. Kabuverdianu
  39. Kalaallisut
  40. Kapampangan
  41. Kikongo
  42. Kituba
  43. Koryak
  44. Krio
    1. Latin Ligurian
  45. Limburgish
  46. Lombard
  47. Luo
  48. Luxembourgish
  49. Madurese
  50. Makassar
  51. Mapuche
  52. Marshallese
  53. Marwari (India)
  54. Mauritian Creole
    1. Meadow Mari Minang
  55. Moroccan Arabic
  56. Nahuatl (Eastern Huasteca)
  57. Nepalbhasa (Newari)
  58. NKo
  59. Norwegian (Nynorsk)
  60. Nuer
  61. Occitan
  62. Okinawan
    1. Ossetian Pangasinan
  63. Papiamento
  64. Pohnpeian
  65. Punjabi (Shahmukhi)
  66. Q'eqchi'
  67. Quechua
  68. Quorato Spanish Creole
  69. Rarotongan
  70. Romani
  71. Sami (North)
  72. Samogitian
  73. Sango
  74. Sanskrit
  75. Santali
  76. Sardinian
  77. Scots Gaelic
  78. Seediq
  79. Seselwa Creole French
  80. Shan
    1. Siberian Tatar Sicilian
  81. Standard Arabic
  82. Sundanese
  83. Susu
  84. Tajik
  85. Tamazight
  86. Tamazight (Tifinagh)
  87. Tausug
  88. Tetun
  89. Tok Pisin
    1. Tuvan Twi
    2. Venda Venetian
  90. Waray
  91. Wolof
  92. Xamtanga
    1. Yakut Yiddish
  93. Zapotec
  94. Zeelandic
  95. Zhuang

Community partners

Microsoft Translator has engaged with community partners to increase the number of languages and to improve overall language translation quality. Below is a list of community partners that Microsoft Translator has teamed with.[27]

Additionally, Microsoft has teamed with the Klingon Language Institute, which promotes the constructed language, Klingon, which is used within the fictional Star Trek universe produced by Paramount and CBS Studios. Klingon has been supported by Microsoft Translator since May 2013.[28]

See also

External links

SW development

Notes and References

  1. Book: Carmen., Steiner. Pilotstudie: über die Funktion des Anhebens der Augenbrauen in der Deutschschweizerischen Gebärdensprache DSGS. 2000. Verein zur Unterstützung der Gebärdensprache der Gehörlosen. 81702109.
  2. Web site: Azure Data Marketplace- Microsoft Translator. 2014-12-08. https://web.archive.org/web/20180206203347/http://datamarket.azure.com/dataset/bing/microsofttranslator. 2018-02-06. dead.
  3. Web site: Microsoft Translator- Languages. Microsoft. January 6, 2023.
  4. Web site: Microsoft Research- Arul Menezes.
  5. Web site: Microsoft Research- Machine Translation.
  6. Web site: Microsoft Translator brings end-to-end speech translation to everyone with the world's first Speech Translation API.
  7. Web site: Microsoft Translator- Translator Text API. Microsoft. dead. https://web.archive.org/web/20190902184414/https://www.microsoft.com/en-us/translator/business/translator-api/. 2019-09-02.
  8. Web site: What is neural network based translation?. 2016-11-28. 2021-02-08. https://web.archive.org/web/20210208195336/https://microsofttranslator.uservoice.com/knowledgebase/articles/1099027-what-is-neural-network-based-translation. dead.
  9. Web site: A Discriminative Framework for Bilingual Word Alignment. https://web.archive.org/web/20080720142028/http://www.aclweb.org/anthology-new/H/H05/H05-1011.pdf. dead. 2008-07-20.
  10. Web site: Using Word Dependent Transition Models in HMM based Word Alignment for Statistical Machine Translation.
  11. Web site: Microsoft Translator Hub: Discussion of BLEU Score. 23 May 2013 .
  12. Web site: Microsoft Translator launching Neural Network based translations for all its speech languages. 16 November 2016.
  13. Web site: Microsoft Translator- Products. Microsoft.
  14. Web site: Microsoft Translator- Customization. Microsoft.
  15. Web site: Microsoft Translator live feature. Microsoft. 18 July 2023 .
  16. Web site: Microsoft Translator- Translator Hub. Microsoft.
  17. Web site: Where Language Meets the World: Microsoft Translator Hub. YouTube. 4 September 2014 . https://ghostarchive.org/varchive/youtube/20211220/UnROjeEHqDE . 2021-12-20 . live.
  18. Web site: Microsoft Translator- Multilingual App Toolkit. Microsoft.
  19. Book: Antoniou. Grigoris. Grobelnik. Marko. Simperl. Elena. Parsia. Bijan. Plexousakis. Dimitris. de Leenheer. Pieter. Pan. Jeff Z.. The Semantic Web: Research and Applications.. 2011. Springer Science+Business Media. Berlin. 9783642210334. 341.
  20. Web site: Translation powered by....Microsoft Translator! . Microsoft Translator (and Bing Translator) Official Team Blog . Microsoft Corporation . 8 September 2008 . 21 October 2010 .
  21. Web site: About Bing Translator. Bing Translator. Microsoft. 20 December 2014.
  22. Web site: Microsoft Translator- Products. Microsoft.
  23. Web site: Microsoft Translator- Collaborative Translation Framework. Microsoft. https://web.archive.org/web/20150330064733/http://www.microsoft.com/translator/collaborative-translation-framework.aspx. 30 March 2015. dead.
  24. Web site: Microsoft Translator- Translator Web Widget. Microsoft.
  25. Web site: Search Microsoft Translator . 2024-10-31 . www.bing.com.
  26. Web site: Translator Language Codes. 18 July 2023 .
  27. Web site: Translator Community Partners. Microsoft.
  28. Web site: New Bing Translator Option Lets You Translate... Klingon. 14 May 2013 .