Bing Translation Learns Hmong Language
Written by Ian Elliot   
Thursday, 23 February 2012

Microsoft has added Hmong  to the list of languages supported by Bing Translator. This was achieved using new features of Microsoft Translator to train a new translation engine using parallel documents. 

Hmong is spoken by over 6.5 million people mainly in Southeast Asia and China but also by communities in the United States, Australia, and France. The dialect supported by Bing Translator is Hmong Daw.

Hmong has a very long oral tradition, but the writing system was developed just 60 years ago. The Hmong community, which was instrumental in getting this language added to Bing, is concerned with the long-term preservation of the language and sees an important role for online machine translation in this context.

 

 

Members of the community used new tools currently in beta to train the new translation engine.

According to the announcement on the MSDN blog:

Microsoft Translator ... can learn how to translate from a set of parallel documents (same document in two languages), dictionaries and texts in the language to translate to (Hmong in this case).

In addition to teaching the engine a new language, they also involved members of the community, partners and collaborators to create and review improved versions of the automated translation system, and collect qualitative feedback about each “trained” system. Deploying a system that reaches a certain level of quality allows seamless use with the standard Microsoft Translator APIs, and many scenarios powered by the API, like the web translation widget. Feedback that is generated through these scenarios can be utilized again in the training process – creating a virtuous loop for improving the translation quality.


hmong

(click to enlarge)

Microsoft added this latest language on February 21st in celebration of International Mother Language Day.

According to its page on the United Nations website , International Mother Language Day  has been observed in February every year since 2000 to promote linguistic and cultural diversity and multilingualism. The date represents the day in 1952 when students demonstrating for recognition of their language, Bangla, as one of the two national languages of the then Pakistan, were shot and killed by police in Dhaka, the capital of what is now Bangladesh.

 

 

There are now 38 languages available using Bing Translator, which leaves a lot of scope for more to be added.

You can sign up for the Microsoft Translator API using Windows Azure Marketplace and can try out the functionality of the Translator API, without having to write any code, on the Marketplace itself.

More Information

Bing Translator

International Mother Language Day

Related Articles

Getting started with Bing Translate

Bing Translate - an alternative?

 

espbook

 

Comments




or email your comment to: comments@i-programmer.info

 

To be informed about new articles on I Programmer, subscribe to the RSS feed, follow us on Google+, Twitter, Linkedin or Facebook or sign up for our weekly newsletter.

Banner


52nd Mersenne Prime Found
27/10/2024

It has been nearly six years since the last Mersenne prime was discovered. Now, at last, we have Mersenne prime number 52 and it has 41,024,320 digits!



Google Opensources Privacy Library
08/11/2024

Google is making a new differential privacy library available as open source. PipelineDP4J is a Java-based library that can be used to analyse data sets while preserving privacy.


More News

Last Updated ( Thursday, 23 February 2012 )