An Indonesian Local Language Dataset Archive
The Indonesian language is considered a "rising star" (Microsoft) in terms of language resources, owing to its strong and thriving online presence. Indonesia also has more than 700 local languages spread over 17,508 islands, making it the country with the second-largest number of spoken languages in the world.
This archive aims to start a grassroots effort in Southeast Asia to strengthen and spur machine translation (MT) research in the region, and to support researchers interested in collecting and working on any language data.
State-of-the-art translation systems exist for only 100 languages. Many languages still have no training data or existing translation technology.