Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I loaded the language coverage into Datasette Lite and added some facets here:

https://lite.datasette.io/?json=https://gist.github.com/simo...

Here's how I did that: https://gist.github.com/simonw/63aa33ec827b093f9c6a2797df950...

Here are the top 20 represented language families:

    Niger-Congo 1,019
    Austronesian 609
    Sino-Tibetan 288
    Indo-European 278
    Afro-Asiatic 222
    Trans-New Guinea 219
    Otomanguean 149
    Nilo-Saharan 131
    Austro-Asiatic 100
    Dravidian 60
    Australian 51
    Creole 45
    Kra-Dai 43
    Uto-Aztecan 41
    Quechuan 36
    Language isolate 35
    Torricelli 32
    Maipurean 31
    Mayan 30
    Sepik 30


Thanks -- great way to visualize how massive this set of languages really is.


All this great work and the prior sota translations and Meta still only accepts Nort American english voice control in their VR equipment lol.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: