Link Search Menu Expand Document

Last update: 2021-04-17

Version 0.01

About

Why?

A linguist friend was looking for pronunciation data for a class, and I tried to help by writing a quick script to scrape Wiktionary. After playing with that, I thought it would be interesting to have this information indexed by language, so I decided to build this website. Might be useful for someone else?

How?

All this information was extracted from Wiktionary!

You can check out how the code part works by checking the Github repository for this project. Briefly, there’s this list called: Terms with IPA pronunciation by language, I go one by one, and save 1) the term and 2) the pronunciation written with the International Phonetic Alphabet (IPA). That goes into tables, which are later on read and converted into markdown files to make this website.

You can download the tables with these results from here, or browse the glossaries.

Who?

Issues and contributions

The way the pronunciation data is submitted into Wiktionary is not standard, so there are a bunch of errors in the glossaries. If you let me know of an error by submitting an issue, I’ll do my best to get it fixed!

Versions

  • 0.01: 2021-04-17 Incomplete first version to test the site.