Originally I'm thinking of making a phoneme inventory generator.
In order to do that, I need to figure out the commonality of each possible phonemes.
In order to do that, I need to catalogue phonemes of human languages.
I know this has been done before in UPSID, but I'm adding several new fields on my data. The following are the columns I have figured out so far for my table
- Language, self explanatory. This column will be filled with ISO 639-3 code of the language.
- Phoneme, self explanatory. IPA Unicode will be used to fill this column. I think four character is enough... or is it?
- Nativity, whether the phoneme are native or not. For example /f/ and /z/ are observed in Indonesian, but it's not native.
- Underlying, the underlying representation of the phoneme. For example, many South Asian languages are listed with /n̪ t̪ d̪/ without alveolars to contrast them. The underlying representation of these phonemes would be /n t d/.
Any other column I need to consider in designing the table?
How about actual frequency in a dictionary or sample text? That seems like a central concern.
It's a good idea, I'll add a column for that.
But, as of now, those column will have default value of 0. it takes major effort to getting the required corpus and then analysing them.
Something to consider with English is what dialect you're considering, especially with vowels.
Maybe middle english then? Shit's easy. It only has /i i: u u: e e: o o: ɛː ɔː a a:/ Just like a normal sane language.
Here's an abomination I came up so far:
PALM - ɑ
LOT - ɒ
CLOTH - ɒ
BATH - ɑː
TRAP - æ
PRICE - ai
START - ar
MOUTH - au
THOUGHT - o
DRESS - e
COMMA - ə
FACE - ei
SQUARE - er
LETTER - ər
KIT - i
HAPPY - i
FLEECE - i:
NEAR - ir
CHOICE - oi
NORTH - or
GOAT - ou
FORCE - our
FOOT - u
GOOSE - u:
CURE - ur
STRUT - ʌ
NURSE - ʌr
Alice locks her signed copy of the book in the chest with her lock. She sends the chest to Bob. He slaps his lock on it, and sends the chest back to Alice. She correctly interprets this as a big middle finger from Bob, and takes a chainsaw to open up the chest and get her damn book back since he obviously doesn’t want it.