I would love etymological trees linked in this way
Data is Beautiful
Be respectful
Is PIE something like proto-indo-eurasian, or just something to do with pies?
and while I'm at it, how do I pronounce *h3?
There's a bunch of guesses on how *h₁ *h₂ and *h₃ were pronounced in this Wikipedia page. They're usually defined by their effect in child languages though, so it's possible that some of those were actually multiple sounds.
For *h₃ you'll often see values like [ɣʷ] or [ʁʷ]; a labialised consonant (to explain why it often turns nearby vowels into [o] ) and voiced (as there are some claims that it voices nearby consonants, mostly Cowgill's Law)
My personal guess for *h₃ is completely heterodox, [ɸ]~[β]. I think that it's directly associated with *b being so uncommon in PIE.
Wow, that article is all proto-indo-european to me!
- Happy Lemmiversary
- I wish we could follow individual users because I could listen to you talk about PIE aitches for the next couple thousand years
Thank you! Sadly, I don't talk too much about PIE in Lemmy because... well, it's kind of a niche subject that most users don't care too much about.
Feel free to ask for further info on stuff, though. I do enjoy talking about it!
it's not eurasian because the family is centered around europe and only extends to about india, chinese/japanese/korean are a separate tree.
I wonder if something like the semantic tokenization method would benefit from using etymological data like this, particularly for a multilingual llm.
i know that my NN internally uses semantic tokenization method.
i literally often seek the word roots when talking to somebody. it helps me focus.
Cool diagram! Would be better if it pointed out that the Portuguese word "real" only refers to currency in Brazil, not Portugal. The origin appears correct and the word is used in Portugal either to say something is "regal" or "real".
Cross posted from: Latin@lemm.ee
...communitatem de linguā latinā habemus? Nesciebam! /me subscribit
Feel free to cross-post this stuff in !linguistics@mander.xyz by the way, it's right into the comm's alley.
Brought to you in indecipherable colors for color-blind people
That diagram 📏 !
This is very cool.
Makes me think about how "ject" lives on in so many words.
I injected [adjective] objects into an objecting subject, but was rejected and ejected for the lack of conjecture in my self-projecting project(!)
Did I miss any? Probably! ... What does "ject" even mean in and of itself?
My mind is really reading "tritium" instead of something like "re" with that h3reg in the middle.
Also interesting to note: the word "rial" in arabic, also used to denote currency, descends from the portuguese/spanish real
No ruler, as in the measuring device?
How come everybody dropped the h sound?
Those are placeholders. "We don't know what this sound is supposed to be, so we plop h+number there and call it a day." You'll see some reconstructions using *ə₁ *ə₂ *ə₃ instead, same deal.
That said, the Anatolian languages (Hittite, Luwian etc. - the whole branch is extinct) preserved a few of those laryngeals; compare for example Latin ⟨ouis⟩ and Hittite ⟨𒇻𒅖⟩ ḫāwis, from PIE *h₂ówis (sheep). Since Anatolian split way before the other languages, this makes me wonder if they weren't vocalised already in Late Proto-Indo-European.