lvxferre

joined 4 years ago
MODERATOR OF
[–] lvxferre@lemmy.ml 3 points 2 years ago

Even here in South America, depending on the region, they're invasive.

[–] lvxferre@lemmy.ml 3 points 2 years ago (1 children)

Finally, some actual argumentation. Enough to convince me, at least - specially the first paragraph.

[–] lvxferre@lemmy.ml 14 points 2 years ago* (last edited 2 years ago)

Let's go simpler: what if your instance was allowed to copy the fed/defed lists from other instances, and use them (alongside simple Boolean logic plus if/then statements) to automatically decide who you're going to federate/defederate with? That would enable caracoles and fedifams for admins who so desire, but also enable other organically grown relations.

For example. Let's say that you just joined the federation. And there are three instances that you somewhat trust:

  • Alice - it defederates only really problematic instances.
  • Bob and Charlie - both are a bit prone to defederate other instances on a whim, but when both defed the same instance it's usually problematic.

Then you could set up your defederation rules like this:

  • if Alice defed it, then defed it too.
  • else, if (Bob defed it) and (Charlie defed it), then defed it too.
  • else, federate with it.

Of course, that would require distinguishing between manual and automatic fed/defed. You'd be able to use the manual fed/defed from other instances to create your automatic rules, to avoid deadlocks like "Alice is blocking it because Bob is blocking it, and Bob is blocking it because Alice is doing it".

[–] lvxferre@lemmy.ml 11 points 2 years ago* (last edited 2 years ago) (3 children)

Do tell me more of how Old People are not the target of discrimination¹, yout'².

You're 1) distorting what I said, and 2) being an assumer.

Discrimination can happen against any group. However, it's considerably worse when it's geared towards marginalised groups, as they have less ways to deal with it. That makes your analogy with a racial group (black people) a lot flawed.

Note, I do not think that insults against old people are "cool". However they're considerably less worse than insults towards black people.

The links that you've posted - that you clearly didn't even bother to read yourself - are evidence of discrimination in a very specific environment (workplace). They are not evidence of marginalisation.

[–] lvxferre@lemmy.ml 13 points 2 years ago

[Speaking as a user] Yeah, it looks inorganic for me too. As another user said the gold trim signals that the post was gilded, gilding nowadays works like a "mega-upvote" and gives it that trim. It's possible that defenders of the Jewish genocide (Holocaust) and of the Palestinian genocide (the ongoing war) are dumping money in those posts, to promote their shitty discourses.

And that works really well in Reddit because the userbase there loves some oversimplification: they have a really hard time decoupling the Jewish people from the State of Israel.

[–] lvxferre@lemmy.ml 12 points 2 years ago* (last edited 2 years ago)

[Speaking as a mod] Given the topic I'll be monitoring this thread carefully. As such, if anyone here is eager to either promote fascism (rule #1) or witch hunt (to point fingers towards other users based on assumptions or faulty reasoning - rule #1, rule #4): don't. Also remember that the topic of this community is Reddit, there's a lot of leeway for non-divisive off-topic but please don't go overboard.

And if anyone here has concerns that some other user is doing either thing, please use the report button, OK?

[–] lvxferre@lemmy.ml 14 points 2 years ago (5 children)

Just because the bigotry is aimed at old People doesn’t make it cool.

I get your point and I partially agree with it, but note that there's a big difference between old people and people from racial groups in USA (based on your references): the later are marginalised groups.

...for keeping the rhyme, perhaps "Doom-like shoot and boom"? Lots of exploding enemies in Doom, but no boomer reference.

[–] lvxferre@lemmy.ml 3 points 2 years ago

Aaaaah. I really, really wanted to complain about the excessive amount of keys.

(My comment above is partially a joke - don't take it too seriously. Even if a new key was added it would be a bit more clutter, but not that big of a deal.)

[–] lvxferre@lemmy.ml 4 points 2 years ago* (last edited 2 years ago)

The source that I've linked mentions semantic embedding; so does further literature on the internet. However, the operations are still being performed with the vectors resulting from the tokens themselves, with said embedding playing a secondary role.

This is evident for example through excerpts like

The token embeddings map a token ID to a fixed-size vector with some semantic meaning of the tokens. These brings some interesting properties: similar tokens will have a similar embedding (in other words, calculating the cosine similarity between two embeddings will give us a good idea of how similar the tokens are).

Emphasis mine. A similar conclusion (that the LLM is still handling the tokens, not their meaning) can be reached by analysing the hallucinations that your typical LLM bot outputs, and asking why that hallu is there.

What I'm proposing is deeper than that. It's to use the input tokens (i.e. morphemes) only to retrieve the sememes (units of meaning; further info here) that they're conveying, then discard the tokens themselves, and perform the operations solely on the sememes. Then for the output you translate the sememes obtained by the transformer into morphemes=tokens again.

I believe that this would have two big benefits:

  1. The amount of data necessary to "train" the LLM will decrease. Perhaps by orders of magnitude.
  2. A major type of hallucination will go away: self-contradiction (for example: states that A exists, then that A doesn't exist).

And it might be an additional layer, but the whole approach is considerably simpler than what's being done currently - pretending that the tokens themselves have some intrinsic value, then playing whack-a-mole with situations where the token and the contextually assigned value (by the human using the LLM) differ.

[This could even go deeper, handling a pragmatic layer beyond the tokens/morphemes and the units of meaning/sememes. It would be closer to what @njordomir@lemmy.world understood from my other comment, as it would then deal with the intent of the utterance.]

[–] lvxferre@lemmy.ml 3 points 2 years ago

Soap and water do wonders for 90% of the restroom cleaning.

The problem is that the other 10% are important too.

[–] lvxferre@lemmy.ml 6 points 2 years ago (2 children)

Not quite. I'm focusing on chatbots like Bard, ChatGPT and the likes, and their technology (LLM, or large language model).

At the core those LLMs work like this: they pick words, split them into "tokens", and then perform a few operations on those tokens, across multiple layers. But at the end of the day they still work with the words themselves, not with the meaning being encoded by those words.

What I want is an LLM that assigns multiple meanings for those words, and performs the operations above on the meaning itself. In other words the LLM would actually understand you, not just chain words.

[–] lvxferre@lemmy.ml 1 points 2 years ago

Yup, that's the stuff. It's mostly a finishing touch, to get rid of bacteria.

 

The title is click-baity (e.g. what's an "ancient language"?), but the content is still interesting regardless.

A few highlights:

Acc. to the author Great Andamanese ("GA") speakers have been culturally isolated from speakers of other languages for millenniums. It has been described as a dialect continuum, but it's being replaced by Hindi and showing clear signs of language death.

What's unusual about its usage of body parts in the grammar is not the usage itself, but its frequency and how it does it. Excerpts from the article:

If the blood emerged from the feet or legs, it was otei; internal bleeding was etei; and a clot on the skin was ertei. Something as basic as a noun changed form depending on location.

My breakthrough was to realize that the prefix e-, which originally derived from an unknown word for an internal body part, had over eons morphed into a grammatical marker signifying any internal attribute, process or activity. So the act of seeing, ole, being an internal activity, had to be eole. The same prefix could be attached to -bungoi, or “beautiful,” to form ebungoi, meaning internally beautiful or kind; to sare, for “sea,” to form esare, or “salty,” an inherent quality; and to the root word -biinye, “thinking,” to yield ebiinye, “to think.”

 

For me it's pic related. Fairly cheap wine from my chunk of the Southern Cone; I pay the equivalent of two and half euros/dollars for it. And yet it's so good - fruity, but not sweet. It rolls so well on the tongue. Perfect with some head cheese or salami.

The grape is locally known as "Bordô" or "Terci" but you'll find better info about it by its posh name "Ives Noir". Annoying grape; not as much as Pinot Noir (aka The Devil's Grape), but it's rather sensitive.

 

Here's Johnny~

 

This should be a great resource for beginners interested on Historical Linguistics, specially Indo-European studies. Each "lesson" provides a few pieces of information about either one Indo-European language, focusing mostly on historical and cultural information, but they provide a lot of links for people willing to understand one or more languages better.

(And yes, the banner of this community was proudly stolen from that page.)

 

This paper is '98 and it contrasts the then prevailing theories, in contrast with dialectal and historical evidence, arguing that the origin of periphrastic "do" was a habitual aspect marker.

Two of the earlier hypotheses that the author addresses and criticise:

Contact with Celtic languages - the feature would be borrowed from other languages that English interacted with. Specially prominent due to distribution, as do-periphrasis appeared first on the Western dialects. However unlikely, given that Celtic substratum influence in English was relatively minor.

Some invoke more complex pathways, such as a potential early Germanic-Celtic creolisation; the author claims that this is unattested.

Causative 'do' - occasionally attested in Old English, and frequently in Middle English. I'll adapt the 5a example to highlight the construction:

  • I do to-you know[=witan]... that those devil-idols to-you are harm-bearing

Here the usage of "do" would initially mean something like "make", "cause to", "have". For another example [from my own], consider "I did her tell me what was going on" - the "do" has some meaning but it's rather messy, and dependent on the sentence. (Does that "did" mean "encouraged?" "forced?" "asked?")

The author sees the following problems with this hypothesis:

  • Origin - do-periphrasis originated in the Western dialects, but those were the one that used causative do the least.
  • Motivation - it's harder to claim that an optional causative "do", with no independent semantic value, would eventually evolve into the do-support currently used.

Other hypotheses addressed were the usage of 'do' as a perfective aspect marker and verbal ellipsis. And then the author actually addresses the hypothesis he believes to be correct, linking current do-support to the habitual aspect; for example, in the sentence "I do browse Lemmy", that "do" can be understood as both an emphasiser and as conveying "by rule, usually".

 

Hey everyone, new mod here. I'd like to hear you on a few things, in order to make this community grow:

1. Who should be the primary target audience of this community?

We could tailor it primarily for layperson or for people with deeper Linguistics knowledge. Or we could simply let it roll.

2. Which type of moderation do you guys like? Stricter or laxer?

A stricter moderation would include rules like "quote your sources", "no crack theories" (proto- or pseudo-scientific hypotheses lacking methodological rigour), stuff like this; it would also mean that I'd discourage off-topic a bit further.

3. "Almost no crown or cross" rule: yes, no, indifferent?

By "almost no cross or cross" I mean that posters would only be able to talk about politics and religion as much as necessary for the subject of Linguistics. For example you'd be still fine posting something like this, but you wouldn't be able to discuss here the Marxist side of the matter, only the Linguistic one. Just an example, mind you.

4. How much do you know about Linguistics?

Are you a grad, undergrad, informed layperson, or just curious? Are there areas that you feel confident on, like Sociolinguistics or Phonetics or something like this?

5. Which type of content do you want to see here?

Papers? Videos? Discussions? Historical Linguistics? Sociolinguistics? Phonetics and Phonology? Since mods are IMO responsible to nurture a community, I don't mind looking for stuff to post here, but I'd like to know which one.

Thank you!

EDIT: I'm reading all your comments, even the ones that I didn't reply to, OK?

 

I just posted a paper, but I think that this should be more approachable for people here. It shows a rather interesting pattern between Germanic languages (English, German, Icelandic, Gothic...) and most other Indo-European languages, caused by a sound change. A few examples using Latin vs. English:

  • pēs (foot) vs. foot
  • trēs (three) vs. three
  • canis (dog) vs. hound; see German "Hund" dog for reference
  • decem (ten) vs. ten
  • gelū (ice) vs. cold
  • frāter (brother) vs. brother

Note how the consonants look like they went a "merry-go-round" from one language to another:

  • Latin fricative vs. English voiced stop
  • Latin voiced stop vs. English voiceless stop
  • Latin voiceless stop vs. English fricative

That's all caused by the regular sound changes explained in the Wiki link.

 

This paper from Silvia Luraghi explores the origin of PIE grammatical gender system, as well as proposing how it appeared in the language, in a way that accounts for the following discrepancy:

  • Hittite - two genders system: animate and inanimate
  • Late PIE - typically three genders system: masculine, feminine, and neuter

The animate and inanimate genders would've been inherited by late PIE as the masculine and neuter genders respectively, while the feminine would be the result of a derivational suffix *-h₂ being attached to words, and eventually triggering agreement. Note that the typical IE feminine /a/ (you see it in Latin/Romance and Slavic languages, for example) is believed to be from *h₂, as it's the a-colouring laryngeal.

I know that this paper might be a bit too deep for most folks here to parse, so if you feel intimidated, don't be afraid to ask for help.

 

The sole current moderator of that community is banned, and I wish to see that community thrive. That's it.

 

Inflammatory picture obviously related.

A lot of deletion/edition tools rely on the API in order to work. As such, they'll stop working on the 1st of July, as the API price changes start being enforced. Better do it now than never.

In my humble opinion, the easiest of them to use would be Power Delete Suite. [See EDIT 2] It allows you to export your content if you so desire.

There's also dessalines' Reddit-History (yup, the Lemmy admin!) that helps you to export your stuff, in case that you want to do it in a more fine-grained way.

There are other tools to do it, however, so pick the one that you like the most.

Should I delete my content? Or edit it?

Up to you. I think that it simple deletion is enough.

However, if you want to edit your content, keep in mind that Reddit likely wants to capitalise on data model training, so pick one of the two choices:

1. Blabber.

Random words, in an agrammatical word order, devoid of any sort of meaning. Introducing noise on the data models makes the data from Reddit considerably less reliable.

If you feel specially uninspired, here's a random word generator. You could also try Zompist's gen, it's a tool intended for people constructing languages, but it's damn good to generate non-linguistic babble.

2. Some message against Reddit.

Don't use swear words, as those might get filtered. Something as simple and short as "potatoes are great because Reddit is a scummy company" goes a long way.

That would also help to pinpoint which companies are doing businesses with Reddit, and pressure them to stop doing it.

Noooo! Think on all the helpful threads!

You can't make an omelette without breaking some eggs. While it's true that some information will be lost (less if you back up your content elsewhere), overall you're doing better for the internet by not playing along a clearly hostile agent in it.

EDIT:

Thinking about Reddit Inc.'s strategy to get money, perhaps it is better if you replace the content of your comments with random babble than if you simply delete it, if you want them to lose money.

Random babble is not just less useful for those large language models; it's outright poisonous, it's worth less than nothing. I might be wrong but I think that it decreases the value of the platform even further than plain deletion.

(EDIT 2 REASON: replacing j0be's Power Delete Suite link with one for a more updated version, that avoids a bug where posts/comments remain unedited.)

 

Popcorn, anyone?

view more: ‹ prev next ›