Qwen

50 readers
1 users here now

A community all about the Qwens! (LLMs, VLMs, WANs...)

Here their blog page and their free chat interface

Post are allowed to have any format.

It is advised to put "Qwen" into the title somewhere.

Da Rules

  1. please be nice <3 ๐Ÿงธ
  2. no bigotry or general evil-doings please! ๐Ÿ’–
  3. no politics ๐ŸŒโŒ
  4. please don't make me add more rules <3

founded 2 weeks ago
MODERATORS
1
8
new qwen architecture? :o (lemmy.blahaj.zone)
submitted 1 week ago* (last edited 1 week ago) by Smorty@lemmy.blahaj.zone to c/qwen@lemmy.blahaj.zone
 
 

i jus wanted to get dis outta my system >v< ...

i dun like those boring linear model structures... they work... bt they dun look fun, nor intuitive. they jus produce output... which is boring!

pls, if some researcher with lotsa gpus sees this, maybsies try this kinda architecture... u dont evn have to credit me, just try it out n see where it goes ~ ~ ~

2
 
 

It's okay to mess up! We all do.

We improve our overthinking problems just like how Qwen optimizes its CoTs!

But when you give it a hard problem without the right tools for the job, well... A human wouldn't do good on that either..

3