submitted 1 month ago by Cr4yfish@lemmy.world to c/opensource@lemmy.ml
[-] iopq@lemmy.world 63 points 1 month ago

I have some suggestions: let's not make people translate to English unless they are learning English. I don't want to be thinking about whether "I'm coming Friday" is correct grammar in English. I want to be thinking about my target language!

[-] Cr4yfish@lemmy.world 33 points 1 month ago* (last edited 1 month ago)

Thanks for the suggestion, I’ll definitely try to make the app as language inclusive as possible!

Also, sorry if I was too vague with the post title. The app is only similar to Duolingo in structure and idea; it’s not specific to language learning and is meant to cater to any subject, really.

For example, I personally use it to study for my university subjects.

[-] OsrsNeedsF2P@lemmy.ml 13 points 1 month ago

This app seems to be about any generic courses, not just language learning. So someone can make a language course in the way you've described

[-] maniel@sopuli.xyz 11 points 1 month ago* (last edited 1 month ago)

Yeah, it's my minor pet peeve with Duolingo. My language doesn't have or need articles like "the" or "a", so I often forget them. It's so annoying to fail because of such a minor thing, especially when their suggested English often looks terrible.

[-] foster@lemmy.fosterhangdaan.com 19 points 1 month ago
[-] eutampieri@feddit.it 52 points 1 month ago* (last edited 1 month ago)

In case OP doesn’t know, if a repo hasn’t got a licence it’s implied it’s licensed under "all rights reserved", so not open source! You need to https://choosealicense.com

[-] Cr4yfish@lemmy.world 46 points 1 month ago

it’s implied it’s licensed under "all rights reserved", so not open source!

Oh, I actually did not know that. I'll try to remember to add a license right from the get-go from now on, thanks :)

[-] wiki_me@lemmy.ml 23 points 1 month ago

I think you want to use AGPL. Under the plain GPL, people can still run a closed-source website built on your project because of the ASP loophole.

[-] Cr4yfish@lemmy.world 17 points 1 month ago

Yeah you're right. I switched it to AGPL.

[-] rmuk@feddit.uk 8 points 1 month ago

Ah, yes, GPLv3, exquisite choice.

[-] eutampieri@feddit.it 3 points 1 month ago
[-] Cr4yfish@lemmy.world 17 points 1 month ago

It's GPLv3 now.

[-] Cr4yfish@lemmy.world 5 points 1 month ago

oh, right. Forget that every time. I'll add one.

[-] 01189998819991197253@infosec.pub 17 points 1 month ago* (last edited 1 month ago)

This is a really great use of an LLM! Seriously great job! Once it's fully self-hostable (including the LLM itself), I will absolutely find space for it on the home server. Maybe using rupeshs' fastsdcpu as the model and generation backend could work. I don't remember what its license is, though.

Edit: added link.

[-] Cr4yfish@lemmy.world 10 points 1 month ago

Thanks! I'm already eyeing ollama for this.
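For readers wondering what that could involve, here is a minimal sketch of calling a local Ollama server over its HTTP API. It assumes Ollama is running on its default port (11434); the function names are made up for illustration and are not taken from the app's code.

```typescript
// Assumption: an Ollama server is running locally on its default port.
const OLLAMA_URL = "http://localhost:11434/api/generate";

interface OllamaRequest {
  model: string;   // name of a model that has already been pulled locally
  prompt: string;  // the text prompt to complete
  stream: boolean; // false = return a single JSON object instead of a stream
}

// Build the request body. Kept separate so it can be inspected without a server.
function buildOllamaRequest(model: string, prompt: string): OllamaRequest {
  return { model, prompt, stream: false };
}

// Send the prompt to the local server and return the generated text.
async function generateLocally(model: string, prompt: string): Promise<string> {
  const res = await fetch(OLLAMA_URL, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(buildOllamaRequest(model, prompt)),
  });
  if (!res.ok) {
    throw new Error(`Ollama request failed with status ${res.status}`);
  }
  const data = (await res.json()) as { response: string };
  return data.response;
}
```

Usage would look something like `await generateLocally("llama3", "Summarize this text: ...")`, where the model name is only an example. Since nothing leaves the machine, this would also address the data-privacy concern about sending documents to a hosted model.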

[-] AliasAKA@lemmy.world 13 points 1 month ago

Is there any interest in getting local models to run using this? I’d rather not use Gemini, and then all the data can reside locally (and not require a login).

I’d be happy to work on this, though I’m a python developer not a typescript one.

[-] Cr4yfish@lemmy.world 6 points 1 month ago

Yeah, good idea. It's possible to do that with WebLLM & Langchain. Once Langchain is integrated, it's kinda similar to the Python Version so should be do-able I think.

[-] GravitySpoiled@lemmy.ml 12 points 1 month ago* (last edited 1 month ago)

That sounds cool! Is there already a release? If not, don't rush it :)

Edit: never mind. I just saw the website 😅

[-] Cr4yfish@lemmy.world 10 points 1 month ago

Thanks :). Yeah, it's publicly accessible: nouv.app/. I use it daily already but it still has tons of bugs.

[-] jjlinux@lemmy.ml 2 points 1 month ago

Unless I turn off the "Always use secure connections" setting, it breaks on the cert.

[-] Cr4yfish@lemmy.world 5 points 1 month ago

Hm that's very weird. I can't replicate it and I used some random SSL checker website and it checks out as well.

Really not sure why that's happening.

[-] rmuk@feddit.uk 4 points 1 month ago

WFM. Looks like you're using Let's Encrypt, which is fine, and everything seems to be consistent. I think you're good.

[-] jjlinux@lemmy.ml 3 points 1 month ago

It's a great looking site at first glance (haven't signed up yet). I just sandboxed a browser and let it run without forcing HTTPS. Funny thing is that it does show it as being https when disabling https enforcement.

I'll take it for a spin this afternoon when I get back home (or on my phone when I get bored at the recital my wife is forcing me to go to 🤣🤣🤣).

[-] silmathoron@floss.social 9 points 1 month ago* (last edited 1 month ago)

@Cr4yfish nice project 🙂
I'm a bit worried about the AI part, though, as you'd want an app whose main purpose is "learning" to guarantee, if not the reliability of the material (since anyone can contribute), at least the reliability of the course generation process that it proposes.
As far as I know, this is not possible with current generative AI tools, so what's your plan to make sure hallucinations do not creep in?

[-] Cr4yfish@lemmy.world 4 points 1 month ago

Thanks. My general strategy for reducing hallucinations with GenAI is to not give it the task of making stuff up, but to have it work only on existing text. That's why I'm not allowing users to create content without source material.

However, LLMs will be LLMs. I've been testing it a lot and have already found multiple hallucinations. I built in a reporting system, although only reporting stuff works right now, not viewing reported questions.
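To make the reporting idea concrete, here is a minimal sketch of a report store that covers both halves: filing a report and viewing the most-reported questions. Everything here is hypothetical (the class, its fields, and the in-memory storage); the real app presumably persists reports in its database.

```typescript
// Hypothetical shape of a single report; field names are assumptions.
interface QuestionReport {
  questionId: string;
  reason: string;
  reportedAt: Date;
}

// In-memory stand-in for whatever storage the app actually uses.
class ReportStore {
  private reports = new Map<string, QuestionReport[]>();

  // The "reporting" half: record that a question looks wrong.
  report(questionId: string, reason: string): void {
    const list = this.reports.get(questionId) ?? [];
    list.push({ questionId, reason, reportedAt: new Date() });
    this.reports.set(questionId, list);
  }

  // The "viewing" half: questions sorted by report count, most reported first.
  mostReported(): { questionId: string; count: number }[] {
    return Array.from(this.reports.entries())
      .map(([questionId, list]) => ({ questionId, count: list.length }))
      .sort((a, b) => b.count - a.count);
  }
}
```

Sorting by report count is just one way to surface likely hallucinations first; a maintainer could then review the top entries against the source document.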

That's my short-term plan to get good content quality, at least. I also want to move away from Vercel AI & Gemini to a Langchain Agent system or Graph maybe, which should increase the output quality.

Maybe in some parallel Universe this really takes off and many people work on high quality Courses together...

[-] foremanguy92_@lemmy.ml 9 points 1 month ago
[-] Cr4yfish@lemmy.world 3 points 1 month ago
[-] foremanguy92_@lemmy.ml 3 points 1 month ago

Hope to see it growing the best way

[-] Auster@thebrainbin.org 6 points 1 month ago
[-] Cr4yfish@lemmy.world 6 points 1 month ago

The UI mostly works offline once loaded in due to aggressive caching. Downloading Course Content was on the initial Roadmap but I removed it since I wasn't sure if anyone would like the feature.
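As an illustration of why caching makes the UI usable offline, here is a minimal cache-first sketch. It is not the app's actual caching code: a plain `Map` stands in for the browser cache, and `cacheFirst` and `Fetcher` are names invented for this example.

```typescript
// A function that loads a resource by URL (for example, a wrapper around fetch).
type Fetcher = (url: string) => Promise<string>;

// Wrap a fetcher so that anything loaded once is served from the cache afterwards.
function cacheFirst(fetcher: Fetcher, cache: Map<string, string> = new Map()): Fetcher {
  return async (url: string): Promise<string> => {
    const hit = cache.get(url);
    if (hit !== undefined) {
      return hit; // served without touching the network
    }
    const fresh = await fetcher(url); // throws if offline and not yet cached
    cache.set(url, fresh);
    return fresh;
  };
}
```

Anything that was loaded while online keeps working when the network is gone, but resources never requested before still fail, which is why a real offline mode needs an explicit download step.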

Syncing stuff is a real pain in the ass but I'll implement it if at least a couple people want it.

[-] nichtburningturtle@feddit.org 10 points 1 month ago

An offline mode would definitely be something I would want, tho it isn't high priority.

[-] Cr4yfish@lemmy.world 8 points 1 month ago

I added it back to the roadmap :).

[-] Auster@thebrainbin.org 4 points 1 month ago

I don't know how much of a subset I am, but I still use dictionary software from the Windows 95~2000 era and Android software on a completely offline and vanilla VM, partly due to internet randomly going bad, and partly because I am neurotic about digital content vanishing once support ends.

[-] Cr4yfish@lemmy.world 3 points 1 month ago

Understandable. I added a proper offline mode back to the Roadmap on github.

[-] nichtburningturtle@feddit.org 5 points 1 month ago

How does the level creation from a pdf work and does it support languages other than English?

[-] Cr4yfish@lemmy.world 8 points 1 month ago

I use Gemini, which supports PDF File uploads, combined with structured outputs to generate Course Sections, Levels & Question JSON.
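To give an idea of what "structured output" means here, below is a sketch of a level/question shape and a validator for the JSON a model returns. The field names are purely hypothetical and are not the app's real schema; the point is only that the output is checked against a fixed shape before it is used.

```typescript
// Hypothetical shapes; the real schema in the app may look quite different.
interface Question {
  prompt: string;
  options: string[];
  correctIndex: number; // index into options
}

interface Level {
  title: string;
  questions: Question[];
}

function isQuestion(value: unknown): value is Question {
  if (typeof value !== "object" || value === null) return false;
  const record = value as Record<string, unknown>;
  const options = record["options"];
  const correctIndex = record["correctIndex"];
  if (typeof record["prompt"] !== "string") return false;
  if (!Array.isArray(options) || !options.every((o) => typeof o === "string")) return false;
  if (typeof correctIndex !== "number" || !Number.isInteger(correctIndex)) return false;
  // The correct answer must point at one of the options.
  return correctIndex >= 0 && correctIndex < options.length;
}

function isLevel(value: unknown): value is Level {
  if (typeof value !== "object" || value === null) return false;
  const record = value as Record<string, unknown>;
  const questions = record["questions"];
  return typeof record["title"] === "string" && Array.isArray(questions) && questions.every(isQuestion);
}

// Parse model output and reject anything that does not match the expected shape.
function parseLevels(json: string): Level[] {
  const parsed: unknown = JSON.parse(json);
  if (!Array.isArray(parsed) || !parsed.every(isLevel)) {
    throw new Error("Model output does not match the expected level schema");
  }
  return parsed;
}
```

A check like this catches malformed output (for example, a correct answer that points at a non-existent option), but it cannot tell whether the content itself is factually right.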

When you upload a PDF, it first gets uploaded to an S3 bucket directly from the Browser, and the Browser then sends the Filename and other data to the Server. The Server then downloads that Document from S3 and sends it to Gemini, and the resulting JSON is streamed back to the Browser. After that, the PDF is permanently deleted from S3.
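The lifecycle described above can be sketched roughly as follows. This is a simplified model, not the app's code: `DocumentBucket` is an in-memory stand-in for S3, `Model` is a stand-in for the Gemini call, and deleting the document in a `finally` block (so it is removed even if generation fails) is an assumption of this sketch.

```typescript
// In-memory stand-in for the S3 bucket (hypothetical).
class DocumentBucket {
  private objects = new Map<string, Uint8Array>();

  put(name: string, data: Uint8Array): void {
    this.objects.set(name, data);
  }
  get(name: string): Uint8Array | undefined {
    return this.objects.get(name);
  }
  delete(name: string): void {
    this.objects.delete(name);
  }
  has(name: string): boolean {
    return this.objects.has(name);
  }
}

// Stand-in for the model call: receives the document, emits JSON chunks.
type Model = (doc: Uint8Array, onChunk: (chunk: string) => void) => Promise<void>;

// Server side: fetch the uploaded document, stream the result, then delete the upload.
async function processUpload(
  bucket: DocumentBucket,
  filename: string,
  model: Model,
  onChunk: (chunk: string) => void
): Promise<void> {
  const doc = bucket.get(filename);
  if (doc === undefined) {
    throw new Error(`No such document: ${filename}`);
  }
  try {
    await model(doc, onChunk); // chunks are forwarded as they arrive
  } finally {
    bucket.delete(filename); // the document never outlives the request
  }
}
```

Uploading straight from the browser to the bucket keeps large files out of the server's request body; the server only ever receives the filename and fetches the document itself.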

Data Privacy wise, I wouldn't upload anything sensitive since idk what Google does with PDFs uploaded to Gemini.

The Prompts are in English, so the output language is English as well. However, I actually only tested it with German Lecture PDFs myself.

So, yes, it probably works with any language that Gemini supports.

Here is the Source Code for the core function for this feature:

// Vercel AI SDK imports. The helpers used below (downloadObject, createLevelPrompt,
// pdfUserMessage, pdfAttatchment, htmlUserMessage, multipleLevelSchema) are
// project-local and imported from elsewhere in the repo.
import { streamObject, type UserContent } from "ai";
import { createGoogleGenerativeAI } from "@ai-sdk/google";

export async function createLevelFromDocument(
    { docName, apiKey, numLevels, courseSectionTitle, courseSectionDescription }: 
    { docName: string, apiKey: string, numLevels: number, courseSectionTitle: string, courseSectionDescription: string }) 
    {
    
    // A course section only counts as provided if both title and description are set
    const hasCourseSection = courseSectionTitle.length > 0 && courseSectionDescription.length > 0;

    // Step 1: Download the document from the bucket and get a buffer from it
    const blob = await downloadObject({ filename: docName, path: "/", bucketName: "documents" });
    const arrayBuffer = await blob.arrayBuffer();
    
    // Step 2: Create the model provider with the user's API key
    //const openai = createOpenAI({ apiKey: apiKey });
    const google = createGoogleGenerativeAI({ apiKey: apiKey });

    const courseSectionsPrompt = createLevelPrompt({ hasCourseSection, title: courseSectionTitle, description: courseSectionDescription });
    
    const isPDF = docName.endsWith(".pdf");

    const content: UserContent = [];

    // Step 3: Build the user message. PDFs are attached as a file,
    // anything else is treated as HTML and inlined as text.
    if (isPDF) {
        content.push(pdfUserMessage(numLevels, courseSectionsPrompt) as any);
        content.push(pdfAttatchment(arrayBuffer) as any);
    } else {
        const html = await blob.text();
        content.push(htmlUserMessage(numLevels, courseSectionsPrompt, html) as any);
    }

    // Step 4: Call the model and stream back JSON that matches the schema
    const result = await streamObject({ 
        model: google("gemini-1.5-flash"),
        schema: multipleLevelSchema,
        messages: [
            {
                role: "user",
                content: content
            }
        ]
    });

    return result;
}
[-] trashgirlfriend@lemmy.world 5 points 1 month ago
[-] Cr4yfish@lemmy.world 6 points 1 month ago

Haha. Well we can't all actually be Duolingo and employ people to create the courses :D

[-] trashgirlfriend@lemmy.world 8 points 1 month ago

It's all just flashcards with extra steps and anki already exists. /shrug

[-] Robust_Mirror@aussie.zone 3 points 1 month ago

I've made custom flashcards for anki to study stuff and I tested this for some similar things and it was a lot faster and easier. Anki feels like it takes forever so the investment to make a custom set is only worth it for things you need to study for a long time.

If all you want is to generate a bunch of flashcards fast and you have a pdf with the info presented clearly it's an easy method.

[-] grapemix@lemmy.ml 4 points 1 month ago

Is it for self-host ppl too?

For all projects/apps, I am looking for OIDC, S3 and PgSQL. It's easier to implement these features earlier and these features will make any projects more popular in the self host community.

this post was submitted on 03 Nov 2024
394 points (98.0% liked)
