submitted 6 days ago by yogthos@lemmy.ml to c/opensource@lemmy.ml
[-] utopiah@lemmy.ml 46 points 6 days ago* (last edited 5 days ago)

FWIW, if you are interested in such tooling, also consider soffice and pandoc, which have (as far as I can tell) similar features but have existed for years and are not related to Microsoft.

Edit: not related to Microsoft AND Google, seems the transcription aspect (which IMHO is still weird in that context but OK) is done via Google servers, cf https://lemmy.ml/post/23629310/15586865

[-] haverholm@kbin.earth 7 points 6 days ago

The single exception to this (which is actually buried fairly deep in the feature list) is the audio transcription tool. I didn't take a closer look at what is used to perform this, but at least it's not "just" document conversion like pandoc.

[-] utopiah@lemmy.ml 5 points 5 days ago

audio transcription tool

Thanks for the clarification but I'm a bit confused here, like audio transcription, STT, done by e.g. Whisper? If so what's the use case? When I think of Office documents audio transcription is not something I have in mind.

[-] utopiah@lemmy.ml 3 points 5 days ago
[-] JackbyDev@programming.dev 1 points 5 days ago

You should open a fresh issue for questions like that instead of asking on an unrelated one.

[-] haverholm@kbin.earth 2 points 5 days ago

I'm not completely clear either on how Microsoft have implemented this previously. As I said, I didn't look very deep into the repository.

If these are indeed other Python projects they piled together, as others suggest, I'd be happy to hear what speech recognition library this might've been built on.

[-] davel@lemmy.ml 11 points 6 days ago* (last edited 2 days ago)

Huh, Beautiful Soup is still relevant. I was using it twenty years ago when it first came out.

[-] charles@lemmy.ca 1 points 2 days ago

FYI the link in your comment got cut off before the last bracket so it's not linking to the wiki page directly.

[-] davel@lemmy.ml 2 points 2 days ago

Fixed, thanks. Though it's 4 days later, so I'm not sure it will help anyone 🤷

[-] ksynwa@lemmygrad.ml 8 points 6 days ago

This could be useful to me. A while ago I was trying to make something that takes all unread posts from my feed reader, makes an epub out of them and then puts it behind an OPDS server.

I found converting HTML from RSS to first markdown and then compiling them to an epub the most reliable way to take out the unnecessary markup from the source HTML. I used pandoc for this.
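For anyone wanting to try the same HTML to Markdown to EPUB route, here is a rough sketch of how it could be scripted. It is not the setup described above, just one way to do it, assuming pandoc is on the PATH and each post has already been saved as its own HTML file. The function names, paths and title are made up for illustration; only the pandoc options (`--from`, `--to`, `--output`, `--metadata`) are real.

```python
import subprocess
from pathlib import Path


def html_to_markdown_cmd(html_path: Path, md_path: Path) -> list[str]:
    """Build the pandoc call that converts one saved HTML post to Markdown."""
    return ["pandoc", "--from", "html", "--to", "markdown",
            "--output", str(md_path), str(html_path)]


def markdown_to_epub_cmd(md_paths: list[Path], epub_path: Path, title: str) -> list[str]:
    """Build the pandoc call that compiles several Markdown files into one EPUB."""
    return ["pandoc", "--from", "markdown", "--to", "epub",
            "--metadata", f"title={title}",
            "--output", str(epub_path), *[str(p) for p in md_paths]]


def build_epub(html_paths, workdir: Path, epub_path: Path, title: str,
               run=subprocess.run):
    """Convert each HTML file to Markdown, then bundle the results into an EPUB.

    `run` is injectable so the pipeline can be dry-run without pandoc installed.
    Returns the list of intermediate Markdown paths.
    """
    md_paths = []
    for html_path in html_paths:
        # Hypothetical naming scheme: post1.html -> <workdir>/post1.md
        md_path = workdir / (html_path.stem + ".md")
        run(html_to_markdown_cmd(html_path, md_path), check=True)
        md_paths.append(md_path)
    run(markdown_to_epub_cmd(md_paths, epub_path, title), check=True)
    return md_paths
```

Going through Markdown first is what drops most of the source markup; the second pandoc call only ever sees the simplified text. Calling `build_epub([Path("feed/post1.html")], Path("out"), Path("out/unread.epub"), "Unread posts")` would run pandoc twice, assuming the `out` directory already exists.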

[-] utopiah@lemmy.ml 4 points 5 days ago

I used pandoc for this.

Please come back and share whether it does better or worse, and if so along which dimensions. Quite curious to better understand the differences.

[-] yogthos@lemmy.ml 4 points 6 days ago

oh yeah that's definitely a good use case

this post was submitted on 16 Dec 2024
112 points (95.2% liked)
