this post was submitted on 03 Sep 2025
2 points (100.0% liked)

Ada

58 readers
3 users here now

Ada programming language. For memory safe multi task programming, elegant embedded bit fiddling and everything else in a readable way

founded 4 months ago
MODERATORS
all 3 comments
sorted by: hot top controversial new old
[โ€“] brucethemoose@lemmy.world 1 points 1 week ago* (last edited 1 week ago)

I wonder what the errors were?

This seems like a much better task for a finetune with constrained decoding (with is not possible with OpenAI), though I'm not sure how much 'thinking' about the function needs to be done here.

[โ€“] gnufuu@infosec.pub 1 points 1 week ago

The performance of Marmaragan with GPT-4o on the benchmark is promising, with correct annotations having been generated for 50.7% of the benchmark cases. The results establish a foundation for future work on combining the power of LLMs with the reliability of formal software verification.

That's basically a coin flip. Even if you made it less successful a user would have higher chances of guessing correctly whether an annotation is true of false.