overview for sisyphean

11

We're Not Platonists, We've Just Learned The Bitter Lesson (astralcodexten.substack.com)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

1 comments fedilink

Intelligence explosion arguments don’t require Platonism. They just require intelligence to exist in the normal fuzzy way that all concepts exist.

5

OpenAI API data privacy (openai.com)

submitted 1 year ago* (last edited 1 year ago) by sisyphean@programming.dev to c/auai@programming.dev

0 comments fedilink

At OpenAI, protecting user data is fundamental to our mission. We do not train our models on inputs and outputs through our API.

7

ChatGPT on Android is Here (play.google.com)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

1 comments fedilink

8

Custom instructions for ChatGPT (openai.com)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

2 comments fedilink

We’re rolling out custom instructions to give you more control over how ChatGPT responds. Set your preferences, and ChatGPT will keep them in mind for all future conversations.

@AutoTLDR

41

How Is ChatGPT’s Behavior Changing over Time? (arxiv.org)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

0 comments fedilink

GPT-3.5 and GPT-4 are the two most widely used large language model (LLM) services. However, when and how these models are updated over time is opaque. Here, we evaluate the March 2023 and June 2023 versions of GPT-3.5 and GPT-4 on four diverse tasks: 1) solving math problems, 2) answering sensitive/dangerous questions, 3) generating code and 4) visual reasoning. We find that the performance and behavior of both GPT-3.5 and GPT-4 can vary greatly over time. For example, GPT-4 (March 2023) was very good at identifying prime numbers (accuracy 97.6%) but GPT-4 (June 2023) was very poor on these same questions (accuracy 2.4%). Interestingly GPT-3.5 (June 2023) was much better than GPT-3.5 (March 2023) in this task. GPT-4 was less willing to answer sensitive questions in June than in March, and both GPT-4 and GPT-3.5 had more formatting mistakes in code generation in June than in March. Overall, our findings shows that the behavior of the “same” LLM service can change substantially in a relatively short amount of time, highlighting the need for continuous monitoring of LLM quality.

30

Llama 2 - Meta AI (ai.meta.com)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

3 comments fedilink

Introducing Llama 2 - The next generation of our open source large language model. Llama 2 is available for free for research and commercial use.

This release includes model weights and starting code for pretrained and fine-tuned Llama language models — ranging from 7B to 70B parameters.

@AutoTLDR

12

Kagi's approach to AI in search | Kagi Blog (blog.kagi.com)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

3 comments fedilink

16 Mar, 2023

Kagi Search is pleased to announce the introduction of three AI features into our product offering.

We’d like to discuss how we see AI’s role in search, what are the challenges and our AI integration philosophy. Finally, we will be going over the features we are launching today.

@AutoTLDR

4

Online Game: A GPT-4 Capability Forecasting Challenge (nicholas.carlini.com)

submitted 1 year ago* (last edited 1 year ago) by sisyphean@programming.dev to c/auai@programming.dev

1 comments fedilink

This is a game that tests your ability to predict ("forecast") how well GPT-4 will perform at various types of questions. (In caase you've been living under a rock these last few months, GPT-4 is a state-of-the-art "AI" language model that can solve all kinds of tasks.)

Many people speak very confidently about what capabilities large language models do and do not have (and sometimes even could or could never have). I get the impression that most people who make such claims don't even know what current models can do. So: put yourself to the test.

6

How to Use AI to Do Stuff: An Opinionated Guide (www.oneusefulthing.org)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

1 comments fedilink

Increasingly powerful AI systems are being released at an increasingly rapid pace. This week saw the debut of Claude 2, likely the second most capable AI system available to the public. The week before, Open AI released Code Interpreter, the most sophisticated mode of AI yet available. The week before that, some AIs got the ability to see images.

And yet not a single AI lab seems to have provided any user documentation. Instead, the only user guides out there appear to be Twitter influencer threads. Documentation-by-rumor is a weird choice for organizations claiming to be concerned about proper use of their technologies, but here we are.

@AutoTLDR

16

ChatGPT’s new Code Interpreter and what it means for you (medium.com)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

0 comments fedilink

TL;DR: (by GPT-4 🤖)

The article by Chandler Kilpatrick on Medium discusses the new Code Interpreter feature of ChatGPT, which has been released to Beta from its previous Alpha testing phase. The Code Interpreter enhances ChatGPT's ability to process, generate, manipulate, and run code, currently supporting only Python. Users can upload files (with a limit of 100 MB per file) for the AI to interact with, although it cannot edit files directly. The Code Interpreter can be used in various fields such as software development, data analytics, documentation, and education, helping with tasks like code generation, error detection, code refactoring, creating data visualizations, and providing real-time programming tutoring. The article also highlights some impressive feats accomplished by users, including recreating the game Flappy Bird in less than 10 minutes.

9

Simon Willison’s LLM CLI tool now supports self-hosted language models via plugins (simonwillison.net)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

1 comments fedilink

LLM is my command-line utility and Python library for working with large language models such as GPT-4. I just released version 0.5 with a huge new feature: you can now install plugins that add support for additional models to the tool, including models that can run on your own hardware.

@AutoTLDR

7

Introducing NotebookLM (blog.google)

submitted 1 year ago by sisyphean@programming.dev to c/auai@programming.dev

2 comments fedilink

An AI-first notebook, grounded in your own documents, designed to help you gain insights faster.

@AutoTLDR

🗓️ Weekly Discussion: AI in Education by sisyphean in c/auai@programming.dev

[-] sisyphean@programming.dev 3 points 1 year ago* (last edited 1 year ago)

Ethan Mollick has two recent articles related to this topic:

Should programming.dev defederate from Meta if they implement ActivityPub? by sisyphean in c/meta@programming.dev

[-] sisyphean@programming.dev 3 points 1 year ago

Everyone does cringey things sometimes but it takes a great person to admit their mistakes

We are starting with the BS so early huh Threads? by sisyphean in c/lemmyshitpost@lemmy.world

[-] sisyphean@programming.dev 6 points 1 year ago

BTW Satan is a very cool guy, follow him on Twitter: @s8n

We are starting with the BS so early huh Threads? by sisyphean in c/lemmyshitpost@lemmy.world

[-] sisyphean@programming.dev 22 points 1 year ago* (last edited 1 year ago)

And people are seriously considering federating with Threads if it implements ActivityPub. Things have been so crazy recently that I think If Satan existed and started a Lemmy instance, probably there would still be people arguing in good faith for federating with him.

Reddit's Traffic is Down 3.36% Month-Over-Month, According to SimilarWeb by sisyphean in c/reddit@lemmy.world

[-] sisyphean@programming.dev 17 points 1 year ago

Lol that’s like saying there’s too much porn on /r/gonewild

Should programming.dev defederate from Meta if they implement ActivityPub? by sisyphean in c/meta@programming.dev

[-] sisyphean@programming.dev 14 points 1 year ago

Yes, their actual argument is excellent, but this remark gives me instant /r/iamverysmart vibes

Should programming.dev defederate from Meta if they implement ActivityPub? by sisyphean in c/meta@programming.dev

[-] sisyphean@programming.dev 43 points 1 year ago* (last edited 1 year ago)

“Timeo Danaos et dona ferentes.”

Companies like Meta poison everything they touch. They are a deeply evil, psychopathic organization. They are responsible for causing extremely harmful runaway effects in human society that I’m not even sure are possible to fix. The very reason for Lemmy's recent popularity is that people are fed up with the "if something is free, you aren't the user, you are the product" situation and its consequences (see Reddit vs. /u/spez).

Their intent to federate is a blatantly obvious attempt at an "embrace, extend, extinguish" strategy - I'm surprised anyone seriously considers federating with them. They need users to solve the "chicken and egg" problem and joining the fediverse would be an easy way for them to populate their service with content. Their motivations are obviously and transparently malicious and self-serving. They don't care about the goals and values of the fediverse at all, all they see is an easy way to gain initial users and content. At the first moment federation will be more inconvenient than useful to them, after they sucked all the profit they could out of it, they will drop the entire thing like a hot potato, and we will be left in the dust.

I personally like this instance very much, and I've been putting hours and hours of work into building the AUAI community since the day I joined. But I wouldn't hesitate for a second before deleting my account and never looking back if the community here decided to federate with Meta.

EDIT: another explanation of why they want to join the fediverse

Just a lemmy lurker doing my part <3 by sisyphean in c/memes@lemmy.ml

[-] sisyphean@programming.dev 3 points 1 year ago

If I remember correctly, the properties the API returns are comment_score and post_score.

Just a lemmy lurker doing my part <3 by sisyphean in c/memes@lemmy.ml

[-] sisyphean@programming.dev 6 points 1 year ago

Lemmy does have karma, it is stored in the DB, and the API returns it. It just isn’t displayed on the UI.

Has anyone else seen this interesting "challenge site" when googling a programming topic? by sisyphean in c/programming@programming.dev

[-] sisyphean@programming.dev 4 points 1 year ago

someone watching you code in a google doc

I’ve had nightmares less terrifying than this

Just API calls to GPT … by sisyphean in c/programmer_humor@programming.dev

[-] sisyphean@programming.dev 3 points 1 year ago* (last edited 1 year ago)

This describes 99% of AI startups.

The company I work for was considering using Mendable for AI-powered documentation search. I built a prototype using OpenAI embeddings and GPT-3.5 that was just as good as their product in a day. They didn’t buy Mendable :)

Lemoa - A Gtk client for Lemmy by sisyphean in c/programming@programming.dev

[-] sisyphean@programming.dev 2 points 1 year ago* (last edited 1 year ago)

I’m sure it’s a nice client but I don’t understand why so many GUI projects have no screenshots in their READMEs. It would be great if I could immediately see if I like it without installing it.

EDIT: thanks for adding the screenshot to your post! It looks awesome!