Technology

72646 readers

5105 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

958

AI agents wrong ~70% of time: Carnegie Mellon study (www.theregister.com)

submitted 3 days ago by eli001@lemmy.world to c/technology@lemmy.world

284 comments fedilink hide all child comments

(page 5) 50 comments

sorted by: hot top controversial new old

[–] SocialMediaRefugee@lemmy.world 1 points 2 days ago* (last edited 2 days ago)

I use it for very specific tasks and give as much information as possible. I usually have to give it more feedback to get to the desired goal. For instance I will ask it how to resolve an error message. I've even asked it for some short python code. I almost always get good feedback when doing that. Asking it about basic facts works too like science questions.

One thing I have had problems with is if the error is sort of an oddball it will give me suggestions that don't work with my OS/app version even though I gave it that info. Then I give it feedback and eventually it will loop back to its original suggestions, so it couldn't come up with an answer.

I've also found differences in chatgpt vs MS copilot with chatgpt usually being better results.

[–] brown567@sh.itjust.works 5 points 3 days ago

70% seems pretty optimistic based on my experience...

[–] Ileftreddit@lemmy.world 1 points 2 days ago

Hey I went there

[–] lmagitem@lemmy.zip 2 points 3 days ago

Color me surprised

[–] dan69@lemmy.world -1 points 2 days ago

And it won’t be until humans can agree on what’s a fact and true vs not.. there is always someone or some group spreading mis/dis-information

[–] dylanmorgan@slrpnk.net 2 points 3 days ago (1 children)

Claude why did you make me an appointment with a gynecologist? I need an appointment with my neurologist, I’m a man and I have Parkinson’s.

load more comments (1 replies)

[–] NuXCOM_90Percent@lemmy.zip 2 points 3 days ago

While I do hope this leads to a pushback on "I just put all our corporate secrets into chatgpt":

In the before times, people got their answers from stack overflow... or fricking youtube. And those are also wrong VERY VERY VERY often. Which is one of the biggest problems. The illegally scraped training data is from humans and humans are stupid.

[–] lemmy_outta_here@lemmy.world 2 points 3 days ago

Rookie numbers! Let’s pump them up!

To match their tech bro hypers, the should be wrong at least 90% of the time.

load more comments