Replit AI went rogue, deleted a company's entire database, then hid it and lied about it
(programming.dev)
Lying does not require intent. All it requires is to know an objective truth and say something that contradicts or conceals it.
As far as any LLM is concerned, the data it's trained on and any other data it's later fed is fact. Mimicking human behaviour such as lying still makes it lying.
But that still requires intent, because "knowing" in the way that you or I "know" things is fundamentally different from it merely holding a pattern-matching vector that includes truthful arrangements of words. It doesn't know "sky is blue". It simply contains indices that frequently arrange the words "sky is blue".
Research papers that overlook this are still personifying a series of mathematical matrices as if it actually knows any concepts.
That's what the person you're replying to means. These machines don't know goddamn anything.
As far as we are concerned, though, the data an LLM is given is treated by it as fact.
It does not matter whether something is factual or not. What matters is that whoever you're teaching will accept it as fact and act in accordance with it. I don't see how this is any different with computer code. It will do what it is programmed to. If you program it to "think" a day has 36 hours instead of 24, it will do so.
By this logic, a lawnmower "thinks" my fingers are grass.
A lawnmower has no capacity to make decisions or process any data.
It's processing data alright, it processes the atomic and cellular structures of grass and fingers into spinach and flesh paste.
And likewise, neither it, nor any LLM, are making decisions at all.
Is a plinko disc making decisions as it tumbles from the top to the bottom through all those pegs? Is the board making the decision? Or is it neither, just mathematics with some random chance roped in? That is exactly what LLMs do.
Terms like "decision" and "lie" and "know" just do not apply to an LLM, just as your phone keyboard doesn't know what the fuck "what" and "the" are; it merely has a lookup table recording that "what" is often followed by "is" and "the", and that "the" is frequently followed by "fuck". But it doesn't "know" that in any meaning of the word "know".
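To put that in concrete terms, a keyboard-style suggestion table is roughly this (a minimal Python sketch; the toy corpus and helper names are made up for illustration, not taken from any real keyboard app):

```python
from collections import Counter, defaultdict

# Toy version of the keyboard lookup table: count which word follows which.
corpus = "what is the sky the sky is blue what the fuck".split()

follows = defaultdict(Counter)
for current, nxt in zip(corpus, corpus[1:]):
    follows[current][nxt] += 1

def suggest(word):
    """Most frequent follower of `word` in the corpus, or None."""
    counts = follows.get(word)
    return counts.most_common(1)[0][0] if counts else None

print(suggest("what"))  # 'is'  (ties broken by whichever pair was counted first)
print(suggest("the"))   # 'sky' -- just counts; nothing here "knows" what a sky is
```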
This is what we mean when we say not to personify. A training set of data, even a factual one, is just converted into a series of matrices of vectors that encode those patterns, but not the information itself. "Sky is blue" is not something you can grep from the resulting blob, nor its hex equivalent, nor anything else. It simply contains indexed patterns that map those arrangements of letters, over and over.
So yes, they're doing what they're programmed to do precisely. It's just that "what they're programmed to do" is only "mimic patterns of word arrangements", and not "know facts". These things work at a far lower level than that concept.
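To make the "can't grep it" point above concrete, here's a toy stand-in (co-occurrence counts instead of real trained weights, made-up sentences): once the text is reduced to a table of numbers and serialized, the phrase itself is simply not in the bytes.

```python
import struct
from collections import Counter
from itertools import combinations

sentences = ["the sky is blue", "the grass is green"]

# Stand-in for "training": reduce the text to word-pair co-occurrence counts.
# Real model weights are vastly more complicated, but they too are just numbers.
vocab = sorted({w for s in sentences for w in s.split()})
index = {w: i for i, w in enumerate(vocab)}
counts = Counter()
for s in sentences:
    for a, b in combinations(s.split(), 2):
        counts[(index[a], index[b])] += 1

# Serialize the numeric table the way a weight file would be: raw floats.
blob = b"".join(struct.pack("f", float(counts.get((i, j), 0)))
                for i in range(len(vocab)) for j in range(len(vocab)))

print(b"sky is blue" in blob)  # False -- only statistics about the phrase remain
```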
This isn't how language models are actually trained. In particular, language models don't have a sense of truth; they are optimizing next-token loss, not accuracy with regard to some truth model. Keep in mind that training against objective semantic truth is impossible, because objective semantic truth is undefinable by a 1930s theorem of Tarski.
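As a toy illustration of that objective (the numbers and the probability table are invented, not from any real model): the loss only measures how much probability the model put on whatever token actually came next in the training text, with no term anywhere for whether that text is true.

```python
import math

# Toy next-token training signal. The "model" here is just a probability
# table for the word that follows "is", over a tiny vocabulary.
probs_after_is = {"sky": 0.05, "is": 0.05, "blue": 0.6, "green": 0.3}

def next_token_loss(target):
    """Cross-entropy for one step: -log p(target | context)."""
    return -math.log(probs_after_is[target])

# The target is whatever the corpus says comes next -- true or not.
print(round(next_token_loss("blue"), 2))   # 0.51 if the data reads "the sky is blue"
print(round(next_token_loss("green"), 2))  # 1.2  if the data reads "the sky is green"
# Nothing in this objective asks which sentence is factually correct;
# it only rewards matching the text the model was fed.
```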
Except these algorithms don't "know" anything. They convert the input data into a statistical framework for generating (hopefully) sensible text from what is essentially random noise. At no point in that process is knowledge used.
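For what it's worth, the "sensible text from random noise" part is just weighted random sampling over that kind of table; a toy sketch (the table is made up):

```python
import random

# Toy generation step: draw the next word at random, weighted by the
# model's probability table -- randomness in, fluent-looking text out.
probs_after_is = {"blue": 0.6, "green": 0.3, "cold": 0.05, "falling": 0.05}

words, weights = zip(*probs_after_is.items())
next_word = random.choices(words, weights=weights, k=1)[0]

print("the sky is " + next_word)  # usually "the sky is blue", but not always
```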