this post was submitted on 20 Apr 2025
1234 points (95.6% liked)

Fuck AI

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

founded 1 year ago
[–] ChairmanMeow@programming.dev 20 points 5 days ago (1 children)

Using open datasets means using data people have made available publicly, for free, for any purpose. So using an AI based on that seems considerably more ethical.

[–] kibiz0r@midwest.social 10 points 5 days ago (2 children)

Except gen AI didn’t exist when those people chose their licenses. Besides, it’s very difficult to specify “free to use, except in ways that undermine free access” in a license.

[–] unhrpetby@sh.itjust.works 3 points 5 days ago* (last edited 5 days ago)

The responsibility is on the copyright holder to use a license they actually understand.

If you license your work under, say, the BSD Zero Clause License (0BSD), you are very explicitly giving up your right to dictate how other people use your work. Don't be angry if people then use it in ways you don't like.

[–] ChairmanMeow@programming.dev 2 points 5 days ago (1 children)

How does a model that is trained on an open dataset undermine free access? The dataset is still accessible, no?

[–] kibiz0r@midwest.social 4 points 4 days ago (1 children)
[–] ChairmanMeow@programming.dev 1 points 4 days ago

This talks specifically about AI data scrapers being a problem, plus some general issues that frankly aren't exclusive to open-access information.

Exploitative companies are always a problem, whether it's AI or not. But someone who uses the Wikipedia text torrents as a dataset, for example, isn't doing any of what that article describes.