Microblog Memes

8415 readers

1998 users here now

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

Rules:

Please put at least one word relevant to the post in the post title.
Be nice.
No advertising, brand promotion or guerilla marketing.
Posters are encouraged to link to the toot or tweet etc in the description of posts.

Related communities:

founded 2 years ago

MODERATORS

ReadyUser31@lemmy.world

aeronmelon@lemmy.world

needanke@feddit.org

2115

Save The Planet (lazysoci.al)

submitted 4 days ago by sabreW4K3@lazysoci.al to c/microblogmemes@lemmy.world

302 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] Jakeroxs@sh.itjust.works 0 points 4 days ago (16 children)

I do, because they're not at full load the entire time it's in use

[–] FooBarrington@lemmy.world 1 points 4 days ago (15 children)

They are, it'd be uneconomical not to use them fully the whole time. Look up how batching works.

[–] Jakeroxs@sh.itjust.works 1 points 3 days ago* (last edited 3 days ago) (14 children)

I mean I literally run a local LLM, while the model sits in memory it's really not using up a crazy amount of resources, I should hook up something to actually measure exactly how much it's pulling vs just looking at htop/atop and guesstimating based on load TBF.

Vs when I play a game and the fans start blaring and it heats up and you can clearly see the usage increasing across various metrics

[–] PeriodicallyPedantic@lemmy.ca 3 points 3 days ago (1 children)

He isn't talking about locally, he is talking about what it takes for the AI providers to provide the AI.

To say "it takes more energy during training" entirely depends on the load put on the inference servers, and the size of the inference server farm.

[–] Jakeroxs@sh.itjust.works 3 points 3 days ago (1 children)

There's no functional difference aside from usage and scale, which is my point.

I find it interesting that the only actual energy calculations I see from researchers is the training and the things going along with the training, rather then the usage per actual request after training.

People then conflate training energy costs to normal usage cost without data to back it up. I don't have the data either but I do have what I can do/see on my side.

[–] PeriodicallyPedantic@lemmy.ca 2 points 3 days ago

I'm not sure that's true, if you look up things like "tokens per kwh" or "tokens per second per watt" you'll get results of people measuring their power usage while running specific models in specific hardware. This is mainly for consumer hardware since it's people looking to run their own AI servers who are posting about it, but it sets an upper bound.

The AI providers are right lipped about how much energy they use for inference and how many tokens they complete per hour.

You can also infer a bit by doing things like looking up the power usage of a 4090, and then looking at the tokens per second perf someone is getting from a particular model on a 4090 (people love posting their token per second performance every time a new model comes out), and extrapolate that.

load more comments (12 replies)