@ag10n

ag10n@lemmy.world · 9 days ago

Marketing incumbent

ag10n@lemmy.world · 15 days ago

Your company has bought you the latest and greatest and likely supports commercial token usage too

You can’t compare LLMs at scale to running it locally; same experience and capabilities

ag10n@lemmy.world · 15 days ago

Which are increasingly out of reach for a normal person. Phones let alone PC hardware have increased exponentially in recent history

ag10n@lemmy.world · 15 days ago

Describe greased lightning, because it’s much slower and needs to handle compression for context

We’re moving in that direction but an M5 is not what the majority of people are running at home

ag10n@lemmy.world · 15 days ago

How does that compare to closed models that Anthropic offers, at the context and scale they offer.

I run Qwen3.6 27B locally and it’s usable with 16G vram but still not the same as a data centre of Blackwell clusters.

ag10n@lemmy.world · 15 days ago

It’s not the 90s anymore. Unless there’s a compression algorithm putting billions of relationships into a manageable size, local AI is highly specific under 8G vram (text-to-speech as an example is under 1G) let alone the context required for keeping a conversation or writing code.

ag10n@lemmy.world · 15 days ago

What’s the cost of the compute you have to run something locally?

Majority of people don’t have 32G of vram to run something remotely as capable

ag10n@lemmy.world · 17 days ago

Hacktivists on GitHub. Show me your forgejo

ag10n@lemmy.world · 17 days ago

https://youtu.be/k_d_z46blQM

ag10n@lemmy.world · 19 days ago

deleted by creator

ag10n@lemmy.world · 19 days ago

These systems support a latent load so it’s not all at once. Something like this but at a massive scale.

https://www.ti.com/lit/an/slva670a/slva670a.pdf

Very cool engineering.

ag10n@lemmy.world · 19 days ago

Yes, this is similar to opencode or hermes. A gateway platform to integrate LLMs and tools

ag10n@lemmy.world · 23 days ago

Likely something like this

https://www.fresh222.com/loss-prevention-system-for-retail-stores-and-warehouses/

ag10n@lemmy.world · 28 days ago

This is the way. Computer use agents are common and can easily ‘browse’ to a page and grab the content.

ag10n@lemmy.world · 1 month ago

Tmux/screen foreign concepts

ag10n@lemmy.world · 2 months ago

Quote me in full.

You can run it at scale, on huawei. You can also run it on a cpu

ag10n@lemmy.world · 2 months ago

Thank you for proving my point. It can be run on a cpu

“It’s slow, it’s inefficient” it still runs

It’s a foundational model just like R1 was.

ag10n@lemmy.world · edit-2 2 months ago

https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro

ag10n@lemmy.world · 2 months ago

Yes, you can run it at scale. Which is why it uses Huawei hardware.

You can run it on anything, scaled or not

ag10n@lemmy.world · 2 months ago

You can run it on CPU alone. Not surprising they’re building their own AI ecosystem