AI Megathread

  • 🔧 Site instability resolved. You can report double-posts and broken attachments. For bigger issues, use the Technical Grievances thread.
    🇵🇦 Nuestro primer dominio localizado está en español en kiwifarms.pa. Our first localized domain is on Spanish on kiwifarms.pa.
  • Want to keep track of this thread?
    Accounts can bookmark posts, watch threads for updates, and jump back to where you stopped reading.
    Create account
Completely wrong. It has 8GB of DDR5 RAM. The 4000 series has GDDR6X.

That makes it absolute dogshit for LLM performance. It also needs a NVME drive to do anything useful which adds another hundred or two.
If it were at least as capable as the 2060 12gb I bought a few years back for under $250, I'd probably try it out just for the novelty. But this looks like another crippled hand-me-down, dregs that the enterprise space doesn't want.
 
Completely wrong. It has 8GB of DDR5 RAM. The 4000 series has GDDR6X.

That makes it absolute dogshit for LLM performance. It also needs a NVME drive to do anything useful which adds another hundred or two.
yeah I guess you're right, if you check the operations per second and the memory bandwidth a 4080 is around 10 times faster at each
it looks like the jetson is also using the old die that the 3090 used, not the 40 series die
 
If it were at least as capable as the 2060 12gb I bought a few years back for under $250, I'd probably try it out just for the novelty. But this looks like another crippled hand-me-down, dregs that the enterprise space doesn't want.
I think what's going to happen is that Nvidia is going to eventually go with subpar fabs with high defect rates for raw output. Like the recently built ones in the U.S. Then those defective GPUs will just get binned for the consumer and tossed onto pcie cards or garbage like this.

Edit:I suck dicks. Apparently the only 4nm foundry in the U.S. has the same quality standards/yields as the other ones owned by TSMC in Taiwan. Still think all consumer cards will be enterprise rejects though.
 
Última edición:
If this keeps up, dropping costs and improving capability on the low-end local devices at the current rate of change it won't take all that long until dirt-cheap consumer electronics will be good enough to completely displace the AIaaS subscriptions for 90-95% of all use cases. That would really fuck up the economy of building and operating AI datacentres.
That is the dream. AI is such a cool tool conceptually and even in practice given self-hostable tools like Open Claw and the like, and being able to fully disentangle it from the same issue that most other soy dev projects are faced with (see: Wayland, Mac, Rust etc.), is the cultist community. Once you can run your own powerbox that can help you code without creating a feedback loop to the globohomo giga models, I can easily forsee the advent of "clean" local models that can ape personal coding style and quality much better than current public models. Or, even better, to help with research and text synthesis without feeding said text to public models.
 
How many years away are we from fully AI-driven RPGs?
You can make a relatively decent one using SillyTavern and offloading persistent stuff to world books. Use the world books to keep information about the setting in them to keep the context low. The main issue is keeping track of inventory and stat calculations which LLMs have an issue with.
 
One of the coolest videos on artificial intelligence, bridging the gap between "physical dumbness" of robots by emulating nature's way of dealing with the present, such as how humans live .1 second behind reality: predict the future and correct itself when things doesn't line up.
 
From 2022-2025 the ai discourse was the same, but something is different now. At first I thought it was facebook, x, etc algorithm trying to ragebait me, and that very may well be a part, but somethings brewing. An incredible amount of retards are foaming at the mouth against AI. Its downright political right now, Seems to be very liberal/women driven, like all other retarded protests of the century. I could see a civil war forming over this
 
I’m choosing to interpret this as a based anti pajeet coder power move because of all the shit they gave him with t series.
 
That is the dream. AI is such a cool tool conceptually and even in practice given self-hostable tools like Open Claw and the like, and being able to fully disentangle it from the same issue that most other soy dev projects are faced with (see: Wayland, Mac, Rust etc.), is the cultist community. Once you can run your own powerbox that can help you code without creating a feedback loop to the globohomo giga models, I can easily forsee the advent of "clean" local models that can ape personal coding style and quality much better than current public models. Or, even better, to help with research and text synthesis without feeding said text to public models.
you already see this with the image generation models, i've only dabbled in it because it really is mostly porn but it's orders of magnitude more advanced and specific and capable than what you get out of the huge monolithic models. if you're clever you can make stuff that people really won't be able to tell is AI. it's actually really funny that people are specifically trying to get the bizarre overcranked HDR look when it's a telltalle sign they're retards who paid for midjourney and are using what is essentially the first thing the AI cranked out
 
I’m choosing to interpret this as a based anti pajeet coder power move because of all the shit they gave him with t series.
As if most, if not all jeet code slop isn't already AI generated.

I like Null's interpretation. AI coding just makes REAL coders more productive.
 
1780356940036.png
Sold. I recent acquired a V100 and it's just sitting there waiting for me deploy something....
 
How is it that the various kinds of AI had accelerated development at the same time?

Like why is AI image and music generation really taking off along with LLMs as well?
 
How is it that the various kinds of AI had accelerated development at the same time?

Like why is AI image and music generation really taking off along with LLMs as well?
They're all based on the same basic idea of training a neural net on an enormous amount of data. We've only relatively recently gotten to a point with hardware where this is possible at the necessary scale. Doesn't matter if that data is text, images or audio, the same principle seems to apply. If you asked any AI scientist in 2017 when the transformer paper came out whether the results we've seen would be possible by just feeding that architecture the entire content of the Internet, they would be very skeptical.

In the last year or so we've been seeing smaller, more capable local models, so it's not an ironclad rule that more data = better and less data = worse, but we had to get to the point with hardware where models like GPT-3 could be trained and used to discover the value of the approach.

It's really kind of a dumb, brute-force approach to building AI, but you see this all the time with computer science: people make clever algorithms and hardware designs to work around resource constraints and then Moore's Law kicks in and you end up getting better results with simple methods on improved hardware.
 
In the last year or so we've been seeing smaller, more capable local models, so it's not an ironclad rule that more data = better and less data = worse, but we had to get to the point with hardware where models like GPT-3 could be trained and used to discover the value of the approach.
Yea, small local models are getting much better. I'm using Gemma 4 and Qwen 3.6 for coding. And while I'm pretty sure they're not trained on a ton of C code they know enough to look at headers in the libraries in use and work stuff out. Where in the past it was just a giant model that knew all the stuff. Can I ask them about random facts, probably not, but for "Hey, this program is doing X, please fix it" they can work it out... slowly.
 
How is it that the various kinds of AI had accelerated development at the same time?

Like why is AI image and music generation really taking off along with LLMs as well?
It's the same reason why Vibe Coding is sky-rocketing in usage as well.
More powerful and adaptive models, means not blowing your brains out into making prompts, which means, people don't need to blow a boat load of moolah into an entire crew just to make a convincing visual / music piece, and thus, people that use this can actually gain more than they lose if they know how to make the prompts:slayer:
 
Atrás
Top Abajo