AI Megathread

  • 🔧 Site instability resolved. You can report double-posts and broken attachments. For bigger issues, use the Technical Grievances thread.
    🇵🇦 Nuestro primer dominio localizado está en español en kiwifarms.pa. Our first localized domain is on Spanish on kiwifarms.pa.
  • Want to keep track of this thread?
    Accounts can bookmark posts, watch threads for updates, and jump back to where you stopped reading.
    Create account
Has anyone thought of ideas to make money with agentic coding?

Something scalable. Greyhat, blackhat, whitehat, anything.
Be good at something, and then use vibe coding to augment your talent, and automate it. I think that's really all it's good for from a product design point. It needs to be paired with something that isn't AI for it to work, and have a value proposition.

If you're making money on something because it's easy, don't expect to be making money on it long.
 
Just switched from OpenClaw to Hermes. WOW. Hermes is so much better. MiMo-V2.5-Pro is my favorite affordable model.
 
Looks like the new wave of pajeets arrived at OpenAI with their fake H1Bs

1780812004503.png
 
My go-to when I want to use a chatbot for info is for it to prompt some sort of roleplay and so far it has never refused to provide. Maybe it's a matter of how prominent the character is but I never hit a wall ever.
wwii.jpg
That method is also a great way to have info without making it a blatant propaganda piece since the opposing viewpoint is shown in earnest.
 
That method is also a great way to have info without making it a blatant propaganda piece since the opposing viewpoint is shown in earnest.
You could just prompt it to have the goal of defending a specific view point and you start poking holes. It's a good way to get your nogging jogging. I sometimes do it when trying to figure out things like design.
 
You can't do anything remotely fun with these AIs. Trying to teach myself the Comfy UI stuff at the moment.

Takes about 5 minutes to generate an image on my card though.
comfy is very good. I haven't played with the qwen model yet but it's highly-regarded. Flux in my opinion is too big and too slow to use. Illustrious is more or less only for anime

you should definitely learn to use ControlNet and the ControlNet model named 'anytest'
 
Trying to teach myself the Comfy UI stuff at the moment.

Takes about 5 minutes to generate an image on my card though.
comfy is very good. I haven't played with the qwen model yet but it's highly-regarded. Flux in my opinion is too big and too slow to use. Illustrious is more or less only for anime

you should definitely learn to use ControlNet and the ControlNet model named 'anytest'
Qwen Image is good if you have the hardware for it. Flux 2 Dev is definitely too slow for general use, but Flux 2 Klein 4B and 9B are very solid models with both generation and edit capabilities that are a lot smaller and faster. I've been using 9B over Qwen Image for a while now.

The new Ideogram 4 model is very cool as well if you have the hardware to run it and use it with the KJNodes prompt builder node, e.g.


And ComfyUI also supports speech models like QwenTTS and music models like ACE Step 1.5, which are comparable in size to smaller image models. And there's LTX 2.3 for Sora-quality local video, though again that may require some beefier hardware.
 
what node are you using for regional prompting:? My regional prompting setup is way more complex than that
It's KJNodes Ideogram 4 Prompt Builder, which is a bit different from regional prompting, in that the Ideogram 4 model is trained to understand JSON prompts containing bounding boxes. This node just lets you construct the JSON prompt visually.

I haven't properly tested whether this approach is significantly better than equivalent regional prompting with a different model.
 
Última edición:
I haven't properly tested whether this approach is significantly better than equivalent regional prompting with a different model.
In my experience it seems like each prompt affects execution time heavily regardless of whether it's regional or not. Each "conditioning" on the clip network nonlinearly increases it. The third adds more time than the second did, the fourth adds more time than the third, etc. Most other operations have a linear effect on execution time, like they just add a flat amount.

Also the standard way of regional prompting gets hairy when you have overlapping segments and quickly turns into a game of AI shamanism
 
Well, Anthropic has finally released a lobotomized version of Mythos to the public. It eats usage limit like Heather Heyer eats anything else, and its safeguards get triggered and force it to bug out if you so much as ask it for a sandwich recipe, but its benchmark scores show a substantial lead over OpenAI and Google. A shame to see the most censorious company running away with the ball.

They're tentatively removing it from the plan limits by next month, but I doubt they have the balls to go through with that. Nonetheless, if you've got a subscription, you may as well use it now.

Victor Taelin, on Twitter, says that AGI is imminent, and he's a lot more rigorous than the usual crop of street preachers.
 
Victor Taelin, on Twitter, says that AGI is imminent, and he's a lot more rigorous than the usual crop of street preachers.
he doesn't understand teleoperation is impossible and that every factory that wants to run an AGI is going to need a server closet full of hardware to run the AI

currently he believes an AGI running in the cloud is capable of operating a robot

he did not see Google try to launch Samurai Shodown on the Stadia
 
he doesn't understand teleoperation is impossible and that every factory that wants to run an AGI is going to need a server closet full of hardware to run the AI
I've done a bit of work in this area, and there's more than one way to skin this particular cat. Yes, you can use a VLA model to directly output motor signals and control a robot that way, but you can also use something like Code as Policies (which has progressed quite a lot since 2022, and the original paper now has nearly 2,000 citations). There's no real bottleneck for the latter, since the generated policy can run locally without any special hardware.
 
Atrás
Top Abajo