- Registrado
- 15 de Dic, 2022
AI companies are already running out of high-quality training sources for their LLM’s, so they’re filtering in more garbage from websites like Reddit and Twitter. It’s projected that tech companies will have exhausted all (accessible) human-made data by 2030. If this were to happen, AI would have to start training off bot activity, though doing so has proven to severely degrade the quality of the AI’s output.
The only way I could see companies getting around this is if they invested billions into acquiring corporations to gain access to more data, like publishing companies. Otherwise, it seems like AI is going to hit a wall, or start regurgitating more crap.
The only way I could see companies getting around this is if they invested billions into acquiring corporations to gain access to more data, like publishing companies. Otherwise, it seems like AI is going to hit a wall, or start regurgitating more crap.