The new analysis contradicts the social media platform’s claims that exposure to hate speech and bot-like activity decreased during Elon Musk’s tenure.
He keeps trying. He seems to think that can be done by just putting his thumb on the system prompt, and then we end up with obvious nonsense like that South African white genocide preoccupation. It’s fortunate the Musk isn’t smart enough to figure out how to do it subtly.
I’m nowhere close to being an LLM specialist but to actually skew the model itself I think you need a lot of consistent data. Ten thousand alt-right blogs peddling a hundred thousand internally inconsistent and mutually incompatible narratives won’t cut it, they’ll criss-cross over the gradient landscape and because they don’t coincide, won’t make a dent in the deep groves trodden by pirating libgen. And training only on the alt-right blogs won’t cut it either that’s just not enough data which on top of that doesn’t sound smart enough to woo anyone, or have any resemblance of a consistent stance. Sure you’ll get it to claim ridiculous shit and use lots of slurs but 4chan managed to do that back in 2016 and noone was fooled.
Somehow, the most reasonable account on Vichy Twitter is Grok because it’s hard to train an LLM using only data dumbasses wrote.
Sadly that’s going to change. Musk is going to push an update that I suspect will turn Grok into a far-right disinformation parrot or become closer to that. See https://lemm.ee/post/67429541 and https://lemm.ee/post/67216433
He keeps trying. He seems to think that can be done by just putting his thumb on the system prompt, and then we end up with obvious nonsense like that South African white genocide preoccupation. It’s fortunate the Musk isn’t smart enough to figure out how to do it subtly.
I’m nowhere close to being an LLM specialist but to actually skew the model itself I think you need a lot of consistent data. Ten thousand alt-right blogs peddling a hundred thousand internally inconsistent and mutually incompatible narratives won’t cut it, they’ll criss-cross over the gradient landscape and because they don’t coincide, won’t make a dent in the deep groves trodden by pirating libgen. And training only on the alt-right blogs won’t cut it either that’s just not enough data which on top of that doesn’t sound smart enough to woo anyone, or have any resemblance of a consistent stance. Sure you’ll get it to claim ridiculous shit and use lots of slurs but 4chan managed to do that back in 2016 and noone was fooled.
All very good points.