

As I pointed out in another root comment, the average - depending on the model being tested - tends to sit between 60% and 80%. But this is with no restriction on source materials… the LLMs are essentially pulling from world+dog in that case
So this opens up an interesting option for users, in that hallucinations/inaccuracies can be controlled for and potentially reduced by as much as ⅔ simply by restricting the model to those documents/resources that the user is absolutely certain contains the correct answer.
I mean, 25% is still stupidly high. In any prior era, even 2.5% would have been an unacceptably high error rate for a business to stomach. But source-restriction seems to be a somewhat promising guardrail to use for the average user doing personal work.




If betting on Polymarket, you would actually have to stump up that money first, and the other person would have to do the same with whatever bid they wanted to use. Then, in order to get any kind of reasonable payback, you would need thousands of other people to make a bet for or against, using their own money.
The payout isn’t on someone making a bet on themselves, no-one else would bet for or against that as the stakes are so small. The payout is on large-scale events that are - ostensibly - out of the control of the bettor or bettee.
Polymarket is no different than betting on the outcomes of horse races or sports games, it just opens up the thing being betted on to anything and everything. People will still bet. The key is how “un-rigged” it appears to be.