misk@sopuli.xyz to Technology@lemmy.world · 26 days ago
Apple Intelligence summary botches a headline, causing jitters in BBC newsroom (www.theregister.com)
brucethemoose@lemmy.world · 6 days ago
For RAG data? It works.
But it's too slow for the weights. Generative models fundamentally run a full pass through the multi-gigabyte weights for every ‘word’ (token) or diffusion step, so even 128-bit DDR5, like you find on desktop CPUs, is too slow.
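To put rough numbers on that, here is a back-of-the-envelope sketch. It assumes generation is purely memory-bandwidth bound, so every token has to stream all the weights once. The DDR5-5600 speed and the 8 GB model size are illustrative assumptions, not figures from the thread, and the function names are made up for this example.

```python
def peak_bandwidth_gb_s(bus_width_bits: int, mega_transfers_per_sec: float) -> float:
    """Theoretical peak memory bandwidth in GB/s (1 GB = 1e9 bytes)."""
    bytes_per_transfer = bus_width_bits / 8
    return bytes_per_transfer * mega_transfers_per_sec * 1e6 / 1e9


def max_tokens_per_sec(bandwidth_gb_s: float, weights_gb: float) -> float:
    """Upper bound on tokens/s if each token reads every weight once."""
    return bandwidth_gb_s / weights_gb


# Assumption: dual-channel (128-bit) DDR5 at 5600 MT/s.
bandwidth = peak_bandwidth_gb_s(128, 5600)

# Assumption: a hypothetical model whose weights occupy 8 GB.
ceiling = max_tokens_per_sec(bandwidth, 8.0)

print(f"peak bandwidth: {bandwidth:.1f} GB/s")
print(f"token rate ceiling: {ceiling:.1f} tokens/s")
```

This is a best-case ceiling. Real throughput comes in lower, since sustained bandwidth falls short of the theoretical peak and compute, caches, and batching are ignored here.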