This is the technology worth trillions of dollars huh

  • panda_abyss@lemmy.ca
    link
    fedilink
    English
    arrow-up
    21
    ·
    edit-2
    4 hours ago

    Yesterday i asked Claude Sonnet what was on my calendar (since they just sent a pop up announcing that feature)

    It listed my work meetings on Sunday, so I tried to correct it…

    You’re absolutely right - I made an error! September 15th is a Sunday, not a weekend day as I implied. Let me correct that: This Week’s Remaining Schedule: Sunday, September 15

    Just today when I asked what’s on my calendar it gave me today and my meetings on the next two thursdays. Not the meetings in between, just thursdays.

    Something is off in AI land.

    Edit: I asked again: gave me meetings for Thursday’s again. Plus it might think I’m driving in F1

    • achance4cheese@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      4
      ·
      2 hours ago

      Also, Sunday September 15th is a Monday… I’ve seen so many meeting invites with dates and days that don’t match lately…

      • panda_abyss@lemmy.ca
        link
        fedilink
        English
        arrow-up
        1
        ·
        24 minutes ago

        Yeah, it said Sunday, I asked if it was sure, then it said I’m right and went back to Sunday.

        I assume the training data has the model think it’s a different year or something, but this feature is straight up not working at all for me. I don’t know if they actually tested this at all.

        Sonnet seems to have gotten stupider somehow.

        Opus isn’t following instructions lately either.

    • FlashMobOfOne@lemmy.world
      link
      fedilink
      English
      arrow-up
      13
      arrow-down
      1
      ·
      4 hours ago

      A few weeks ago my Pixel wished me a Happy Birthday when I woke up, and it definitely was not my birthday. Google is definitely letting a shitty LLM write code for it now, but the important thing is they’re bypassing human validation.

      Stupid. Just stupid.