• wulrus@lemmy.world
    21 hours ago

    I am generally a sceptic myself, especially in my own area, which is software development. But recently in a board game community, someone was scolded for asking ChatGPT about a rule dispute (and its answer was wrong). All the upvotes went to unhelpful “AI bad” comments. I pointed out that while this was true 3 months ago, ChatGPT 5 (and only that one) can answer such questions very accurately when asked the right way. I showed how to phrase the user's question and the (now correct) response, and mentioned my 35 board game test questions and the results with the major flagship LLMs. (Almost all LLMs did horribly, scoring under 70% even on yes/no questions, but ChatGPT 5 with specific instructions or the “Thinking” model got 100%.)

    Even as a sceptic, I can acknowledge that, in this specific niche, LLMs have jumped from completely useless to perfect in the past few months.