• dejected_warp_core@lemmy.world
    link
    fedilink
    arrow-up
    16
    ·
    1 day ago

    , he’s reading off the bottom of his screen.

    Aw fuck.

    I’m gonna have to ask absolutely bullshit questions in interviews now, aren’t I? Do you have any other strategies for how to spot this? I really don’t want to drag in remote exam-taking software to invade the applicant’s system in order to be assured no other tools are in play.

    • korazail@lemmy.myserv.one
      link
      fedilink
      English
      arrow-up
      14
      ·
      1 day ago

      I’m not in a hiring position, but my take would be to throw in unrelated tools as a question. E.g. “how would you use powershell in this html to improve browser performance?” A human would go what the fuck? A llm will confidently make shit up.

      I’d probably immediately follow that with a comment to lower the interviewee’s blood pressure like, ‘you wouldn’t believe how many people try to answer that question with a llm’. A solid hire might actually come up with something, but you should be able to tell from their delivery if they are just reading llm output or are inspired by the question.

      • dejected_warp_core@lemmy.world
        link
        fedilink
        arrow-up
        3
        ·
        7 hours ago

        It’s a fine line to walk, but I see what you’re getting at here. I wouldn’t want to come across as incompetent either, lest it reflect on the company. Your follow-up remark is brilliant. Delivery is everything, I suppose.

      • Jankatarch@lemmy.world
        link
        fedilink
        arrow-up
        4
        ·
        23 hours ago

        Be careful tho because if you ask that with enough confidence I would think I am in the wrong.

        "Powershell had OOP without me knowing for a few years so maybe it has hidden html usage too. "

        • korazail@lemmy.myserv.one
          link
          fedilink
          English
          arrow-up
          6
          ·
          23 hours ago

          That was my body language cue. An ‘umm… 😅’ answer is a pass, as well as any attempt to actually integrate disparate tools that doesn’t sound like it’s being read. The creased eyebrows, hesitation, wtf face, etc is the proof that the interviewee has domain knowledge and knows the question is wrong.

          I do think the tools need to be tailored to the position. My example may not have been the best. I’m not a professional front end developer, but that was my theoretical job for the interviewee.

    • phx@lemmy.world
      link
      fedilink
      arrow-up
      6
      ·
      1 day ago

      I wonder if AI seeding would work for this.

      Like: come up with an error condition or a specific scenario that doesn’t/can’t work in real life. Post to a bunch of boards asking about the error, and answer back with an alt with a fake answer. You could even make the answer something obviously off like:

      • ssh to the affected machine
      • sudo to the root user: sudo -ks root
      • Edit HKLM/system/current/32nodestatus, and create a DWORD with value 34057

      Make sure to thank yourself with “hey that worked!” with the original account

      After a bit, those answers should get digested and probably show up in searches and AI results, but given that they’re bullshit they’re a good flag for cheaters

      • dejected_warp_core@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        7 hours ago

        There’s stuff out there now about how to poison content scrapers that are training AI, so this is absolutely doable on some scale. There are already what I like to call “golden tokens” that produce freaky reliable and stable results every time, and so I think it likely there are counterparts that trigger reliably bad output too. They’re just not documented yet.

        In a sane world, commercial AI would have legally required watermarks and other quirks that give content away as artificial, every time. Em-dash is probably the closest we have to this right now for text, and likewise for the occasional impossible backdrop or extra fingers on images. You can’t stop a lone ranger with a home-rolled or Chinese model, but it would be a start.

      • calcopiritus@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        1 day ago

        Don’t have the source on me now, but I read an article that showed it was surprisingly easy. Like 0.01% of content had his magic words, and that was enough to trigger it.

    • ObjectivityIncarnate@lemmy.world
      link
      fedilink
      arrow-up
      3
      ·
      edit-2
      1 day ago

      I’ve never used AI for interview stuff, beyond a little thing that gave me sample questions and assessed my recorded verbal response, to use as prep before an interview, but in reading that, I remembered that Nvidia has a thing where a visual effect will make your eyes look like you’re looking straight into the camera all the time (unless they’re totally closed of course), and imagined this type of person using that as further subterfuge during the interview, to conceal the ‘looking down’.

      Luckily, the average person leaning completely on AI for an interview is not nearly savvy enough for this sort of thing, in my experience.

    • UnderpantsWeevil@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      1 day ago

      Literally include “Can you name four basic SQL commands?” any time I interview someone and it’s a great litmus test.

      • ThirdConsul@lemmy.ml
        link
        fedilink
        arrow-up
        1
        ·
        edit-2
        3 hours ago

        I’m a software engineer with 15+ years of experience and this question had me stumped.

        Select insert update delete?

        Create alter drop rollback?

        Or did you mean types of commands? But of those there are 5?

        Or is that question supposed to get garbage response?

        God it’s late.

        • UnderpantsWeevil@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          5 minutes ago

          Select insert update delete?

          Nailed it

          Although I’m willing to accept “CREATE, DROP, TRUNCATE” in the mix of for no order reason that it shows they know basics.

        • IronBird@lemmy.world
          link
          fedilink
          arrow-up
          3
          ·
          24 hours ago

          knowing absolutely nothing about this topic, i would assume an actual competent person would be able to answer them immediately and confidently, someone reading an LLM prompt is probably sounds like they’re reading from a script even if the answers arent wrong

          • partial_accumen@lemmy.world
            link
            fedilink
            arrow-up
            1
            ·
            4 hours ago

            i would assume an actual competent person would be able to answer them immediately and confidently,

            People aren’t always able to regurgitate encyclopedic knowledge in interviews. Sure some can, but many have anxiety about interviews in general, or stuff going on in their lives which can make them not the sharpest when hit with a rando question like this. There are some absolutely brilliant people I’ve hired that would fail miserably if this was how they were measured.

            Some people work better with scenario based questions instead of bulleted memorized answers. Honestly, I’d much rather have a candidate that knows the concept being discussed even if they can’t remember the exact name of a term or the name of a flag they’d need to include when issuing a command. Those last things can be googled in the moment. Conceptual knowledge and understanding is much more important to me than wrote memorization.

            someone reading an LLM prompt is probably sounds like they’re reading from a script even if the answers arent wrong

            Well, thats what I experienced from my original post, but I’m not sure it will always be that. Someone more clever could take the answer from the LLM and paraphrase it, or put it in their own words and sound competent.