Considering that LLMs are trained on the whole of the internet, it's kind of amazing that they don't talk back to you like a condescending, smug asshole

Perspectivist@feddit.uk · 5 hours ago

Considering that LLMs are trained on the whole of the internet, it's kind of amazing that they don't talk back to you like a condescending, smug asshole

morto@piefed.social · 20 minutes ago

Maybe we underestimate people a bit. The assholes tend to be more impacting to us, but most people aren’t like that, and we tend not to notice the several neutral or good interactions the same way.

scytale@piefed.zip · 7 minutes ago

Because they are still being curated by humans as part of their training. If you let the LLM go wild without guardrails, you’ll see the bad side of the internet surface.

TheLeadenSea@sh.itjust.works · 5 hours ago

They have RLHF (reinforcement learning from human feedback) so any negative, biased, or rude responses would have been filtered out in training. That’s the idea anyway, obviously no system is perfect.

SpaceNoodle@lemmy.world · 5 hours ago

Then why are they all still smarmy assholes?

SkyNTP@lemmy.ml · edit-2 4 hours ago

That’s what was said. LLMs have been reinforced to respond exactly how they do. In other words, that “smarmy asshole” attitude, you describe was a deliberate choice. Why? Maybe that’s what the creators wanted, or maybe that’s what focus groups liked most.

BlackLaZoR@fedia.io · 4 hours ago

They’re talking neutral by default, but they absolutely talk trash if you prompt them to.

nesc@lemmy.cafe · 5 hours ago

They do.

ikt@aussie.zone · 4 hours ago

which llm are you using?

BlackLaZoR@fedia.io · 4 hours ago

4chanGPT maybe?

essell@lemmy.world · 3 hours ago

I want this so much.

BlackLaZoR@fedia.io · 1 hour ago

It was a thing, but Huggingface removed it due to the surrounding drama

essell@lemmy.world · 46 minutes ago

Oh wow

Yeah, I can imagine it’d be horrible and dark, a satire of the dark-side of humanity to the point of hilarity.

BlackLaZoR@fedia.io · 4 minutes ago

hmm, it has a torrent available on web archive https://archive.org/details/gpt4chan_model_float16, but it seems to be in very outdated .bin format. If you have experience with llama.cpp tooling you might be able to convert it into something usable… just be careful, this old format isn’t reinforced against malware

rozodru@piefed.world · 3 hours ago

with ChatGPT you can tell it to behave in certain ways. With Claude it’ll just start mimicking you.

nesc@lemmy.cafe · 3 hours ago

Any would talk however you prompt it to talk.