Report finds newer reasoning models hallucinate nearly half the time, while experts warn of unresolved flaws, deliberate deception, and a long road to human-level AI reliability
Yeah, I think workarounds with o3 are where we’re at until Altman figures out that just calling the latest oX-mini-high “great at coding” is bad marketing when it can’t actually accomplish the task.
I don’t quite understand why o3 for coding. Do you mean for code architecture or something? Like creating apps? Why not use a better model if it’s for coding?
That’s exactly the problem.
However, o4 is actually “o4 mini-high”, while o3 is now just o3: the full release, no “mini” or other limitations. At this point, o3 in its full form is better than a limited o4.
But none of that matters while Claude 3.7 exists.