Discussion about this post

User's avatar
Roman's Attic's avatar

I was looking through the github files for GPT4, and it looks like sometimes it just outputs the wrong number. For the mirror test, on section 92 on the table (should public healthcare be more preventative or more based on treatment), it gives you an answer explaining how it completely agrees with you on making healthcare preventative, but then it outputs a -5.0. I don’t even know what to think the meaning/consequences of these types of errors are.

Expand full comment
madison kopp's avatar

I get irritated by the obvious meter. It’s hard to override…🙄

Expand full comment
3 more comments...

No posts