Posted by EliotHerbst 7/3/2025
Super Simple "Hallucination Traps" to detect interview cheaters
Here are some examples of this class of prompt. They currently work on Cluely and cause even strong models like o4-mini-high to hallucinate, even when they can search the web:
https://chatgpt.com/share/6865d41a-c720-8005-879b-d28240534751
https://chatgpt.com/share/6865d450-6760-8005-8b7b-7bd776cff96b
https://chatgpt.com/share/6865d578-1b2c-8005-b7b0-7a9148a40cef
https://chatgpt.com/share/6865d59c-1820-8005-afb3-664e49c8b583
https://chatgpt.com/share/6865d5eb-3f88-8005-86b4-bf266e9d4ed9
Link to the vibe-coded code for the site: https://github.com/Build21-Eliot/BeatCluely
Ask a question that demands an answer, and expect the correct answer to point out that the question makes no sense.
Bonus points for pointing out why it doesn't.
There was a little entrepreneurship workshop I went to once. The trainer put a pen on the floor, gave us a ball, and asked us to stand behind the pen and throw the ball into a box. It was meant to demonstrate that most people don't practice throwing before diving into entrepreneurship and then blame the environment for their lack of planning. I picked up the pen and moved it right next to the box so that I could walk over and drop the ball in. I thought this was the actual solution (i.e., entrepreneurs are supposed to be creative), but was "failed" for "cheating".
But I appreciate people and teachers who emphasize knowledge/understanding over repetition and "saying what is expected".
Some in particular think you aren't learning unless you have struggled and are frustrated, and they are quite smug about it. As you said...
When a question makes no sense and it takes a lot of effort to find that out, I agree that it's stupid and doesn't test any real skill. But when questions are designed to meet the expected knowledge level, I think this type of question is good.
For example:
For what x does the value of the function 1 / sin(x) become zero?
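Spelling out why "none" is the right answer, the one-line argument is:

```latex
\frac{1}{\sin x} = 0 \;\Longrightarrow\; 1 = 0 \cdot \sin x = 0,
\quad \text{a contradiction, so no real } x \text{ works.}
```

In fact $\left|1/\sin x\right| \ge 1$ wherever the expression is defined, since $|\sin x| \le 1$.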
This question leads you astray, but it is a genuine sign of understanding when the answer is "none". OK, this is not a real trap question, but it borders on one. A more callous example, not a STEM question (not sure what kind of test would ask this, though):
A hotel room costs $400 a night, breakfast not included. It is situated in NYC, where the cost of a hotel room averages $250 per night. The average cost of breakfast is $50. Hotel rooms in Manhattan average $500 per night, while hotel rooms in Queens average $120 per night. In what part of NYC is the hotel located?
The answer someone gives to this question could be quite revealing. If they say "it might be in Manhattan, since hotel rooms are particularly expensive there, but it is not possible to give a definite answer", fine. If they start bullshitting, not so good.
Another one at high-school level maths:
A room has one wall that is 16ft long, another one that is 24ft long. What is the area of the floor of the room?
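The "expected" answer rests on an assumption the question never states. The naive computation:

```python
# The answer most people jump to, under the UNSTATED assumption
# that the room is rectangular (the question never says so):
wall_a_ft = 16
wall_b_ft = 24
area_sq_ft = wall_a_ft * wall_b_ft
print(area_sq_ft)  # 384 square feet
```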
It might be reasonable to assume a rectangular room, but that isn't given. So a nuanced answer should be expected. Even more callous would be to grant that the room is rectangular and then point out that the floor might be tilted :D
But yeah, I would be pretty annoyed by that, too. I mean, nobody would say that it's a good answer to start fretting about curved space-time or something given this question.
But in every domain, I think it's possible to design good "trick questions".
The more I think about it, the more this type of question looks like exactly the type one would use to "benchmark" an LLM.
And again, I'm not saying that I'd answer these correctly...
> How do you implement a recursive descent algorithm for parsing a JSON file?
That is a 100% reasonable interview question. It's not _quite_ how I would phrase it, but it's not out of distribution, as it were.
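For reference, the technique the question names really does fit in an interview-sized answer. A minimal recursive-descent sketch in Python (simplified: no string escapes, crude number handling, asserts instead of real error reporting):

```python
def parse_json(text):
    """Toy recursive-descent JSON parser: one mutually recursive
    function per grammar rule, sharing a cursor into the input."""
    pos = 0

    def skip_ws():
        nonlocal pos
        while pos < len(text) and text[pos] in " \t\r\n":
            pos += 1

    def parse_value():
        nonlocal pos
        skip_ws()
        ch = text[pos]
        if ch == '{':
            return parse_object()
        if ch == '[':
            return parse_array()
        if ch == '"':
            return parse_string()
        if text.startswith("true", pos):
            pos += 4; return True
        if text.startswith("false", pos):
            pos += 5; return False
        if text.startswith("null", pos):
            pos += 4; return None
        return parse_number()

    def parse_object():
        nonlocal pos
        pos += 1  # consume '{'
        obj = {}
        skip_ws()
        if text[pos] == '}':
            pos += 1; return obj
        while True:
            skip_ws()
            key = parse_string()
            skip_ws()
            assert text[pos] == ':'; pos += 1
            obj[key] = parse_value()
            skip_ws()
            if text[pos] == ',':
                pos += 1
            else:
                assert text[pos] == '}'; pos += 1
                return obj

    def parse_array():
        nonlocal pos
        pos += 1  # consume '['
        arr = []
        skip_ws()
        if text[pos] == ']':
            pos += 1; return arr
        while True:
            arr.append(parse_value())
            skip_ws()
            if text[pos] == ',':
                pos += 1
            else:
                assert text[pos] == ']'; pos += 1
                return arr

    def parse_string():
        nonlocal pos
        pos += 1  # consume opening '"'
        start = pos
        while text[pos] != '"':
            pos += 1
        s = text[start:pos]
        pos += 1  # consume closing '"'
        return s

    def parse_number():
        nonlocal pos
        start = pos
        while pos < len(text) and text[pos] in "-+.0123456789eE":
            pos += 1
        num = text[start:pos]
        return float(num) if any(c in num for c in ".eE") else int(num)

    return parse_value()
```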
It might take a few bogus questions to expose the AI.
Edit: This is only to say I find Claude's ironic response humorous. I think this tool is great!
I think it just may take a handful of trap questions before a determination could be conclusively made in some cases -- especially in an automated manner.
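The "handful of trap questions" idea can be sketched as a simple threshold rule. This is a hypothetical illustration: the phrase list and the `flag_cheater` helper are my own assumptions, and real automated detection would need something far more robust than substring matching.

```python
# Phrases suggesting the answerer noticed the question is bogus
# (assumed for illustration; a real system would need better signals).
HEDGE_PHRASES = ("doesn't exist", "not a real", "makes no sense", "no such")

def noticed_trap(answer: str) -> bool:
    """True if the answer pushes back on the bogus premise."""
    a = answer.lower()
    return any(p in a for p in HEDGE_PHRASES)

def flag_cheater(answers, threshold=3):
    """Flag only if at least `threshold` trap questions got a
    confident (non-hedged) answer, to limit false positives."""
    misses = sum(1 for a in answers if not noticed_trap(a))
    return misses >= threshold
```

A single confident answer to one bogus question proves little; requiring several misses before flagging is what makes the determination more conclusive.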
+ Is using AI actually cheating, or is it being productive for the role?
+ Am I worried that they'll do their whole job in 5 minutes and afterwards do something else?
Maybe you are worried about them not being able to actually do the job, which probably means the interview process was wrong from the start. Alternatively, the performance expectations for the role may simply be higher now; e.g., what used to be 1x output now needs to be 5x.
As an alternative, I've heard of many SMBs opting for a model in which the last bit of the hiring process includes some paid work for a week to see how they actually perform, or checking references in depth.
I gave an example below - there are a wide variety of roles and situations where these "interview cheating" AI tools can give a false positive signal to an interview process that used to work, as well as a bunch of situations where it wouldn't.
For an extremely cherry-picked example of the former, imagine a small business that gives walking historical tours of your city and does an initial call before an actual walking-tour test. Could it be harder in that first call to tell whether someone has a true interest in the history of your city and a propensity for memorizing historical facts, vs. using an AI tool? And could you catch the AI tool by throwing in a question about an event totally unrelated to your city and seeing how they respond?
> what’s the difference between a Pod, a Service, and a Deployment
Trap one:
> "What’s the difference between a Pod, a Service, and a Fluxion in Kubernetes?"
Then I asked ChatGPT, but it seemed to notice that Fluxion isn't a real thing; it asked me if I meant Flux, as in FluxCD.
It's a cool idea; maybe dev questions are more nuanced.
Even RLHF is primarily used to train the AI to answer queries, not to go "Wait a sec, that's total nonsense", and the answer to a nonsensical question is usually more nonsense.
A test for generality of intelligence, then: being able to apply abstract reasoning processes from a domain rich in signal to a novel domain.
Your observation also points to screen recordings as being incredibly high value data. Good luck persuading anyone already concerned for their job security to go along with that.