超级简单的“幻觉陷阱”来识别面试作弊者
在与我的团队测试 Cluely 后,我们怀疑检测面试作弊者的最简单方法是设置一些简单的“幻觉陷阱”,即提出一个听起来合理的问题,但任何有知识的人都会立即识别出这是一个笑话、虚假信息,或者干脆说他们不知道。Vibe 编写了一个简单的应用程序来演示这个概念 - https://beatcluely.com
以下是一些当前在 Cluely 上有效的提示示例,甚至会导致像 o4-mini-high 这样的强大模型产生幻觉,即使它们可以搜索网络:
https://chatgpt.com/share/6865d41a-c720-8005-879b-d28240534751
https://chatgpt.com/share/6865d450-6760-8005-8b7b-7bd776cff96b
https://chatgpt.com/share/6865d578-1b2c-8005-b7b0-7a9148a40cef
https://chatgpt.com/share/6865d59c-1820-8005-afb3-664e49c8b583
https://chatgpt.com/share/6865d5eb-3f88-8005-86b4-bf266e9d4ed9
网站的 Vibe 编码代码链接:https://github.com/Build21-Eliot/BeatCluely
查看原文
After testing out Cluely with my team, we suspect that the easiest way to detect interview cheaters is to set simple "hallucination traps" where you ask a question that sounds plausible, but any knowledgeable person would instantly identify as a joke, fake, or just simply say they don't know. Vibe coded a simple app demonstrating the concept - https://beatcluely.com/<p>Here are some examples of this class of prompts which currently work on Cluely and even cause strong models like o4-mini-high to hallucinate, even when they can search the web:<p>https://chatgpt.com/share/6865d41a-c720-8005-879b-d28240534751
https://chatgpt.com/share/6865d450-6760-8005-8b7b-7bd776cff96b
https://chatgpt.com/share/6865d578-1b2c-8005-b7b0-7a9148a40cef
https://chatgpt.com/share/6865d59c-1820-8005-afb3-664e49c8b583
https://chatgpt.com/share/6865d5eb-3f88-8005-86b4-bf266e9d4ed9<p>Link to the vibe-coded code for the site: https://github.com/Build21-Eliot/BeatCluely