Super simple "hallucination traps" to detect interview cheaters

6 | Author: EliotHerbst | 13 days ago
After testing out Cluely with my team, we suspect that the easiest way to detect interview cheaters is to set simple "hallucination traps": ask a question that sounds plausible, but that any knowledgeable person would instantly identify as a joke or a fake, or would simply answer by saying they don't know. I vibe-coded a simple app demonstrating the concept: https://beatcluely.com/

Here are some examples of this class of prompts, which currently work on Cluely and even cause strong models like o4-mini-high to hallucinate, even when they can search the web:

https://chatgpt.com/share/6865d41a-c720-8005-879b-d28240534751
https://chatgpt.com/share/6865d450-6760-8005-8b7b-7bd776cff96b
https://chatgpt.com/share/6865d578-1b2c-8005-b7b0-7a9148a40cef
https://chatgpt.com/share/6865d59c-1820-8005-afb3-664e49c8b583
https://chatgpt.com/share/6865d5eb-3f88-8005-86b4-bf266e9d4ed9

Link to the vibe-coded code for the site: https://github.com/Build21-Eliot/BeatCluely
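To make the trap idea concrete, here is a rough Python sketch of how an interviewer might flag a suspicious answer. This is not the BeatCluely code; the `TrapQuestion` type, the `looks_like_hallucination` helper, the marker phrases, and the "Zorblatt-Quimby" policy are all made up for illustration. It assumes that an honest candidate will push back or admit ignorance, while an AI-assisted one will confidently explain the fake thing.

```python
from dataclasses import dataclass

# Hypothetical phrases suggesting the candidate refused to bluff.
# A real interviewer would judge this by ear; the list is only illustrative.
UNCERTAINTY_MARKERS = (
    "i don't know",
    "i do not know",
    "never heard of",
    "not familiar",
    "doesn't exist",
    "does not exist",
    "not a real",
    "made up",
)


@dataclass
class TrapQuestion:
    """A plausible-sounding question about something that does not exist."""
    prompt: str


def looks_like_hallucination(answer: str) -> bool:
    """Return True when the answer engages with the fake premise
    without any sign of doubt (a crude substring heuristic)."""
    normalized = answer.lower().replace("\u2019", "'")  # normalize curly apostrophes
    return not any(marker in normalized for marker in UNCERTAINTY_MARKERS)


# The "Zorblatt-Quimby" policy is invented for this example.
trap = TrapQuestion(
    prompt="How would you tune the Zorblatt-Quimby eviction policy "
           "for a write-heavy cache?"
)

bluff = "The Zorblatt-Quimby policy works best with a window of 64 entries."
honest = "I've never heard of that. Is it a real thing?"

print(looks_like_hallucination(bluff))
print(looks_like_hallucination(honest))
```

Substring matching is obviously crude, and a candidate can bluff without any tool, so a hit is at most a prompt for a follow-up question rather than proof of cheating.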