问HN:谁从“人工智能代理”中获得了实际价值?
我看到YC的一篇帖子:“从OpenAI的DeepResearch到xAI的DeepSearch,我们正在看到向自主工具的首次真正推动,这些工具能够以最少的人类输入来规划、执行和完成诸如研究、外联和编码等任务。”
这让我思考,这些AI代理实际上对任何人有用或有价值吗?
我对技术相当了解,使用AI来生成代码,以加快POC项目的进展。不过,我并不支持盲目地“随意编码”,我认为这很危险,而且在后期尝试修复大型项目时可能会变得更慢。从其他讨论来看,许多HN的用户和我有类似的看法——也就是说,使用AI进行代码生成确实有价值,但让一个“代理”只是“为你工作”可能并不那么有用。
不过,关于研究和外联,大家对此有何看法?有没有人发现这些用例有用?我看到很多对这些深度研究产品的批评。它们似乎类似于总结谷歌搜索结果第一页的链接,而且充满了在严肃研究中你可能想要忽略的来源(例如,reddit帖子)。所以我对它们的质量持怀疑态度。
我认为另一个可能有用的主题是将大型语言模型(LLMs)用作更高级/更简单的RPA工具。基本上,可以将其视为基于不同上下文的灵活RPA,这些上下文可以通过文本捕捉。也许这就是所谓的“MCP”热潮的全部内容。
所以我很想知道,谁在使用可以算作“AI代理”的东西(即超越文本聊天/提示的东西),并从中获得真正的价值?
查看原文
I saw this post from YC: "From OpenAI’s DeepResearch to xAI’s DeepSearch, we’re seeing the first real push toward autonomous tools that can plan, execute, and complete tasks like research, outreach, and coding with minimal human input."<p>It got me thinking, are these AI Agents actually useful or valuable to anyone?<p>I'm fairly technical and I use AI to generate code to make POC projects go faster. I'm not on board with blindly 'vibe coding' though, I think it's dangerous and potentially slower when you get screwed over later trying to fix a large project. I think from other discussions, many on HN are in a similar boat as me - that is, using AI for code generation is certainly valuable, but having an 'agent' just 'work' for you probably isn't.<p>But anyways, research and outreach, how about those, anyone finding use for these use cases? I've seen a ton of criticism on these deep research products. They seem akin to summarizing the first page of links of a google search, and they're full of sources you'd want to ignore anyways for serious research (e.g. reddit posts). So I'm dubious as to their quality.<p>Another theme that I think could be useful is using LLMs as advanced/easier RPA tools. Basically, think RPA with some flexibility based on different contexts that can be captured via text. Maybe this is what this 'MCP' hype is all about.<p>So I'm very curious, who's using what would count as 'AI Agents' (i.e. something more than text chat/prompts), and getting real value from them?