


Hey guys,

I'm doing some research and have a question. What do you do when traffic to voice AI agents spikes? As far as I know, the native Kubernetes autoscaler (HPA) often fails to keep up with such spikes, resulting in prohibitively high latency. I'd be glad to hear about your experience, if you don't mind sharing.

Thanks!

Ask HN: How do you keep voice AI latency low during load spikes?