Ask HN: How do you keep voice AI latency low during load spikes?

1 point by didro 3 days ago
Hey guys,

I'm doing some research and have a question. What do you do when traffic to voice AI agents spikes? As far as I know, Kubernetes' native autoscaler (HPA) often can't keep up with such spikes, resulting in prohibitively high latency. I'd be glad to hear about your experience if you don't mind sharing.

Thanks!