KVarN: Native vLLM KV-cache quantization back end by Huawei

· Hacker News