Cauchy: A Cost-Efficient LLM Serving System through Adaptive Heterogeneous Deployment
Published in ACM Symposium on Cloud Computing (SoCC), 2025
Recommended citation: To be Determined http://to-be-determined
Published in ACM Symposium on Cloud Computing (SoCC), 2025
Recommended citation: To be Determined http://to-be-determined