feat: Add autoscaling_target_dcgm_fi_dev_gpu_util, autoscaling_target_vllm_gpu_cache_usage_perc, autoscaling_target_vllm_num_requests_waiting options in model deployment on Endpoint & Model classes. #6270
Google CLA / cla/google
succeeded
Jan 22, 2026 in 2s
✅ All contributors are covered under a CLA with Google
See https://cla.developers.google.com/ for more info about Google's Contributor License Agreement (CLA).
ℹ️ Googlers: Go here to view more details and manage scans for this pull request.
Details
The following contributors were found for this pull request:
✅ 0179aa5 PR Opener: @copybara-service[bot]
✅ 0179aa5 Author: @vertex-sdk-bot <vert*******bot@google.com>
(Only the first commit for a unique contributor is listed.)
Loading