feat: Add autoscaling_target_dcgm_fi_dev_gpu_util, autoscaling_target_vllm_gpu_cache_usage_perc, autoscaling_target_vllm_num_requests_waiting options in model deployment on Endpoint & Model classes. #6270

✅ All contributors are covered under a CLA with Google

See https://cla.developers.google.com/ for more info about Google's Contributor License Agreement (CLA).

@vertex-sdk-bot

The following contributors were found for this pull request:

✅ 0179aa5 PR Opener: @copybara-service[bot]
✅ 0179aa5 Author: @vertex-sdk-bot <vert*******bot@google.com>

(Only the first commit for a unique contributor is listed.)