-
Notifications
You must be signed in to change notification settings - Fork 947
feat: replaced vm runner with test gpu arc from cncf #3067
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
+330
−318
Merged
Changes from all commits
Commits
Show all changes
30 commits
Select commit
Hold shift + click to select a range
bcc35bf
test: gpu based ARC from cncf instead of stanalone vm
jaiakash f392c21
fix: removed label for gpu e2e test
jaiakash 2959582
fix: remove delete cluster action
jaiakash d20c893
fix: rm seperate script, rm path and nvidia smi command for testing
jaiakash 71b83ff
fix: using nvkind with sudo
jaiakash 1fb948c
fix: move nvkind from noexec to local/bin
jaiakash ec90b13
tmp: install nvkind as from arc is not wroking
jaiakash 1c9cf3a
fix: run the commands as sudo (check https://github.com/NVIDIA/nvkind…
jaiakash c17e0c0
fix: nvkind as sudo
jaiakash 94a0da0
test: ignore the patch error
jaiakash f15ca1c
fix: downgrade the version for nvidia ctk
jaiakash 2983295
add: service restart
jaiakash fa261f1
fix: patch version for nctk
jaiakash 5fee385
fix: command
jaiakash 62cd2fb
refactor: split into different script
jaiakash 04edc60
chore: transfer to legacy mode
jaiakash d93b29d
fix: legacy mode for nctk
jaiakash a2106ef
chore: downgrade nctk
jaiakash c300fa7
fix: non root kubectl
jaiakash 0f81f11
fix: helm dirs
jaiakash db14744
chore: refactored the code
jaiakash 27eedb9
chore: rm seperate script, migrated to prod gpu arc, and other misc
jaiakash facda40
chore: rm sudo for kind
jaiakash cd59f0c
add: wait fo qwen to complete
jaiakash 36ec784
chore: fix qwen nb and fix gpu operator to use host driver
jaiakash e27657c
test: use default value for ntk toolkit
jaiakash ef72d9e
hotfix: patch CRDs to run on GPU nodes (Check #3067)
jaiakash f43c8a0
fix: the patching to mdoel and initializers
jaiakash f742201
fix: single node for gpu e2e test
jaiakash 45af997
revert: jax eg
jaiakash File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,3 +1,4 @@ | ||
| torchao>=0.9.0 | ||
| torchtune==0.6.1 | ||
| bitsandbytes>=0.41.1 | ||
| kagglehub>=0.4.0 | ||
|
Member
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We need to spefically add Error: |
||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.