-
Notifications
You must be signed in to change notification settings - Fork 216
Insights: GoogleCloudPlatform/cluster-toolkit
Overview
-
- 27 Merged pull requests
- 9 Open pull requests
- 4 Closed issues
- 0 New issues
Could not load contribution data
Please try again later
3 Releases published by 3 people
-
v1.58.1 v1.58.1 Hotfix: Resolve a3u/a4h slurm nvidia version mismatch error
published
Jul 18, 2025 -
v1.59.0 Release v1.59.0
published
Jul 22, 2025 -
v1.59.1 Release v1.59.1
published
Jul 24, 2025
27 Pull requests merged by 14 people
-
Merge
main -> develop
#4448 merged
Jul 24, 2025 -
Update Slurm image version to 6-10
#4439 merged
Jul 24, 2025 -
Hotfix release. Update Slurm images to
6.9 > 6.10
, Ubuntu20.04 > 22.04
, Debian11 > 12
#4442 merged
Jul 24, 2025 -
Updates the machine type of a4x in README.md file
#4447 merged
Jul 24, 2025 -
Updated reservation_definitions.tf for more precise cpu validation
#4444 merged
Jul 24, 2025 -
Adding bytetwin(hanu) to writers
#4445 merged
Jul 24, 2025 -
Update GKE release channel for a2 high Kueue integ. tests
#4438 merged
Jul 24, 2025 -
update terraform-provider version to 6.45.0
#4434 merged
Jul 23, 2025 -
adding name to cluster-toolkit-writers.json
#4420 merged
Jul 23, 2025 -
Adding new configurations to support IMEX in slurm
#4418 merged
Jul 23, 2025 -
Revert "Minimize tf drift for spot instances during re-deploy"
#4435 merged
Jul 22, 2025 -
Merge v1.59.0 into Develop
#4433 merged
Jul 22, 2025 -
Release candidate: v1.59.0
#4416 merged
Jul 22, 2025 -
Fix typo
start_instance_op
#4431 merged
Jul 22, 2025 -
Minimize tf drift for spot instances during re-deploy
#4430 merged
Jul 22, 2025 -
remove the GKE DWS Flex Start A3U integration tests
#4429 merged
Jul 22, 2025 -
[v2][Bugfix] Applying K8s manifests to GKE clusters via URL
#4352 merged
Jul 22, 2025 -
Change google cloud ops agent log file used for debugging
#4417 merged
Jul 21, 2025 -
Move disabling upgrades in shared file
#4424 merged
Jul 21, 2025 -
Disable upgrades for A* blueprints
#4423 merged
Jul 21, 2025 -
Disable upgrades for service images
#4422 merged
Jul 21, 2025 -
Update Toolkit release to v1.59.0
#4415 merged
Jul 18, 2025 -
Fix nvidia version mismatch for service images
#4413 merged
Jul 18, 2025 -
Merge v1.58.1 Hotfix to Develop
#4412 merged
Jul 18, 2025 -
Freeze accelerator image version to build service images
#4407 merged
Jul 18, 2025 -
Resolve a3u/a4h slurm nvidia version mismatch error
#4409 merged
Jul 18, 2025 -
Changes to Dockerfile for updating workstation image
#4379 merged
Jul 18, 2025
9 Pull requests opened by 7 people
-
Cluster Toolkit VDI Module
#4411 opened
Jul 18, 2025 -
Bump golang.org/x/oauth2 from 0.25.0 to 0.27.0
#4414 opened
Jul 18, 2025 -
Switch to 6-11 slurm image versions
#4425 opened
Jul 21, 2025 -
Hold google services updates in service images and A* blueprints
#4426 opened
Jul 21, 2025 -
Remove ambiguous controller NFS server_ip
#4427 opened
Jul 22, 2025 -
Use setsid resume.py to reduce reconfigure time
#4436 opened
Jul 22, 2025 -
Reduce terraform drift for spot instance config.
#4437 opened
Jul 23, 2025 -
Add optimized gcsfuse configurations to A3 Ultra and A4 blueprints.
#4441 opened
Jul 23, 2025 -
feat: add KMS support for encrypting munge key, JWT key, and DB secrets
#4449 opened
Jul 24, 2025
4 Issues closed by 3 people
-
errors in slurmsync.log
#4419 closed
Jul 23, 2025 -
Old python version causes silent error in cluster setup
#4279 closed
Jul 21, 2025 -
New DWS Flex breaks terraform state
#4079 closed
Jul 21, 2025 -
custom NodeName config
#4341 closed
Jul 19, 2025
9 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
Added capacity checks for reservations
#4372 commented on
Jul 25, 2025 • 4 new comments -
removed all parallelstore modules
#4351 commented on
Jul 24, 2025 • 3 new comments -
Provider produced inconsistent final plan when updating a cluster nodesets
#4272 commented on
Jul 22, 2025 • 0 new comments -
Missing step in uninstalling legacy monitoring agent README
#4349 commented on
Jul 23, 2025 • 0 new comments -
recreate cloud.conf
#4329 commented on
Jul 23, 2025 • 0 new comments -
chore/allow hyphens in partition_name and slurm_cluster_name, increase max length to 20 for slurm_cluster_name
#4316 commented on
Jul 23, 2025 • 0 new comments -
Bump django-extensions from 3.2.3 to 4.1 in /community/front-end/ofe
#4343 commented on
Jul 18, 2025 • 0 new comments -
Dev hybrid
#4385 commented on
Jul 18, 2025 • 0 new comments -
Feat/SlurmHighAvailability
#4386 commented on
Jul 23, 2025 • 0 new comments