Skip to content

Conversation

@gqiu-sycomp-com
Copy link
Contributor

Provide 4 blueprint examples to create/expand Sycomp Scale Storage and access Sycomp Scale Storage by NFS.

Submission Checklist

NOTE: Community submissions can take up to 2 weeks to be reviewed.

Please take the following actions before submitting this pull request.

  • Fork your PR branch from the Toolkit "develop" branch (not main)
  • Test all changes with pre-commit in a local branch #
  • Confirm that "make tests" passes all tests
  • Add or modify unit tests to cover code changes
  • Ensure that unit test coverage remains above 80%
  • Update all applicable documentation
  • Follow Cluster Toolkit Contribution guidelines #

@gqiu-sycomp-com gqiu-sycomp-com requested review from a team and samskillman as code owners October 28, 2025 20:09
@gemini-code-assist
Copy link
Contributor

Summary of Changes

Hello @gqiu-sycomp-com, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the Google Cluster Toolkit by introducing comprehensive support for the Sycomp Intelligent Data Storage Platform. It provides users with a suite of new blueprints and updated documentation, enabling seamless deployment, expansion, and integration of Sycomp Scale Storage clusters, including direct NFS access for SLURM environments, all within Google Cloud Platform. This addition aims to streamline the setup of high-performance storage solutions for HPC workloads.

Highlights

  • New Sycomp Storage Blueprints: Four new example blueprints have been added to support the deployment and expansion of Sycomp Intelligent Data Storage Platform clusters on Google Cloud Platform. These include blueprints for a standard 3-node cluster, an ECE (Erasure Code Edition) 7-node cluster, an expansion blueprint, and a combined SLURM and Sycomp Storage cluster.
  • SLURM Integration with Sycomp Storage: A dedicated blueprint (sycomp-storage-slurm.yaml) is now available to deploy a SLURM cluster alongside a Sycomp Storage cluster, configuring SLURM compute nodes as NFS clients to access the Sycomp Storage filesystem.
  • Updated Network Storage Documentation: The docs/network_storage.md file has been updated to include Sycomp Intelligent Data Storage Platform as a supported network storage option within the Google Cluster Toolkit, providing a link to its new README.
  • Sycomp Scale Module Documentation: New documentation for the sycomp-scale module has been added, detailing its purpose and referencing the new example blueprints for deploying Sycomp Storage on GCP.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for Sycomp Intelligent Data Storage Platform by adding four new example blueprints and associated documentation. The changes are well-structured and provide clear examples for deploying, expanding, and using Sycomp storage with a SLURM cluster. My review includes several suggestions to fix typos and improve formatting in the documentation and YAML comments for better clarity and correctness. I've also raised a concern about a documentation change that may refer to an unimplemented feature.

@cboneti cboneti self-requested a review October 29, 2025 04:22
@cboneti cboneti self-assigned this Oct 29, 2025
@cboneti
Copy link
Member

cboneti commented Oct 30, 2025

/gemini review

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for Sycomp storage by adding four example blueprints and associated documentation. The examples are well-structured, covering various deployment scenarios including initial setup, expansion, ECE configuration, and SLURM integration. My review focuses on enhancing the clarity and correctness of the documentation to improve the user experience. I have identified a typo and suggested improvements to make the instructions more clear and user-friendly.

Copy link
Contributor

@sarthakag sarthakag left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Left some comments related to better naming of deployments. PTAL.

@cboneti cboneti assigned gqiu-sycomp-com and unassigned cboneti Oct 30, 2025
@cboneti
Copy link
Member

cboneti commented Oct 30, 2025

please assign to me once comments are addressed. Thanks!

- Fix typo and grammer error
- Rename deployments name to something more relevant to sycomp
@gqiu-sycomp-com
Copy link
Contributor Author

/gemini review

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for Sycomp Intelligent Data Storage Platform by adding four example blueprints and associated documentation. The blueprints demonstrate how to deploy a new storage cluster, expand an existing one, use the Erasure Code Edition, and integrate with a SLURM cluster via NFS. The changes are well-structured and follow the project's conventions. My review includes one suggestion to improve the clarity of the documentation for a better user experience.

- Remove hardcode in the command
@gqiu-sycomp-com
Copy link
Contributor Author

/gemini review

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for Sycomp Intelligent Data Storage Platform by adding four new blueprint examples and associated documentation. The changes are well-structured and provide clear examples for users. My review includes a few suggestions to improve correctness and clarity. Specifically, I've identified a file extension inconsistency in the documentation, two blueprints where a required name_prefix is missing which would likely cause deployment failure, and one blueprint with a hardcoded value that could be improved for better usability. These are straightforward fixes that will enhance the quality of the examples.

- Fix typo of file name
- Add instructions for name_prefix
- Set the name_prefix for slurm and ece example
@gqiu-sycomp-com
Copy link
Contributor Author

/gemini review

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for Sycomp Intelligent Data Storage Platform by adding four new example blueprints. These blueprints demonstrate how to deploy a new Sycomp storage cluster, expand an existing one, use the Erasure Code Edition (ECE), and integrate it with a SLURM cluster. The implementation correctly sources the necessary Terraform modules directly from Sycomp's GitLab repository. The accompanying documentation is clear and helpful, though I've suggested a minor correction to a command example to prevent user confusion.

- Fix filename error in example command
@gqiu-sycomp-com
Copy link
Contributor Author

/gemini review

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for the Sycomp Intelligent Data Storage Platform by adding four example blueprints and accompanying documentation. The blueprints cover deploying a standard storage cluster, an Erasure Code Edition (ECE) cluster, expanding an existing cluster, and integrating with a SLURM cluster using NFS. The examples are well-structured and follow existing patterns in the toolkit. The review identified two issues in the documentation: a minor discrepancy in the number of servers for the SLURM example, and a more significant issue where module parameter documentation is located in a private repository, hindering usability for the community. Once these documentation issues are addressed, this will be a valuable addition for Sycomp customers using the HPC Toolkit.

- Fix server number in README.md
@gqiu-sycomp-com
Copy link
Contributor Author

/gemini review

@gqiu-sycomp-com
Copy link
Contributor Author

/gemini review

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for Sycomp Intelligent Data Storage Platform by adding four example blueprints and associated documentation. The blueprints cover basic deployment, deployment with Erasure Code Edition (ECE), cluster expansion, and integration with a SLURM cluster via NFS. The changes are well-structured, and the documentation is clear. The use of an external, version-pinned git repository for the Terraform modules is noted. My review found one area for improvement in the documentation to enhance user experience when expanding a cluster.

@gqiu-sycomp-com gqiu-sycomp-com removed their assignment Oct 31, 2025
@gqiu-sycomp-com
Copy link
Contributor Author

Hello, everybody, Could you please review the code I updated for me again? thank you!

@cboneti cboneti self-assigned this Nov 3, 2025
@cboneti cboneti assigned gqiu-sycomp-com and unassigned cboneti Nov 3, 2025
cboneti
cboneti previously approved these changes Nov 3, 2025
@cboneti cboneti assigned cboneti and unassigned gqiu-sycomp-com Nov 3, 2025
@cboneti cboneti added release-key-new-features Added to release notes under the "Key New Features" heading. release-new-modules Added to release notes under the "New Modules" heading. labels Nov 3, 2025
@cboneti cboneti enabled auto-merge November 3, 2025 17:22
@mufaqam-gcl mufaqam-gcl self-requested a review November 3, 2025 17:36
mufaqam-gcl
mufaqam-gcl previously approved these changes Nov 3, 2025
@cboneti
Copy link
Member

cboneti commented Nov 3, 2025

/gcbrun

@cboneti cboneti disabled auto-merge November 4, 2025 13:40
- use "use" in sycomp-storage-slurm instead of defining "network_storage"
- Put the four blueprint examples related to sycomp into the exclude list of unittest
@gqiu-sycomp-com gqiu-sycomp-com dismissed stale reviews from mufaqam-gcl and cboneti via a000341 November 4, 2025 16:16
@cboneti
Copy link
Member

cboneti commented Nov 4, 2025

/gcbrun

@cboneti
Copy link
Member

cboneti commented Nov 4, 2025

/gemini review

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for Sycomp Intelligent Data Storage Platform by adding four example blueprints and associated documentation. The changes are well-structured and the documentation is comprehensive. My review includes two suggestions for improvement: one to enhance clarity in the README to prevent user error, and another to improve the security posture of the sycomp-storage-slurm.yaml example by changing a default setting related to NFS security.

@cboneti cboneti enabled auto-merge November 4, 2025 19:58
@cboneti cboneti merged commit c5c9299 into GoogleCloudPlatform:develop Nov 4, 2025
15 of 72 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

release-key-new-features Added to release notes under the "Key New Features" heading. release-new-modules Added to release notes under the "New Modules" heading.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants