Create and manage containerized Slurm clusters for development, testing, and CI/CD workflows.
🚀 Getting Started 📖 Documentation
Inspired by kind (Kubernetes in Docker), sind offers a familiar CLI experience for quickly spinning up and tearing down Slurm clusters.
- Multi-node, multi-cluster & multi-realm — run controller, submitter, and worker nodes side by side, or spin up multiple clusters across isolated realms with shared networking
- System containers — full systemd-based nodes that emulate bare metal, compatible with Ansible, Chef, and other config management tools
- Designed for CI/CD — runs rootless on standard GitHub Actions runners; sind-action sets up clusters in a single step
- Worker lifecycle — dynamically add and remove worker nodes from running clusters
- Power cycle simulation — shutdown, reboot, freeze, and power-cycle nodes to simulate real-world failure scenarios
- Minimal dependencies — just Docker and a sind container image; usable as both a CLI tool and a Go library
- AI-ready via MCP — built-in MCP server lets AI assistants manage your Slurm clusters
Parts of this codebase were developed with the assistance of AI tools. All contributions are reviewed by humans.
sind is licensed under the GNU Lesser General Public License v3.0.
Copyright © GSI Helmholtzzentrum für Schwerionenforschung GmbH