Cerebras job scheduling and monitoring#
All jobs submitted to the Cerebras cluster are queued and assigned to resources in a first come first served basis.
You can monitor the Cerebras Wafer-Scale Cluster using these tools:
csctl: CLI tool for job monitoring: With this tool you can inspect submitted jobs, assign labels, get the mounted volumes and export logs.
Cluster monitoring with Grafana: Grafana dashboard shows job resource usage and possible software and hardware errors relevant to this job.
Integration with Slurm: Light-weight integration of Cerebras Wafer-Scale cluster with slurm
Resource requirements for parallel training and compilation: with limits on Memory and CPU usage inside the Cerebras Wafer-Scale cluster.