Slurm is a job scheduler designed for AI, machine learning, and HPC workloads.
It supports not only small to large-scale HPC cluster systems but also today’s GPU-based cluster environments.
In particular, Slurm has expanded its feature set for NVIDIA GPUs and has become the standard job scheduler for modern GPU clusters used in AI and machine learning.
It optimizes workloads for high-performance systems, ensuring maximum performance and efficient resource utilization.
Slurm provides the tools and features needed to effectively manage systems.
From job submission and monitoring to resource allocation and accounting, Slurm offers a complete set of capabilities for comprehensive workload management.
With advanced scheduling algorithms, Slurm efficiently allocates resources to maximize utilization.
It balances workloads across the entire cluster system, minimizing idle time and maximizing throughput.
Slurm includes powerful monitoring and reporting capabilities.
It provides real-time insights into performance and utilization, allowing users to review detailed metrics and further optimize workloads.
Optional plugins enable features such as accounting, resource limits, SLA/QoS management, advanced reservations, and backfill scheduling.
Consulting services for plugin configuration and customization are available upon request.
In environments where a large number of jobs are submitted and both large and small jobs coexist, large jobs may fail to start because their required resources cannot be secured.
Slurm addresses these issues through a variety of features, enabling efficient job scheduling that maximizes resource utilization.
By using Slurm, users can benefit from shorter computation times, fewer user errors, and reduced operational costs.
Our configuration support services can help you optimize your setup for maximum efficiency.
Pacific Teck, in partnership with SchedMD, provides commercial support for the Slurm Workload Manager.
Although Slurm is an open-source solution, over 95% of its source code is written by SchedMD, allowing them to deliver essential commercial support for secure and continuous system operation — including the provision of patch files in the event of system issues and responses to CVEs (security vulnerabilities).
Pacific Teck serves as the exclusive point of contact in Japan for Slurm commercial support.
When technical issues arise that require fixes to Slurm itself, Pacific Teck escalates them directly to SchedMD, the core developer of Slurm, to ensure proper resolution.
f the job scheduler the core of your system were to stop functioning, it could result in costly downtime and significant operational loss. With Slurm Commercial Support, you can rest assured knowing that professional assistance from Slurm experts is available even in critical situations.
SchedMD is a U.S. based company that serves as the primary developer of Slurm, having written over 95% of its source code.
In addition to Slurm development, SchedMD provides technical support in the event of system issues, as well as training for users, administrators, and system integrators.
Pacific Teck serves as the exclusive point of contact in Japan for Slurm commercial support, handling technical issues related to Slurm by escalating them to SchedMD, the core developer of Slurm, for resolution.
Slurm Commercial Support provides assistance according to the severity of the issue experienced by the customer.
A system failure is ongoing and affects a large number of end users.
The system is down or unusable due to a Slurm-related issue, and no workaround is available.
A major issue is causing intermittent outages or serious system malfunctions.
The problem consistently affects end users and significantly disrupts normal system usage.
An issue is causing partial or non-critical loss of system access, or certain operations within the system are affected.
Although the problem has a moderate to significant impact, end users can continue to use the system through available workarounds.
A minor issue has occurred with little to no loss of functionality in the customer’s environment.
(This level also applies to configuration questions or cases involving test or non-production systems.
Severity 4 issues may also be used to submit requests for future product improvements or enhancements.)
Pacific Teck Japan
Monday to Friday, 9:00 AM – 5:00 PM (JST)
(Excluding Japanese national holidays and the year-end/New Year holidays)
SchedMD (US and Europe)
Monday to Friday, 2:00 AM – 5:00 PM MST (Equivalent to 5:00 PM – 9:00 AM Japan time)
Excluding U.S. national holidays and the year end/New Year holidays.
Because business hours in Europe and the U.S. begin as Japan’s working hours end,
support coverage is available nearly 24 hours a day, except from Saturday 9:00 AM to Monday 9:00 AM (JST), and excluding holidays and the year-end/New Year period.
*Note: Daylight Saving Time (DST) is observed in the U.S. and Europe.
During DST (beginning in March), support hours shift one hour later, and after DST ends in November, they return one hour earlier. Please contact us if you would like details on DST schedules for a given year.
Training sessions are conducted by SchedMD engineers, either on-site or online. Content can be customized upon request to suit the needs of end users or system administrators.
To help you make the most of Slurm, our experts provide configuration assistance and guidance on system setup tailored to your specific goals.
Our Slurm experts can assist with requests such as adding new features or extending functionality in Slurm.
Support plans can be customized to meet your specific requirements. Please feel free to contact us to discuss your system configuration or any other requests.