Umbrella Maintenance 2025 Q3

25

Aug

-

27

Aug

TU/e Umbrella HPC Cluster has a scheduled downtime for maintenance from Monday 25 August 09:00 CET to Wednesday 27 August 17:00 CET. The cluster will be unavailable during this time. Please make sure that your jobs are finished before the start of the maintenance or that they can continue after they were (hard) killed/cancelled.

All running Jobs on Monday 25 August 2025 09:00 will be cancelled!

There are no backups on the HPC cluster — do not use it for archiving. You are responsible for your own data management!

Important Changes After Maintenance

Required Job Script Updates

Memory Requirement

After this maintenance, all jobs must specify how much memory they require. If you do not specify the memory, your job will receive 1 GB RAM per CPU core by default.

Update your SLURM job scripts to include one of the following:

#SBATCH --mem=10G
#SBATCH --mem-per-cpu=2G
#SBATCH --mem-per-gpu=2G  # This is CPU RAM, not VRAM!

ℹ️ This change helps prevent jobs from being killed unexpectedly when another job uses too much memory.

Maximum Run Time

After maintenance, jobs without a specified maximum run time will be automatically limited to 1 hour. Add this line to your job script if you need more time:

#SBATCH --time=<hh:mm:ss>

ℹ️ This change avoids “runaway” jobs and makes scheduling more efficient.

What happens if I don’t update my script?

Jobs without a memory request may fail or be killed if they use more than the default amount of memory.
Jobs without a run time will automatically stop after 1 hour.

🎉 New Feature: Single Sign-On

You will be able to log in to hpc.tue.nl (Open OnDemand) using TU/e Single Sign-On. No more separate username/password needed for the web interface.

ℹ️ SSH access remains unchanged: keep using your TU/e username & password for SSH.

Technical & Security Updates

Latest updates and patches to Rocky Linux 8 will be installed.
Security fixes and firmware upgrades will be applied across all nodes, improving reliability and safety.

What You Need To Do

Finish your jobs before the maintenance to avoid interruptions.
Update your job scripts to specify both required memory and run time.
Do not use the cluster during the maintenance window.
After maintenance, check that your applications and code run as expected.

Questions After Maintenance

If you encounter any issues after the maintenance window, with which you would like assistance, please let us know. We can be reached by e-mail and through Teams.

Immediately after the maintenance we'll also be available in person.

Dates: Thu 28 August – Fri 5 September
Times: 10.00–12.00, 13.00–15.00
Location: EAISI office (Neuron building, room 1.105)
Find the "Eindhoven Supercomputing Center" banner on the first floor!

February 16, 2026

Umbrella
maintenance

Umbrella Maintenance 2026 Q1

16

Feb

-

18

Feb

The TU/e Umbrella HPC Cluster will be undergoing scheduled maintenance, from: Monday 16 February 2026, 09:00 CET to Wednesday 18 February 2026, 17:00 CET.

The entire cluster will be offline during this period. Please make sure your jobs finish before the maintenance starts, or that they can safely be interrupted and rerun.

All running jobs on Monday 16 February 2026 09:00 will be cancelled/killed!

Continue reading
03

Mar

-

05

Mar

Mar 03, 2025
Umbrella
maintenance

Umbrella Maintenance 2025 Q1

26

Aug

-

28

Aug

Aug 26, 2024
Umbrella
maintenance

Umbrella Maintenance 2024 Q3

12

Feb

-

14

Feb

Feb 12, 2024
Umbrella
maintenance

Umbrella Maintenance 2024 Q1

07

Aug

-

09

Aug

Aug 07, 2023
Umbrella
maintenance

Umbrella Maintenance 2023 Q3

Umbrella Maintenance 2025 Q3

Important Changes After Maintenance

Required Job Script Updates

Memory Requirement

Maximum Run Time

What happens if I don’t update my script?

🎉 New Feature: Single Sign-On

Technical & Security Updates

What You Need To Do

Questions After Maintenance

RELATED ITEMS