Norwegian version of this page

TSD Operational Log - Page 2

Published Jan. 29, 2024 12:50 PM

Slurm has been restarted on several compute nodes to resolve an issue. Please check the output of your jobs to see if they've been affected.

Published Jan. 26, 2024 10:10 AM

We're currently experiencing issues with some nodes on Colossus. Jobs on these nodes might have crashed and been requeued. Please check the output of your jobs to see if they've been affected.

Published Jan. 26, 2024 8:59 AM

Some users are currently facing issues logging in to the Data Portal to export/import data. The specific error message they encounter is "An unexpected error has occurred which may affect the proper functioning of the application." If you also experience this error while attempting to log in to the Data Portal, please notify us by emailing tsd-drift@usit.uio.no.

Published Jan. 11, 2024 12:10 PM

TSD will be upgrading the storage system, which may cause some instability on the Windows and Linux vms.

Published Jan. 10, 2024 1:35 PM

We've updated our password policy. This change is part of our commitment to enhancing security protocols and safeguarding sensitive information, taking effect on January 8th, 2024.

All TSD users are now required to update their passwords at least once every year. This practice is essential to maintain a high level of security. You may change your password at any time by logging into TSD's Selfservice Portal: https://selfservice.tsd.usit.no/profile/change-password

You will receive an email notification 30 days before your password expiration date, providing sufficient time for a timely update.

Users with over due password changes will be contacted, with the first group of users contacted December 11th, 2023 and requiring a mandatory password change to be completed by January 8th, 2024.

Accounts that have not complied with the password update requirement by the deadline will be temporarily suspended. Access will be restored upon u...

Published Jan. 8, 2024 12:30 PM

ID Porten has logging problem, please follow:

https://status.digdir.no/incidents/ctml93xm9lnh

It impacts both TSD and Nettskjema logins

Published Dec. 22, 2023 9:50 AM

We are currently experiencing some issues with file import through the Data Portal and are looking into the cause of the problem.

Published Dec. 15, 2023 7:58 AM

This affected TSD systems that relied on NFS.

[Update 08:48]

The core problem is resolved and most systems are up again. We are still investigating the reason for the problems, and some system may still have instability.

[Update 11:00]

All systems should work as normal.

Published Nov. 28, 2023 10:11 AM

TSD will be upgrading software on the storage system Thursday, 2023-11-30 from 08:00 CET. We expect storage instability on the Windows and Linux vms throughout the day.

Around 10:00 the storage system will be shut down for an estimated 15min, which means network storage is inaccessible on all TSD hosts (Windows and Linux) as well as on our central services (file import/export, etc). To be on the safe side, please close any programs and log off from your vm prior to the downtime.

A maintenance reservation has been set on Colossus from 08:00. This means any jobs that cannot complete before the downtime will remain pending until after the maintenance completes. They'll resume automatically.

Our automation should fix any file system hangs that may occur, and we will be on standby to fix any remaining issues that do not automatically recover.

Apologies for the short notice, we've been in dialogue with IBM to alleviate storage instabi...

Published Nov. 22, 2023 3:00 PM

TSD will be upgrading software on the storage system tomorrow, 2023-11-17 08:00 - 09:00 CET. Our automation should fix any file system hangs that may occur, and we will be on standby to fix any remaining issues that do not automatically recover. Apologies for the short notice, we've been in dialogue with IBM to alleviate storage instability and want to act on their latest recommendations as fast as possible.

Published Nov. 16, 2023 1:33 PM

IBM will be upgrading software on the storage system tomorrow, 2023-11-17 07:00 - 09:00 CET. This upgrade is being done on short notice to remove bugs that have caused instability. We are taking the opportunity to improve stability as soon as we can, apologies for any inconvenience. Our automation should fix any file system hangs that may occur, and we will be on standby to fix any remaining issues that do not automatically recover.

Published Nov. 9, 2023 9:41 AM

Some users are reporting login issues and problem setting one-time codes. We are working to debug and fix the issue.

Published Nov. 8, 2023 3:39 PM

TSD Internal Publication goes for short maintenance. 

Published Oct. 17, 2023 2:18 PM

Due to an ESS upgrade at 13:30 we're experiencing some storage instability. This affects the internal mirrors (CRAN) too. Some vms will be rebooted in the process.

Published Oct. 15, 2023 8:39 AM

TSD services might be unstable for the moment - we are working to fix it.

Published Oct. 6, 2023 1:18 PM

Dear TSD-users,

At 07:00 the upcoming Tuesday we will be doing upgrades of the databases of our core services, and the databases in the following projects:

p11
p14
p23
p47
p57
p58
p96
p110
p166
p174
p189
p206
p302
p588
p594
p827
p874
p969
p1075
p1859
p2184


If all goes according to plan should be done around 11:00 at the latest.
During this time our services will be partially or fully unavailable.


--
The TSD team

Published Sep. 21, 2023 12:31 PM

The backend of several of our services is down.
This will affect file import and export, publication portal, nettskjema delivery and more.

-- 
TSD

Published Sep. 4, 2023 10:23 AM

We're experience instability with the TSD, affecting the timeliness of Nettskjema attachment delivery, and file import and export. We're working on solving it.

Published Aug. 8, 2023 8:13 AM

TSD Self Service is currently unreachable.

Published July 28, 2023 2:18 PM

The bigmem nodes on Colossus has been reserved for a single project until the end of September, which means that no other projects can use the bigmem partition until then. This has been done on the request of Sigma2, which owns the bigmem nodes.

Published July 7, 2023 4:17 PM

Starting at 09:30  on 2023.07.10 we will be upgrading the databases for our core services.

Due to this our services will at times be partially or fully unavailable at times during this upgrade.

We will update this message as we go along, and notify you when it's done.

 

-- 

On behalf of TSD

Published July 6, 2023 8:16 AM

We are currently experiencing instability in access to storage for multiple projects, affecting all services.

We are still investigating the problems.

-----

Update 09:05:

We have remounted the storage for the affected machines, and they seem to work now.

The instability affected around 30 projects from around 4am this morning.

-----

Update 10:00:

There are still reports of instability, and we will investigate further.

-----

Update 2023-07-07:

The reason for the instability was found and addressed yesterday. All systems should have worked normally since about 11am yesterday.

Published June 29, 2023 10:50 AM

Maintenance is being performed on our storage systems. We expect minimal issues. Some linux hosts may need to be rebooted. 

Published June 8, 2023 10:50 AM

Any paths under /cluster (e.g. software and projects) are unavailable. This affects software modules and project areas on Linux submit hosts (and other hosts with a /cluster directory). The cluster directory can still be reached via /tsd/pxx/cluster instead.

Published June 6, 2023 1:31 PM

SCCM group will be upgrading internal SCCM-site database in TSD on Thursday 2023-06-08 Software Center on all Windows VMs in TSD will be unavailable between 12:00 and 16:00.