[SOLVED] TSD instability 2024-11-08

Storage still has problems, and we had several NFS hangs in the afternoon:

14:45 We are notified of NFS hangs and notice that one of the two protocol nodes has a downed NFS service. We begin restarting it.

14:59 Node is back up again, mounts work.

15:36 We discover that the other protocol node has problems with writes on its NFS exports. We proceed to restart this one as well.

15:46 Node 2 back up, production back to normal.

15:57 NFS is down again on node 1. Restarting services.

16:07 Node back up, production back.

16:15 Again, we discover that the other protocol node hangs on NFS writes, and restart it.

16:20 Both nodes are operational, and neither hangs on write anymore.

 

We are still in dialogue with the third-party vendor on a high-severity case regarding the storage issues we have had these last weeks.

Published Nov. 8, 2024 3:51 PM - Last modified Nov. 8, 2024 4:29 PM