[SOLVED] TSD instability 2024-11-08
Storage still have problems, and had a few events with NFS hangs in the afternoon:
14:45 We're notified of NFS-hangs, and notice one of two protocol nodes has a downed NFS service. Commence to restart it.
14:59 Node is back up again, mounts work.
15:36 Discovers the other protocol node have problems with write on its NFS exports. Proceed to restart this one as well.
15:46 Node 2 back up, production back to normal.
15:57 NFS went down again on node 1. Restarting services
16:07 Node back up, production back.
16:15 Again, discover that the other protocol node hangs NFS-hangs on write, and restart.
16:20 Both nodes are operational, and neither have hangs on write anymore.
We're still conversing with 3rd party vendor in a high severity case regarding the issues we've had with storage these last weeks.