I have considered doing weekly reboots. It would cause unwanted downtime, but on the other hand it is far cleaner and safer to do "shutdown -r now" than "reboot -nf" :) I still might do this -- or both.
Not sure what you mean about using Knoppix. I have no hands on the box. It's 2000 miles away and I don't have out of band management.
I wouldn't say that the OS is failing... In this state, the kernel is fine and processes are still responding (so long as they're not accessing the SSD). Since the daemon is running in memory, then it should be fine. It's the potential EIO failures that I want to detect that are the primary issue and if I can trigger the reboot -nf without any disk I/O then I think it will be an acceptable band-aid until the situation can be resolved permanently.
Thanks!In reply to Re^2: I/O Watchdog Daemon
by IdleResonance
in thread I/O Watchdog Daemon
by IdleResonance
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |