Excerpt |
---|
Too many power outages in Baker Lab! |
Date | Outage duration | Cause | Official link | ChemIT notes |
---|---|---|---|---|
|
|
|
|
|
1/27/2014 | 17-19 minutes | ? | http://www.it.cornell.edu/services/alert.cfm?id=3040![]() | Lulu, Michael, and Oliver shut down headnodes and other systems which were on UPS. (Those systems non UPS shut down hard, per usual.) |
12/24/2013 | Seconds to minutes? | Human error? |
| Terrible timing, right before the longest staff holiday of the year. |
Question: When power is initially restored, do you trust it? Or might it simply kick back off in some circumstances?
- Because we don't know the answer to this question following any specific power outage, we are reluctant to turn back on servers right away. Instead, we like to wait ~10-25 minutes.
Reboot switches
- 8 switches support the clusters (out of 12 in the room).
...