...
Date | Outage duration | Cause | Official link | ChemIT notes |
---|---|---|---|---|
2/27/2014 | 2 minutes | Procedural error? | No link to info in 3/3/14 email? | Michael led our effort to initially evaluate and restore systems, with Oliver adding to documentation and to-do's. Lulu completed the cluster restoration efforts. |
1/27/2014 | 17-19 minutes | ? | Lulu, Michael, and Oliver shut down headnodes head nodes and other systems which were on UPS. (Those systems non UPS shut down hard, per usual.) | |
12/23/2013 | 2 minutes | Procedural error? | Terrible timing, right before the longest staff holiday of the year. | |
7/17/13 | Half a morning (~2 hours) |
|
|
...
Assuming protection for 1-3 minutes MAXIMUM:
Do all headnodes head nodes and stand-alone computers in 248 Baker Lab
- Started getting done. About $170/ headnode every ~4 years.
CCB headnodes head nodes' UPS status:
Cluster | Done | Not done | Notes |
---|---|---|---|
Collum | X |
| |
Lancaster, with Crane (new) | X |
| Funded by Crane. |
Hoffmann | X |
|
|
Scheraga | X |
| See below chart for s4 tand-alone computational computers |
Loring |
| X | Unique: Need to do ASAP |
Abruna |
| X | Unique: Need to do ASAP |
C4 Headnode: pilot | X |
| Provisioned on the margin, since still a pilot. |
Widom |
| X | See "C4", above |
...
Cluster | Compute node count | Power strip equivalents | Cost estimate, | Notes |
---|---|---|---|---|
Collum | 8 ? | 1 | $900 | |
Lancaster, with Crane (new) | 10 12? | 2 | $1.8K | |
Hoffmann | 19 14? | 2 | $1.8K |
|
Scheraga | 91 92? | 13 | $11.7K | |
Loring | 4 6? | 1 | $900 | |
Abruna | 6? 9 | 1 | $900 | |
C4 Headnodehead node: pilot | N/A | N/A | N/A | This CCB Community headnode head node pilot has no compute nodes of its own. |
Widom | 2 | 1 | ? | Compute nodes are hanging off of "C4" head node, above. |
TOTALS | ~140? | 21 | $18K + Widom |
|
...
- 8 switches support the clusters (out of 12 in the room).
Start headnodeshead nodes, if not on already
- Only a few are on UPS. Those can obviously be left on.
- None should be set to auto-start on power-off.
Confirm headnodes head nodes accessible via SSH
PuTTY on Windows
...