Cluster built on Widom's headnode. 1 headnode and xx compute nodes.
See also
Differences from other clusters
Duo has been installed but is not currently enabled.
- Chemistry IT staff only: /etc/ssh/sshd_config is still the original; it must be modified for Duo to work (see the Duo documentation).
Password lockout is enabled: an account is locked after 20 incorrect password attempts and is unlocked automatically after 600 seconds.
Iptables is enabled.
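The password lockout policy above can be illustrated with a small model. This is a minimal sketch, not the mechanism actually configured on the headnode (the source does not say how the lockout is implemented). The names `AccountState`, `is_locked`, and `record_attempt` are hypothetical. It also assumes that a successful login resets the failure counter and that the lock starts at the 20th failure.

```python
from dataclasses import dataclass
from typing import Optional

MAX_FAILURES = 20     # incorrect passwords before lockout (from the policy above)
UNLOCK_SECONDS = 600  # lock duration in seconds (from the policy above)


@dataclass
class AccountState:
    """Hypothetical per-user record of failed logins."""
    failures: int = 0
    locked_at: Optional[float] = None  # time of lockout, or None if not locked


def is_locked(state: AccountState, now: float) -> bool:
    """Return True while the lock is active; clear the lock once it has expired."""
    if state.locked_at is None:
        return False
    if now - state.locked_at >= UNLOCK_SECONDS:
        # Lock has expired: unlock and start counting from zero again.
        state.failures = 0
        state.locked_at = None
        return False
    return True


def record_attempt(state: AccountState, success: bool, now: float) -> bool:
    """Record one login attempt and return True if the login is allowed."""
    if is_locked(state, now):
        return False  # even a correct password is rejected while locked
    if success:
        state.failures = 0  # assumption: success resets the counter
        return True
    state.failures += 1
    if state.failures >= MAX_FAILURES:
        state.locked_at = now
    return False
```

In this model a user who hits the limit has to wait the full 600 seconds from the 20th failure; retrying during the lock does not extend it.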
Node information
The ages of the processors, along with more technical information, can be looked up on Wikipedia, which in turn links to Intel's information for each processor:
Node | Motherboard version | Processor | Cores | Hyperthreading on | Memory | Hard Drive |
---|---|---|---|---|---|---|
headnode | Supermicro X8DTT 2.1c | Dual E5645 | 12 | N | 24GB | SW Raid 1 - (2) 2TB, no backintimelsh |
bw001 | Supermicro X8DTT 2.1c | Dual E5645 | 12 | N | 48GB | SW Raid 0 - 2*6TB |
bw002 | Supermicro X9DRT-F 3.2 | Dual E5-2620v2 | 12 | N | 64GB | SW Raid 0 - 2*6TB |
bw003 | Supermicro X9DRT-F 3.2 | Dual E5-2620v2 | 12 | N | 64GB | SW Raid 0 - 2*6TB |
bw004 | Supermicro X9DRT-F 3.2 | Dual E5-2620v2 | 12 | N | 64GB | SW Raid 0 - 2*6TB |
bw005 | Supermicro X9DRT-F 3.2 | Dual E5-2620v2 | 12 | N | 64GB | SW Raid 0 - 2*6TB |
bw006 | Supermicro X9DRT-F 3.2 | Dual E5-2620v2 | 12 | N | 64GB | SW Raid 0 - 2*6TB |
bw007 | Supermicro X9DRT-F 3.2 | Dual E5-2620v2 | 12 | N | 64GB | SW Raid 0 - 2*6TB |
rl001 | Asus DSBF-DE v1006 | Dual E5420 | 8 | N | 16GB | 1TB |
rl002 | Asus DSBF-DE v1006 | Dual E5420 | 8 | N | 16GB | 320GB |
rl003 | Asus DSBF-DE v1006 | Dual E5420 | 8 | N | 16GB | 320GB |
rl004 | Asus DSBF-DE v1006 | Dual E5420 | 8 | N | 16GB | 320GB |
rlfl005 - former headnode | ? | ? | ? | ? | ? | ? |
Maintenance records
3/4/16 - Noted that the DSBF-DE motherboard has a v1007 update, but it is not worth the trouble given the age of the machines. X9DRT-F has a 3.2 upgrade from 3.0; can be done when nodes are free. - meh26
3/10/16-3/11/16 - updated X9DRT-F motherboards from 3.0 to 3.2 during downtime due to headnode drive replacement resyncing. - meh26
3/16/16 - updated IPMI card firmware from 2.0 to 3.0 - meh26
8/2/2016: Lulu: No firmware update from Michael. There are no security updates available via YUM. One hard drive (sda) is failing (smartctl found 30 errors). Forced fsck; no errors found. Modified /etc/ssh/sshd_config to keep idle ssh connections alive.
11/1/2016: Lulu: No firmware update from Michael. There are no security updates available via YUM. One hard drive (sda) is failing. Backed up the root partition using ddimage.
2/5/2017: Lulu: There was a power outage on Saturday, 2/4. No firmware update. Deleted all running or queued jobs. No errors on hard drives. Forced fsck.