Inventory counts and other details related to CCB's HPC clusters.


This list does not include the Linux file servers under ChemIT management. Add later?

...

Columns needed: software installed/managed (do per cluster), nodes/cores, age, storage (head/compute), and related details (RAID: h/w or s/w?)

Cluster name

Number of nodes
(HNs and CNs)

ChemIT
/ other
support

DNS name
(may have CNAME)

IP
Gateway

ChemIT Network / Headnode IPMI Network

Headnode
OS

OS Version

Provisioning
software

Provisioning
software version

Scheduler
software

Scheduler
version

Internal
network

UPS for
headnode

Maintenance
window

Upgrade status

Abruna

10

ChemIT

hartree.chem.cornell.edu

10.253.229.249

192.168.255.30 / none

Fedora

11

Perceus

 

Torque/Maui

torque-2.5.2 / maui-3.3

 

NONE
Unique: Need to do ASAP (no backup of OS!)

 

4/14: Within a year, upgrade OS?
No h/w upgrades planned

Ananth

41

CAC

astra.cac.cornell.edu

128.84.3.66

N/A / N/A

CentOS

6.2

Rocks

 

 

 

 

 

 

n/a

Collum

9

ChemIT

tera.collum.chem.cornell.edu

10.253.229.248

none / none

CentOS

6.4

Warewulf

3.4

Torque/Maui

torque-2.5.13 / maui-3.3.1

 

Done Spring'14

 

Fall'13: Upgraded OS and added 2 nodes

Hoffmann

20

ChemIT

sol.hoffmann.chem.cornell.edu

10.253.229.89

192.168.255.100 / none

CentOS

6.4

Warewulf

3.4

Torque/Maui

torque-2.5.13 / maui-3.3.1

 

Done Spring'14

 

Winter'13/14: Upgraded OS and added 2 nodes

Lancaster (w/ Crane)

11

ChemIT

revc.lancaster.chem.cornell.edu

128.253.229.213

none / none

CentOS

6.5

Warewulf

3.4

Torque/Maui

torque-2.5.13 / maui-3.3.1

 

Done Spring'14
(Funded by Crane)

 

Spring'14: Upgraded OS and added 2 nodes

Loring

5

ChemIT

rutabaga.chem.cornell.edu

10.253.229.156

192.168.255.20 / none

Fedora

9

Perceus

 

 

 

 

NONE
Unique: Need to do ASAP (no backup of OS!)

 

4/14: When to do OS upgrades, and why?
No hardware upgrades planned

Scheraga

95

ChemIT

scheraga.chem.cornell.edu

128.253.229.65

192.168.255.1 / 192.168.255.5

Fedora

13

Perceus

 

Torque/Maui

torque-2.5.9 / maui-3.3.1

 

Done Fall'14
(See chart for stand-alone computational (GPU) computers)

 

Summer'14: $50K hardware upgrades; to include OS upgrade.

Widom

4 8

ChemIT

Connected to ChemIT (C4) for now

 

  

CentOS  

6.5  

 

 

 

 

 

Yes
Waiting for head node to deploy (on C4 at the moment)

 

Spring'14: Upgraded OS and added 2 nodes

ChemIT (C4)

1

ChemIT

cluster.chem.cornell.edu

10.253.229.9

192.168.255.110 

CentOS

6.4 from 6.2

Warewulf

 

Torque/Maui

torque-2.5.12 / maui-3.3.1

 

Done
Old UPS; on the margin

 

4/14: When to turn into production, and for whom?

Totals:

 

ChemIT

 

 

  

 

 

 

 

 

 

 

 

 

 

CCB non-cluster HPCs, summary information

...

Name of system, and purpose

ChemIT
/ other
support

DNS name
(may have CNAME)

IP
Gateway

ChemIT Network / Headnode IPMI Network

OS

OS Version

UPS

Maintenance
window

Upgrade status

 

Baird: 1 rack-mounted computational computer

ChemIT

new - pending

 

as-chm-bair-08.ad.cornell.edu

compute.baird.chem.cornell.edu

10.253.229.178

192.168.255.120 / 192.168.255.121

Windows Server

2012R2

NONE
Need? (Not backed up, but OS, config, and apps are not too unique.) (Suggested but not done.)

 

 

 

Freed: Eldor

ChemIT

eldor.acert.chem.cornell.edu

10.253.229.96

192.168.255.87 

CentOS

6.4

NONE
Unique: Need to do ASAP? (OS is backed up)

 

 

 

Petersen: 2 rack-mounted computational computers

ChemIT

calc01.petersen.chem.cornell.edu
calc02.petersen.chem.cornell.edu

10.253.229.196 / 10.253.229.192

  

Windows Server

2012R2

Yes, but it still needs to be deployed in true production; using Widom's UPS for now.
ChemIT is using the UPS to test UPS-related control software.

 

 

 

Scheraga: 4 GPU rack-mounted computational computers

ChemIT

 

 

  

Linux: Which distro?

 

NONE
($900, estimate)
Need to protect? Data point: Feb'14 outage resulted in one of these not booting up correctly.
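
The Totals row in the cluster table is still empty. As a rough aid for filling it in, the sketch below sums the node counts per support group. The counts and support groups are copied from the table; the names `CLUSTERS` and `node_totals` are hypothetical, and Widom is left as unknown because the table currently shows "4 8" for its node count. Treat this as an illustration, not as the official totals.

```python
from collections import defaultdict

# (cluster, node count, support group), copied from the cluster table.
# Widom's count is None: the table shows "4 8", so the current value is unclear.
CLUSTERS = [
    ("Abruna", 10, "ChemIT"),
    ("Ananth", 41, "CAC"),
    ("Collum", 9, "ChemIT"),
    ("Hoffmann", 20, "ChemIT"),
    ("Lancaster (w/ Crane)", 11, "ChemIT"),
    ("Loring", 5, "ChemIT"),
    ("Scheraga", 95, "ChemIT"),
    ("Widom", None, "ChemIT"),
    ("ChemIT (C4)", 1, "ChemIT"),
]


def node_totals(clusters):
    """Sum known node counts per support group; list clusters with no count."""
    totals = defaultdict(int)
    unknown = []
    for name, nodes, support in clusters:
        if nodes is None:
            unknown.append(name)
        else:
            totals[support] += nodes
    return dict(totals), unknown


totals, unknown = node_totals(CLUSTERS)
print("Known node totals by support group:", totals)
print("Clusters with unclear node counts:", unknown)
```

Once Widom's node count is confirmed, replace `None` with the real value and rerun to get the ChemIT total.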