Excerpt

Eldor must find a new physical home by Feb. 1st. Priority is Liang's project, which in the short term is not likely to require Eldor.

Project lead: Oliver <oh10>

Team: Zhichun Liang, Peter Borbat, and Lulu Zhu.

Goal

Run humongous jobs, and run them hundreds of times. Expect that having own equipment is most cost-effective, but reality-check against other options such as CAC's RedCloud service (not mutually exclusive).

Strategy: 2-part

Get Liang set up with CAC's services (RedCloud?) so he can create the software and test it, paying per drink at that smaller scale.

...

Identify a sustainable home for Eldore, longer-term

Resources

http://www.cac.cornell.edu/services/projects.aspx

http://www.cac.cornell.edu/Services/rates/

...

Oliver's meeting notes, 11/28/12 meeting

Barry: Need to reinstall OS (since depends on AFS and CCMR's infrastructure).

...

Oliver's understanding from discussion: Everyone else doesn't need the horsepower of a modern Eldore-class system. They can use CAC, with 32-bit (Skeeve-compatible) or 64-bit.

Decision at meeting:

Consider using CAC if it's a technical "fit", and pay per drink. Get estimate before committing. In the short term, do this since need VERY high memory. BUT, note that doing this in production is likely not cost-effective compared to investing in own hardware.

...

Past email threads, pre-meeting

Barry, 11/20/12, 2:34 PM

Eldor will need to be reinstalled. Our installation is tied to AFS and our infrastructure. Going to windows or a standalone Linux system seems like a good idea.

Peter, 11/20/12, 12:51 PM

It may be better to set up user accounts on, or to lease, a CAC v4-64g node (64 GB, Red Hat Linux). The lease option may be practicable if the node is used intensively. CAC operates in cost-recovery mode; we just need to estimate fees.

...

Alternatively, I can use it for 3D-EM simulations, for example, since its GPUs are not used by any NLLS code. We surely can find a good use for it.

Oliver, 11/20/12, 10:23 AM

(1) Hosting the Eldore server. (Jack will be speaking with Nandini about CAC's services)