You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Next »

Ideas: Expand computational capabilities (power and/ or number of computers), increase efficiency of code, streamline workflows, etc. 

Purpose of page

Write down concerns, ideas, and efforts as understood by Chemistry IT so research group members can review and correct.

Request from group: Buy a server to process multiple Matlab calculation simultaneously

Outstanding questions:

  • Windows or Linux? Past testing by researchers have yielded faster processing on servers running Linux than Windows. Some researchers only comfortable working within Windows.
  • If Windows, how will user accounts and access work? How many simultaneously? Per user or shared accounts? How manage contention, if desired?
  • Coordinate monthly (or every 3 months) shutdown periods to ensure baseline patching and OS, file-share and hardware checking.

Server spec suggestions and costs

CriteriaCurrent deskop's specs~4 * desktop specs is minumum proposed serverCost increase if go up about a levelBorrowed Dell's specsNotes

Cores,

hyper-threading (HT)

4 cores

No HT

i5-6500, 3.20 GHz

16 cores (Two, 8 cores each)

HT-capable

Each: Xeon Silver 4110

$xx: 16 cores => 20 cores

$xx: 16 cores = 24 cores

16 cores (Two, 8 cores each)

HT-capable

Each: Xeon E5-2620v4

Q: Testing performance difference between using HT and not using HT?
Storage, all SSD500 GB2 TB

$800: 2 TB => 3TB

$1,700: 2TB = 3.8 TB

SSD (size n/a, at 400GB)

If Windows, buy $xx software on server (free clients) to enable moving large amounts of data to server more speedily.

Large amounts of data not needed to be stored on server, nor moved from server (simply deleted after processing).

RAM32 GB128 GB$xx: 128 GB => 256 GB?32 GB 
Other 

Need: UPS ($xxx)

Option: Redundant power supply unit $xxx)

n/a  
Approx. cost~$900?~$xxxn/a$xxx 

Status

10/5/17: Oliver met with Mahdi <mh2356> and Kushal <ks2285>. Action steps to have Peng review and refine:

(1) Group: Decide if worth having (select?) Matlab code reviewed by experts at CAC, focused primarily to increase efficiency. Secondary outcomes include:

  • May result in the ability to run older code on current version of Matlab, expanding where code could run on non-group computers.
  • May result in clarifying computational bottlenecks so the best fitted computational hardware is purchased. What does one prioritize when faced with choice to invest in: better processors, number of processors, number of cores per processors, bus speeds, SSD drives, and/ or RAM?
  • May result in a confirmation whether or not problem lends itself to parallelization. If so, can increase efficiency with the right hardware and expands the locations to efficiently run the code (RedCloud, etc.).

(2) Oliver: Have group test their code on test server in 248, initially by-passing using the network to get the data to the server.

  • Time comparisons of both single runs and simultaneous runs. Does the server reduce computational time for a single job, as compared to current workstations? To what degree does the server's performance drop as more jobs are added? Again, compare to current workstations.

(3) Oliver: Optimize getting data to test server in 248 via the network.

Other thoughs from Oliver:

  • Confirm if any campus computing is a good fit for the group: CISER, RedCloud (likely only if code can be and is parallelized), David Botsh's cluster(?), others?
  • Group may benefit from optimizing workflow at various workstations.

Prior conversations, for historical background

  • No labels