...
(1) Information related to testing
End-user reporting ticket number is INC...
- In any communication to ChemIT during testing, include this number in the subject line. Thank you.
- ChemIT will send email to your Cornell <NetID@cornell.edu> email address.
- Ensure you are getting that email where you normally read your email.
...
- Link to graphical representation comparing the old Matrix to the new Matrix, for researchers.
- Lin to text page containing full details on the new Matrix configurations. For both researchers and support staff.
- Matrix Partitioning Configuration 2014-09-15.pdf
- Includes conventions, intentions, and associated technically-imposed limitations.
Work to be done during new Matrix testing: Who is responsible for what?
Each researcher must confirm that the applications they depend on work for them on the new Matrix.
...
- Do not contact ChemIT for this, please.
Researchers in Poland should contact Czarek. Follow his instructions, please.
- To resolve your problem, Czarek may then need to then work with ChemIT staff.
All others should contact the group member assigned as lead tester for the application they are having problems with. Follow their instructions, please.
- Matrix end-user application information
- To resolve your problem, a lead tester may need to work with ChemIT staff.
Deadlines for researchers' testing
DATE 1: DayOfWeek, 10/xx/14: By this date, all researchers are expected to at least simply login (verifying that their account itself works).
...
- Any delays in your testing will extend the project schedule.
Testing schedule activities by ChemIT staff
What's been done , through Phase Oneto date:
- The system, with all requested software, has been installed.
- All hardware has been tested.
- All applications have been test and confirmed working tested by designated group researchers designated as application leads.
Phase Two's schedule (dovetailing Upcoming scheduled events (dove-tailing with above deadlines for researchers)
...
Notes to get you started and keep you going
During testing, there is a new and an old cluster available to you at the same time.
QuickStart info
- Thus, you will be able to log into two different home directories, within two completely different systems:
scheraga.chem.cornell.edu
matrixtest.scheraga.chem.cornell.edu
- Your account and password on the new Matrix are the same as the old Matrix.
- See section below, "Get access to your account", within the "(2) Information relevant for testing and after testing" section.
- Continue to use the old Matrix cluster for your production research work.
- Use the new cluster ONLY to confirm it will work for you once we cut-over from the old Matrix. Do not use it for production research.
- Interruptions to the new Matrix during this testing phase may occur at any time, with no advanced warning.
Detailed info, particularly if QuickStart above is insufficient
Elaborate on "We will provide a second email with instructions for getting to the new cluster and setting up your home directory, and how storage is set up."
...
Other details
- See the storage chart for more details on disks & partitions in the new cluster, and how that compares to the old Matrix.
- All research applications are installed under “/software”.
- Get instantly more space, for temporary use, in your home directory, by using the "/notbackedup" disk.
- See section above, "How is the new Fall 2014 Matrix different than the old Matrix?", for links to more details.
- Get info on the nodes on the new Matrix which are available during testing.
- See chart with full details:
- Summary info, true during Phase Two testing:
- Initially about 18 nodes will be made available to all researchers.
- This represents the newly purchased nodes and two of each of 5 types of old node.
- There are 8 new compute nodes (m108-m115) accessible. Each new node has 20 cores available.
- For testing, select old Matrix nodes will be moved to the new Matrix to ensure adequate computing capacity during testing.
- During testing, GPU's are not available in the new cluster.
- However, 2 of their nodes will be running as just CPU nodes.
- Initially about 18 nodes will be made available to all researchers.
- See chart with full details:
- Queuing information
- No Express queue available for testing.
- An Express queue will be provisioned when new Matrix is in production.
- The default queue is dque instead of express as no node has been assigned to be available for Express queue.
- No Express queue available for testing.
...
Learn to effectively use your home directory, along with /storage, /notbackedup, and /software.
- Your home directory is "/users/netID".
- In this location, just retain the files you need for running jobs.
- Place files into storage which you don't need for your current jobs.
- KEY: Keeping your home directory small aids in disaster recovery for the entire group.
- Get instantly more space, for temporary use, in your home directory, by using the "/notbackedup" disk.
- See section below, "How to run jobs requiring lots of space in ones home directory".
- KEY: Keeping your home directory small aids in disaster recovery for the entire group.
- Your storage directory is "/storage/netID".
- Storage is where non-actively used, Scheraga-related, research files belong.
- In production (NOT DURING TESTING!), you will move results and data you want to save long term back to storage
- REMEMBER: All date in "/storage/netID" will be deleted after this testing phase.
- This deletion will allow us to move your current, production data from the old Matrix, when we cut-over.
- All research applications are installed under “/software”/software”.
- Get instantly more space, for temporary use, in your home directory, by using the "/notbackedup" disk.
- See section below, "How to run jobs requiring lots of space in ones home directory".
How to run jobs requiring lots of space in ones home directory
...