Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Excerpt

Prevent delays be directing support requests to NMR, not Chemistry IT or CIT. Thank you!


...

Summary expectations

  • The owner and manager of this service is Ivan Keresztes <ik54>.
  • Ivan is responsible for this server's function and continued maintenance, its operations, and patron and back-end documentation related to this service.
  • Ivan is responsible for creating and implementing any desired enhancements.
  • The server and its infrastructure reside or depend on resources controlled or otherwise managed by Ivan
    • To be super-clear, not Chemistry IT or CIT.

======== DRAFT ========

 

...

Table on Contents

Table of Contents

This is a public-facing web server critical to NRM's service offerings.

The tolerance for outage of this server is (define, please), per Ivan and Coates.

(Consequence if not available for hours? days? lost data going back how long? etc.)

 

Purpose of this write-up from Oliver, Chemistry IT Manager

This write-up represents an investment by Oliver to help inspire greater clarity in problem resolution by NMR so the right people and groups work on the problem, and no delays occur by the problem being routed to the wrong groups or people. I also hope the write-up will inspire taking NMR staff to take additional steps to help prevent a crisis when problems to do occur.

 

Summary

The owner and manager of this service is Ivan Keresztes <ik54>.

Ivan is fully responsible for this server's function and continued maintenance and operations. As well as any desired enhancements.

That server and its infrastructure resides or depends on resources controlled or otherwise managed by Ivan, not Chemistry IT or CIT.

 

This is a public-facing web server critical to NRM's service offerings. Specifically, the tolerance for outage of this server is (define, please), per Ivan and Coates.

  • Consequence if not available for hours? days? lost data going back how long? etc.)

Key recommendation from Chemistry IT

Much of the below recommendations presume are simply best practices. They are certainly worth investing in if the scheduling server is critically important to NMR's service delivery.

  • Develop break/ fix procedures for Ivan's group which are independent from Ivan so it can serve the group when Ivan is away, if necessary.
  • Document processes to ensure server software remains patched, while ensuring continued functionality. This would include, especially for a public-facing web server, patching regularly, or upgrading over time, the OS, Apache, Perl, and their associated programs.
  • Line up, document, and test processes to ensure server is backed up and restorable to an acceptable period of time in the past.

 

Example: Contact CIT to determine if they would be capable and willing to provide expert support services via their fee services. Information available at <http://www.it.cornell.edu/about/atsus/iws/>. Explore if CIT (or other firm) could provide support backup to what Ivan knows about the server's set-up, especially important if he's away and a crisis occurs. Doing so before there are problems can increase the chance of getting expert and rapid responses, as compared with what you will get if waiting for a problem to occur. CIT (or other firm) might also be able to expertly and cost-effectively facilitate adding reasonable security, or functional enhancements, over time.

 

Contextual information

The service runs on an Apache web server running on a Linux server, and depends on files and Perl scripts.

The Linux server is hosted within Amazon Web Services (AWS), via Cornell's contract. This incurs a monthly charge (amount?). The server is managed remotely by Ivan.

N.B. The AWS charges are currently going through CIT. CIT is processing the charge to their account as a favor to Chemistry so we did not have to create an account ourselves with AWS. (This can be changed, if desired.) CIT currently has no other persistent responsibilities or connections to this server.

 

Clarifying CIT

...

and Chemistry IT

...

staff are not responsible

  • CIT and Chemistry IT are not responsible for break/ fix of the NMR web scheduler or any of its or related infrastructure.
  • CIT and Chemistry IT are not responsible for enhancements to the NMR web scheduler or related infrastructure.

 

With Chemistry IT's assistance, CIT did provide a generous amount of free consulting technical expertise, and implementation work and debugging to migrate the server from the extremely old hardware in 248 Baker Lab into the Amazon Web Services (AWS) infrastructure. They ensured correctly configured networking. They also de-bugged the software to ensure it would run correctly on more contemporary software (OS, Apache, and Perl). Migration occurred Tuesday, Oct. 11, 2016, from about 8:45 am to 10-ish. Ivan signed off on migration's success on (date?). (Q: Was the last problem detected on 12/19/16, which then hopefully was subsequently resolved to completion? That incident inspired this write-up since Chemistry IT was contact when the problem with the NMR Scheduler was detected.)

 

Historical notes

Chemistry IT has served as trusted consultants to Ivan regarding this server, through helping to get it migrated to AWS.

...

See also

======== DRAFT ========