Key questions to address as we progress on finishing the clean-up from the Matrix failure.

Questions originally seeded by Michael Hint, 11/8/13, and subsequently elaborated on.

RAID Card Management

Talking about scripts to check on RAID? Has the 3ware web interface gui been installed that would deal with this issue?

To do:

Jorge Files

I believe a large amount of his files were moved to the USB drive. What happened to that USB drive? Is it still connected? Are we shipping it to him? We bought a second drive, do we take a copy for Harold before it leaves?

To do:

  • Once Matrix is up again, Jorge will confirm the hard drive has the files he wants there.
  • Then we will disconnect the drive, and duplicate its contents to second drive.
  • Then Jorge will pick up one drive in December. And Harold will hold on to the second "just in case".

Result:Jorge will again have access to his (static, large amounts of old) data, but it doesn't have to be served (and backed up or restored) on Matrix.

Temporary files idea

What is the story on these temporary files both in location and backup strategy as affected by their existence? Documentation to all users concerning how this works? I can see later when someone thinks these temporary files are real files.

Done:

  • A temporary directory has been created called, "notbackedup".
  • Lulu has confirmed that items in this temp spaceare counted as part of a user's home directory quota.
  • Harold has agreed to designate a researcher to monitor disk space usage on Matrix regularly (~every 2 weeks or month, to start?).

To do:

  • Communicate (document) use of temporary.
  • Develop initial disk usage reports, for group's storage monitor to run.

With monitoring, hopefully feedback will result in large, transitional files not being backed up by EZ-Backup. Nor needing to be restored, if another failure.

Extra drive/spare drive discussion

I rushed and rushed to get the 3TB constellation drive RMA’d and no it sits as downtime is not scheduled. The RAID card must be told about the drive. Then there are discussions on adding more drives that I don’t understand and I don’t think the research group understands the physical space issues.

Done:

  • Michael installed the HD as a hot-spare to the RAID 10.

Quotas

What is the story on quotas in both file count and volume count?

Done:

  • Harold has agreed to designate a researcher to monitor disk space usage on Matrix regularly (~every 2 weeks or month, to start?).
  • No quota on number of files, only volume. But monitor number of files for high-fliers.
    • An older program in use generates inordinate amount of data files.

To do:

  • Develop initial disk usage reports, for group's storage monitor to run.
    • To include the number of files, not just space.

With monitoring, hopefully feedback will result limiting the number of files needing to be backed up by EZ-Backup, and hence possibly restored.

Backups

What is the story on EZBackup? Log? Confirmation from EZBackup staff independently if possible as to how much data is over in Rhodes?

Done:

  • Harold authorized EZ-Backup for 2 months.
  • Logs reviewed by Lulu and Michael.
  • Lulu successfully restored random elements.
  • All but Li's directory has been backed up.

To do:

  • After Matrix is available to Li, she just further clean up her directory (within a week, please!).
    • Once Li's directory is small enough (to be decided by Yi and Harold), it can be backed up to EZ-Backup
  • Review first bill.
    • Confirm it is the right service/ cost to keep running beyond 2 month trial period.
  • No labels