MPU Meeting Tuesday 15th September 2009
Power Management Project
The sleep component is now running on all the HP DC7900 machines in the student labs.
rpmsubmit
The dates for the change to the new system have changed. Alastair now plans to do this on Tuesday 22nd September.
AFS Component
As part of the pandemic planning work Stephen has been working on preparing for the roll-out of the new openafs component to the DB and file servers.
TIBs
Not much happened. Chris will put together a plan for how the new TIBS LCFG component can be brought into service.
LCFG Server Refactoring
Nothing happened
Miscellaneous Development
- release scripts
- Some changes were made to the release scripts to make them more standard. Also a new script was added for creating branches and this was documented on the ReleaseManagementProcedures page.
Pandemic Planning
Whilst writing up the details of how to restore the LCFG master if it dies Stephen came up against a few awkward issues related to retrieving data from the mirror servers.
He suggested that the rmirror component could be improved with the addition of a spanning map which holds an inventory of all the data being backed up. This could work in a similar way to the standard inventory component. The addition of a simple CGI script would make it easy to view and search the information. We will bring up this suggestion at the Technical Meeting in October. For now Stephen will make a list of all the MPU data being mirrored so we can find it easily in a hurry.
Stephen also noted there are no clear instructions on the best way to get data back whilst preserving modes and permissions. This can be a pain when there is a lot of data involved.
It was agreed that we should look into having a virtual machine at KB which acts as a hot spare for the LCFG master to avoid bootstrapping issues and to make the replacement as speedy as possible.
The addition of ethernet bonding to all MPU servers is now completed with the switch to the new header.
Operational
- New forum-server-room header
- The new new
forum-server-room.h
header was added to all the relevant MPU machines and the nut
component started. A question was raised about the best approach for virtualised servers.
- bakerloo
- bakerloo has been switched back to the stable release.
- New R710 machines
- Once the KB machine room is ready we will get a couple of R710 machines up there so we can deploy some machines off-site (like a hotspare for the LCFG master).
This Week
Alastair will:
- research pandemic interests
- write pandemic howtos
- finish Nagios testing in etherbond.h
- reboot central
Chris will:
- research pandemic interests
- TIBS adoption plan
Stephen will:
- research pandemic interests
- document disaster recovery for LCFG master
- Make a list of MPU data backups
- update LCFG server documentation
- test AFS component database servers
--
StephenQuinney - 18 Sep 2009