MPU Meeting Thursday 25th August 2016

Inventory

Alastair has publicised the REST API, this includes examples using curl and Perl/LWP.

Chris would like to wrap the Dell dsu tool as a firmware report.

LCFG Client Refactoring

Stephen has been learning about how to use flex and bison for parsing the LCFG contexts. He has a very simplistic prototype working but it needs a lot more work before it is ready to be shipped as part of the core library.

MPU SL7

NX servers
Both of the NX servers - piccadilly and northern have been upgraded to SL7. This revealed a little problem with the fstab component attempting to create a partition with negative space as there was not sufficient for the proposed partitioning scheme. It really needs the error handling improved.

PXE / tftp / rpmaccel
Stephen has been working on upgrading the cache/PXE servers to SL7, the services are all running on a test VM, he plans to upgrade the backup server wildcat next week.

KVM servers
The two new KVM servers now have FC cards installed. There have been some issues with the serial consoles hanging and requiring a hard powercycle to clear. Ian Durkacz has been investigating.

Miscellaneous Development

SL6.8
SL6 machines on the develop release are now using SL6.8. Stephen remembered that the initscripts package needed updating, thankfully the changes were minor this time.

dsu
Chris plans to create an LCFG header and include the dsu utility on all servers.

qemu rhev
Chris has been working on the qemu rhev packages for SL7

lcfg slave stats
We discussed the stats from recent lcfg slave runs on various different hardware/OS combinations, it looks like the P310 provides significant speed improvements.

Operational

SSH servers
The SSH servers brendel and schiff were rebooted to pick up the latest kernel. The network interface names were also fixed at the same time.

Network names
Stephen has been working through the hardware models on which SL7 is being used and checking the network interface names are correct. He has also been updating the SL7 hardware support page and the Consistent Network Interface Naming page.

virtualbox
A new version of virtualbox - 5.1.4 - was released. This has been pushed out immediately so we start semester 1 with the latest version. Upgrading again should fix any remaining problems from the previous update to 5.1.2.

KVM crashes
Roger has been experiencing regular crashes of his SL7 VMs. We need to understand what is happening.

fastbugs
Stephen has added the SL fastbugs package repository to the standard rpmpath which makes it much easier to pull in special updates when required.

This Week

  • Alastair
    • Inventory project
      • continue working through InvProjectWorkFlow
      • Document clientreport (eg how to add modules)
      • Document order sync code
      • Document hpreport processing script
      • Continue work on RESTful API - InvProjectRESTapi
      • Start work on final report!
    • Remove default pool if ops meeting agrees
    • Dump 'atom'
    • Deploy encrypted /tmp and swap conversion script
      • Deploy on office desktops September 7th/8th
      • Need to warn users that Gnome3 may pop up a window about /tmp being full (when script is run)
    • Schedule MPU meeting to discuss systemd ordering
    • Reschedule MPU futures meeting
    • Continue building computing.help honeypot
    • package up ILW stuff and document process
    • submit polkit bug to redhat - with Stephen
    • After next kernel update - Run named existence report on bandama
    • Continue researching whether 'discard' or fstrim is appropriate/possible for cryptab partitionsNo further success
    • Once Stephen updated DNS part, submit SL7 server base project to August devel meeting for closing
    • Look at MPUActivitiesList
    • MPU SL7
      • Try bringing up an SL7 test server akin to 'otter' - package slave exportReady to go, once nagios problems resolved.
      • Chase Toby again about testing latest perl-Moose under prometheus (and then make live) after October 1
    • Add need LCFG compiler analysis / benchmark to MPUActivitiesList
    • Add documentation on ssd-disk.h on LCFG wiki https://wiki.lcfg.org/bin/view/LCFG/SSDdisk
    • Add looking at cgroups for NX service to MPUActivitiesList
    • Check sysmans (et al) have 'nograce'.
    • Review computing.help pages at https://computing.help.inf.ed.ac.uk/pages-with-last-reviewed-date
      • sleep, NX, ssh on windows

  • Chris
    • Inventory project
      • Continue work on clientreport modules for replacing firmwarereport
    • pkgsearch for SL7
      • reimplement as a yum web front end (yum search for keyword produce an html file of links to cgi to do yum info)
      • Need support multiple platforms
    • MPU SL7
      • Investigate KDE problems on staff.nx (SL7)
      • continue work on qemu header
      • Schedule some migrations to new SL7 kvm server
    • Investigate R730 iDRAC with Ian D
      • firmware upgrade - play more with 'dsu'
      • install 'dsu' on SL6 and document (look at wrapper to pretend about /etc/redhat-release contents)
    • Look at MPUActivitiesList
    • Add SL6<->SL7 KVM migration info to MPU wiki docs on virtualisation
    • Check with RAT whether we still need SL6 32bit
    • Look to see if there's a Dell R series server which has the same CPU as 'muro'
    • Investigate Roger's mysterious KVM guest dying problem
    • Review computing.help pages at https://computing.help.inf.ed.ac.uk/pages-with-last-reviewed-date

  • Stephen
    • LCFG client refactor stage 1
      • schedule debrief meeting
    • LCFG client refactor stage 2
      • testing and documentation
      • blog article (once documentation complete)
    • Investigate kernel component pipe moan by using shell commands instead of RPM module => waiting on 7.2 => activities list
    • LCFG server symlink to exam branches - produce reporting script and discuss with Graham
    • Circulate dmesg proposal
    • submit polkit bug to redhat - with Alastair
    • SL7 MPU
      • Continue work on package caches (PXE server and NFS to go)
    • Work on RT tickets
    • Add something about DNS to FinalProjectReport356
    • Look at MPUActivitiesList
    • Document BIOS settings for Lenovo box
    • Check hardware model headers to make sure all models support new network naming scheme for SL7
    • Review computing.help pages at https://computing.help.inf.ed.ac.uk/pages-with-last-reviewed-date

-- AlastairScobie - 25 Aug 2016

Topic revision: r11 - 27 Sep 2016 - 08:01:52 - StephenQuinney
 
This site is powered by the TWiki collaboration platformCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback
This Wiki uses Cookies