RAT Unit Meeting -- 18-11-2019

Projects

CompProj:379 Review of DICE desktop platform

  • gdutton: completion report in progress
  • gdutton: create new project once report done purpose has changed to wider than original remit

CompProj:392 Live Chat Service

  • discussed with CSO's, many opinions, another non-queued interruption would be a problem
  • looked at some packages
  • can we use teams - this has mobile app, web client is fine on linux, but automation may be needed to make this work in ways like live chat, including returning to RT
  • gdutton: written report on status and options
  • low priority

CompProj:405 Map of the Taught Student's Labs

  • timc: did completion report and DPIA in BPW
  • timc: added iOS web interface onto main site (or groups.inf) and add link on documentation page in BPW
  • gdutton: security review done

CompProj:417 Roles Management

  • gdutton: met Toby for remaining deliverables, report written
  • finish project when roles supremo page is sufficient

CompProj:420 Hadoop

  • chris completed docs
  • timc complete all other deliverables
  • check if dpia needed, do one if so
  • we need to decide if we can auto-delete and/or retention policy
    • could have two stage process i.e. archive first then delete, with email to say accessible but not run jobs, will be deleted after set period of time otherwise

CompProj:455 Mock REF Reviewer System

  • timc: did completion report in BPW
  • timc: decommissioned

CompProj:463 Merged MLP/MSC Teaching Clusters

  • script done to generate auto.homes file driven by caps - important so we can separate users onto the correct filesystem, teaching or research
    • add ability to get config from file
    • now shifting people about, along with the amount of data involved means this will take a while to do
  • script managing home directories nor running in reporting mode for safety
  • then finish component, hopefully LCFG mangaeable
  • need to a DPIA

CompProj:464 User access to last/ps/w etc

  • gdutton: concrete proposals report next week

CompProj:465 Teaching Software 2018/19

  • timc: tick off deliverables etc

CompProj:470 Personalised Portal Page for Academic Staff

  • ongoing, "select" academics testing; need to demo for admin; core development is done; many reports to add or transfer from Portal but that is not part of this project
  • continue on documentation

CompProj:472 Procure PGR GPU Cluster

  • done bar 10gb infrastructure
    • 10gb cards now pulled
    • 10gb switch now populated with 10gb modules
  • timc: done completion
  • iainr: discuss infrastructure with gdmr/idurkacz

CompProj:506 Teaching Software 2019/20

  • S2: kaldi done
    • included on cluster
  • S2: python related teaching requests still todo
  • S2: nbgrader request

HTTP -> HTTPS check

  • rwb to investigate

CompProj:539 User Facing MHR Data Asset Register

  • gdutton: liaise with cms, we have an internal system - capture requirement and develop for users

GPU Approved Supplier Procurement

  • timc: speak to iainr/gdutton
  • mini tender in progress for specific order, tight deadline

Misc Development

  • continuing on migrating some TSP processes
    • server-side validation done and working
    • added access control to allow admin staff to view
    • still todo form 2 handling
    • overall reports done
    • gdutton: other TSP enhancement

    • RT lifespan/retention plus other things - discussed at ops
      • add accounts by ldap
      • look at merging emails
      • look at purging and retention rules
      • run pre move to postgresql scripts

Operational

  • 7.6 upgrades - scheduling nodes - python change breaks scripts and they need rewritten so hold off on these
    • all done bar CDT scheduler
    • CDT scheduler can now be done
      • maybe look at live migration to avoid cluster downtime

  • h/w security decommission
    • wasserboxer - move bridgeport off and just switch off
    • fondant - off, needs to be physically removed
    • arcsim - ssh access only, also wants a VM server for SVN
    • redsea - off, needs to be physically removed
    • bocian/blanik/karenin
      • all ssh access only while migration completed by Rob, manual iptables configuration
    • henwen + wilbur - chased Amos/Charles

  • iainr/rwb/gdutton: crypt server
    • working!

  • iainr: 1 gpu server still to install
    • done

  • iainr has got power figures from "hannah", max power draw about 8.5A with all GPUs running but if running CPUs power draw goes down
    • data can be collected live from Dells so can get historical maximum, need to check if higher than aircon maximum

  • doing DPIAs:
    • we need to do all our services
      • Webmark
      • Theon
      • TheonPortal
      • ProjSubs, Projects-Archive
      • DPMT
      • Slurm
      • RT4 (can use some of Unidesk replacement one)
      • License server logs
      • Lab exam ?
    • aburford: need to do one for ProctorU
    • timc: doing WhosOff (maybe)

  • tophat attendance * privacy question, may not be specifically tophat, sitting with IS for comment

  • looking at live capture options
    • live broadcast audio much better, latency in transcription may be significant
    • doing more testing with disability office
    • likely to be a compromise approach

  • course questionnaire mid semester feedback
    • using microsoft forms rather than learn quiz
    • done questionnaire for each course in semester2
    • organised output with web links for lecturers

  • exam prep machines
    • wrong online procedural docs need to be removed and replaced - USU doing

  • intermittent filesystem weirdness on damnii nodes
    • probably SSD
    • going to upgrade BIOS to see if fixes console problem
    • another SSD problem fixed by power cycle

  • lennoxtown still with Novatech, no response since 1st Jan

  • ubatuba also still broken (2 GPU failures)
    • will wait on lennoxtown and then do a GPU swap to test

  • PG v12 - no OIDs anymore, affects TheonUI

  • shuffling data from teaching cluster to research cluster
    • iain will do a few and write procedure so richard can then assist

AOCB

Edit | Attach | Print version | History: r296 < r295 < r294 < r293 < r292 | Backlinks | Raw View | Raw edit | More topic actions...
Topic revision: r293 - 13 Jan 2020 - 15:24:55 - Main.TimColles
 
This site is powered by the TWiki collaboration platformCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback
This Wiki uses Cookies