Core Committer Weekly Interlock - September 7th 2017

Attendees

Ahmed Osama

Former user (Deleted)

Bishoy Youssef

Erik Smith

Jean Pierre

Former user (Deleted)

Leo Zhang

Michael Hepfer

Former user (Deleted)

Former user (Deleted)

Thomas Sullivan

Tim Larson

Former user (Deleted)

Former user (Deleted)


Agenda

  1. 2.20.0 released 9/4/17; next release, 2.21.0, targeted for 9/11/17
  2. Ease of use of the Vagrant-based demo
    1. Comment on the Slack channel, week of 8/7: "At this point I've given up on rackhd. If even the demo requires an old version of ubuntu to run an old version of virtualbox to get it working, I will stick with something simpler."
      1. Priority must be on keeping the demo simple and easy to use.
      2. Former user (Deleted) to draft requirements for the team to agree on, then use them to reach consensus on the next steps.
    2. Suggestion 1 - it is easier to create a script that reuses the nightly-built Docker images + InfraSIM.
      1. disadvantage - the Docker Compose yml is missing the network setup; Linux-specific demo only
    3. Suggestion 2 - use a Vagrant script to deploy RackHD (Docker) + InfraSIM (similar to Suggestion 1, but using the Vagrant script for deployment to meet an OS-agnostic requirement)
      1. advantage - OS-agnostic; virtual switch deployment can be done in a Ruby script
      2. disadvantage - multiple Vagrant versions; this was also seen as an advantage, since the Vagrant file version can be bumped.
    4. Suggestion 3 - run Docker in Docker, leveraging existing Docker images
      1. disadvantage - privilege, networking, and performance concerns with this solution
  3. RackHD Tooling Updates
    1. Ubuntu to be upgraded to 16.04; 18.04 to be released ~April 2018
      1. RackHD epic / Concourse KI to include migration of the CI environment to Ubuntu 16.04. What has been developed to date for the Concourse env includes the 16.04 migration.
        1. Former user (Deleted) to provide details on what is available.
      2. OVA scripts will need to be updated (passing a parameter) to move to 16.04
    2. Node v6 is the currently available version; RackHD is running v4.
      1. v4 will be EOL 4/18
      2. v8 releases next month
      3. RackHD Epic to be created to migrate from v4 → v8
    3. RackHD Epic to be created so that the CI env tests the latest MongoDB version (Mongo recommends using 3.x+ versions only and does not support anything in the 2.x version family)
  4. Review proposal, per Former user (Deleted)'s email on 8/29, for moving CI to container, moving CI to cloud.

    1. AI: CC team to review the slides and come back with feedback/answers to the questions posed in the slide deck.
  5. From QRB meeting notes
    1. Agreement at OLT that we will go fully WSMan-based and eliminate racadm from workflow support.
      1. Content finalized; Leo Zhang / Maglev team will work with Thomas Sullivan on generating the official KI.
      2. Test plan: use the iDRAC simulation tool for virtualized testing (PR quality gates/MasterCI) and introduce more Dell physical hardware to the Regression-Baremetal job.

        1. Former user (Deleted) has downloaded the iDRAC simulation tool; it is currently under evaluation
          1. tool supports only read operations
          2. RackHD Epic to be created that introduces workflow testing to RackHD CI/CD. This will cover SMI service testing; it does not cover "plugin" integration testing.


Did not get to the agenda items below:

  1. Plugfest demo
    1. Remaining stories to polish up and merge into master (currently on a fork) have been transitioned back to Veyron, which is working on them in the current sprint because the priority has escalated.
  2. BareMetal Regression Pipeline now created/monitored.
    1. Plan is to monitor for a few weeks; should it then become a gate?
    2. BareMetal OS install on real hardware currently runs every 2 hours on the nightly Docker images. Will need to kick off BareMetal at the same time as CI.
    3. Do we then continue to run BareMetal every 2 hours?
    4. Should this be part of the Master CI pipeline? If so, we would need a modification to the Merge Freeze tool to freeze on failure of BareMetal regression tests.
  3. All SMI Services have been published.
    1. What documentation and what kind of communication are needed for the open source community?
  4. Process change for Master CI failures - how long can a developer work on a fix for a Master CI failure before being required to back out the change and get back to green?
  5. How to add standalone services to the Master CI/CD pipeline (e.g., SMI Micro Services, UCS, etc.); right now Master CI is strictly core RackHD.


Next meeting will be Thursday September 14.