Friday, 2022-05-06

rlandy|bbl"tripleowallabycentos9" promoting01:18
rlandy|bbldasm|off: looks stuck on cloud-init01:23
*** rlandy|bbl is now known as rlandy|out01:23
*** soniya29 is now known as soniya29|ruck02:50
* soniya29|ruck need to step out 04:29
soniya29|ruckysandeep|out, i need to step out for a while04:30
*** ysandeep|out is now known as ysandeep04:43
*** ysandeep is now known as ysandeep|rover04:43
ysandeep|roversoniya29|ruck, ack o/ Let's sync when you are back.04:44
ysandeep|roversoniya29|ruck, new ruck/rover hackmd: https://hackmd.io/zjCkCnVeQ06bnBq836iF8g 04:51
ysandeep|roverdasm|off, rlandy|out thanks for cloud-init pinning patches, /me checking what's the status currently.04:53
ysandeep|roverrlandy|out, I see a note on hackmd that we can skip fs035 and promote and you were rerunning testproject for fs035- I see your testproject failed but fs035 passed later in periodic run - /me triggering fs035 for older hash and other failed jobs for current hash - hopefully we will get better results now.05:08
ysandeep|roversoniya29|ruck, I am looking into the c8 train cloud-init issue.05:10
soniya29|ruckysandeep|rover, shall we sync?06:33
ysandeep|roversoniya29|ruck, I am in a debug session with cedric, I will ping back after few mins06:34
soniya29|ruckysandeep|rover, sure06:34
soniya29|ruckysandeep|rover, i will start looking on downstream or do you want me to look at something else?06:35
ysandeep|roversoniya29|ruck, please start with monitoring gate first and then go to rdo promotion pipelines - see if we have any new issue.06:36
soniya29|ruckysandeep|rover, ansible-collections-openstack-functional-devstack-octavia is failing with error - 'Provided object does not match schema' but it seems it has been made non-voting06:42
soniya29|ruckcockpit reports it as a gate failure06:42
soniya29|ruckysandeep|rover, ansible-collections-openstack-functional-devstack-releases is green with latest run on stable/1.0.0, earlier it was also failing with above error06:44
ysandeep|roversoniya29|ruck, leave  ansible-collections to jm1 , please check tripleo repos instead06:45
soniya29|ruckysandeep|rover, cs9 standalone is failing because of ssh connection issues06:46
soniya29|ruckthough cs8 standalone has passed06:46
soniya29|ruckysandeep|rover, cs9 standalone multinode ipa is failing on tempest test - TestNetworkBasicOps..it seems a race condition with the resources06:48
ysandeep|roverack please report and debug if they are consistent if we don't have a bug already06:49
soniya29|ruckysandeep|rover, sure06:50
jm1soniya29|ruck: thank you :) We have a couple of failing jobs in ansible openstack collection but this is expected. I will talk about our CI in next ci community meeting. I am continuously monitoring our aom jobs, so you can just ignore them ;)06:53
soniya29|ruckjm1: sure, thanks :)06:53
*** jpena|off is now known as jpena07:01
soniya29|ruckysandeep|rover, cs9  master has been promoted 8 days ago..looking into it07:08
ysandeep|roversoniya29|ruck, ovb issues 07:09
soniya29|ruckysandeep|rover, should i go looking into it or chase different stuffs?07:10
ysandeep|roverYou can leave ovb debug on me and check new failures if any07:13
soniya29|ruckysandeep|rover, okay, thanks07:16
ysandeep|roverrlandy|out, dasm|off hopefully https://review.opendev.org/c/openstack/tripleo-ci/+/840826 will fix the c8 train issue, /me testing 07:37
ysandeep|roversoniya29|ruck, I am available to sync now07:40
* ysandeep|rover looking into ovb tempest issue now, i got a reproducer07:45
soniya29|ruckysandeep|rover, give me 10 min08:15
ysandeep|roversoniya29|ruck, let's meet at 02:30 pm - I will also head out for lunch soon08:15
*** ysandeep|rover is now known as ysandeep|rover|lunch08:20
soniya29|ruckysandeep|rover|lunch, ack08:36
*** soniya29|ruck is now known as soniya29|ruck|lunch08:36
*** ysandeep|rover|lunch is now known as ysandeep|rover08:59
*** soniya29|ruck|lunch is now known as soniya29|ruck09:09
soniya29|ruckysandeep|rover, meet.google.com/twa-avjs-dtq 09:10
rlandy|outsoniya29|ruck: ysandeep|rover: hello 10:21
soniya29|ruckrlandy|out, hello10:21
rlandy|outsoniya29|ruck: ysandeep|rover: let's touch base pls10:21
*** rlandy|out is now known as rlandy10:21
rlandysoniya29|ruck: ysandeep|rover: re: status of OVB10:22
ysandeep|roverrlandy, ack10:22
soniya29|ruckrlandy, ack10:22
rlandyysandeep|rover: soniya29|ruck: https://meet.google.com/kcm-dyso-xbq?pli=1&authuser=0 10:22
rlandysoniya29|ruck: https://bugs.launchpad.net/tripleo/+bug/1971465 fs001 and fs035 OVB jobs failing tempest - identity/haproxy connection errors10:29
ysandeep|roverhttps://review.opendev.org/q/topic:discover-latest-image 10:35
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-component-compute/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-standalone-compute-rhos-17/fedcc2e/logs/undercloud/home/zuul/install_packages.sh.log 10:44
rlandyProblem: package python3-tripleoclient-16.4.1-0.20220428005553.095182c.el8osttrunk.noarch requires python3-tripleo-common >= 15.2.0, but none of the providers can be installed10:44
ysandeep|roverrlandy, http://pastebin.test.redhat.com/1050013 10:53
soniya29|ruckrlandy, ysandeep|rover https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42601 11:13
rlandysoniya29|ruck: thank you - merging - pls revert once master promotes11:14
soniya29|ruckrlandy, sure11:14
soniya29|ruckysandeep|rover, cs8 standalone victoria is failing on tempest because of timed_out/connection issues11:17
ysandeep|roversoniya29|ruck, failure consistent?11:17
soniya29|ruckysandeep|rover, yes11:17
soniya29|ruckit is consistently failing for 3 runs11:17
ysandeep|roveron same test?11:17
rlandychandankumar: you joining review time?11:18
soniya29|ruckysandeep|rover, let me cross-verify for same tests11:18
soniya29|ruckysandeep|rover, not on same tests particularly but on compute, network and in one of the runs all the tests are failing11:19
soniya29|ruckbut failure reason seems similar11:19
rlandydviroel|out: we're looking at your review https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/42378 11:24
rlandycan you explain when you are in?11:25
*** dviroel|out is now known as dviroel11:35
* soniya29|ruck takes a break11:49
*** soniya29|ruck is now known as soniya29|ruck|afk11:49
*** ysandeep|rover is now known as ysandeep|afk11:56
*** soniya is now known as soniya|ruck|afk11:57
*** pojadhav is now known as pojadhav|break11:58
chandankumardviroel: please approve this one https://review.opendev.org/c/openstack/tripleo-ci/+/839149 , thanks!11:59
rlandysoniya29|ruck|afk: ysandeep|afk: going to BZ this https://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-component-glance/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-8-scenario004-standalone-glance-rhos-17/f6483d5/logs/undercloud/home/zuul/install_packages.sh.log 12:01
* dviroel looks12:03
*** pojadhav|break is now known as pojadhav12:29
rlandychandankumar: let's sync12:31
rlandyhttps://meet.google.com/ggp-fbnq-xea?pli=1&authuser=0 12:31
*** ysandeep|afk is now known as ysandeep|rover12:42
ysandeep|roverrlandy, ack 12:47
rlandyysandeep|rover: soniya|ruck|afk: master promoted - reverting criteria patch12:48
ysandeep|rovernice :)12:48
rlandydone12:48
ysandeep|roverperf issue on vexx is not consistent, podman ps is now returning in microseconds..12:49
rlandyysandeep|rover: welcome to all of last week :)12:51
rlandymaster not looking great12:52
rlandysoniya|ruck|afk: ysandeep|rover: let's rekick c8 line when your patch merges12:53
ysandeep|roverack12:53
ysandeep|roverwe don't create dstat.html on overcloud  nodes, would have helped us with performance issue.12:58
* ysandeep|rover checking if we can enable that for overcloud nodes 12:58
rlandypojadhav: just logging a bug - will be there in 5 13:00
pojadhavrlandy, yep13:00
rlandypojadhav: https://jenkins-cloudsig-ci.apps.ocp.ci.centos.org/view/phase-1-pipelines/ 13:11
rlandyhttps://github.com/rdo-infra/ci-config 13:11
dviroelysandeep|rover: would help a lot - yes13:14
*** chem_ is now known as chem13:17
*** chem is now known as Guest26813:18
*** chem_ is now known as chem13:23
rcastilloo/13:25
rlandyrcastillo: hello13:26
*** soniya|ruck|afk is now known as soniya|ruck13:43
rlandyysandeep|rover: soniya|ruck: added BZ to the rr hackmd13:43
soniya|ruckrlandy, ack13:43
rlandypinged lon and jon on it13:43
ysandeep|roverrlandy++13:43
rlandyysandeep|rover: soniya|ruck: https://review.opendev.org/c/openstack/tripleo-ci/+/840826 merged13:43
rlandyrekicking wallaby c8 line13:43
ysandeep|roverack13:43
rlandywill dequeue enqueue13:44
rlandyok - let's see what that does13:45
rlandyysandeep|rover: soniya|ruck: will rekick victoria afterwards13:45
rlandysoniya|ruck: your testproject failed on rhos-17 on rhel-913:48
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/99/407599/3/check/periodic-tripleo-ci-rhel-9-scenario010-standalone-rhos-17/8885df0/logs/undercloud/home/zuul/standalone_deploy.log 13:48
soniya|ruckrlandy, on it..checking them13:48
rlandyyou want to see if that is a consistent failure13:49
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/zuul/t/tripleo-ci-internal/builds?job_name=periodic-tripleo-ci-rhel-9-scenario010-standalone-rhos-17&project=testproject&skip=0 13:49
rlandynot too bad13:49
soniya|ruckthough seems unstable but13:50
rlandyhttps://sf.hosted.upshift.rdu2.redhat.com/logs/openstack-periodic-integration-rhos-17-rhel9/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-rhel-9-scenario010-standalone-rhos-17/459bc65/logs/undercloud/home/zuul/standalone_deploy.log 13:50
rlandysoniya|ruck: ^^ passed deploy - pls recheck again13:50
rlandyand watch to see if it is consistent13:50
soniya|ruckrlandy, yupp13:51
rlandysoniya|ruck: https://code.engineering.redhat.com/gerrit/c/testproject/+/407601 - a rekick won't help13:51
rlandyplease read the BZ logged13:51
rlandyrekick the rhel-9 one not the rhel-8 one13:51
soniya|ruckrlandy, https://code.engineering.redhat.com/gerrit/c/testproject/+/407599 13:52
soniya|ruckrekicked rhel9 one13:52
rlandyyep - that one - thank you13:52
*** pojadhav is now known as pojadhav|afk13:57
dasm|offo/13:59
*** dasm|off is now known as dasm13:59
rlandyoh great - now all the component jobs are red14:00
ysandeep|roverfolks, happy friday call o/14:01
* ysandeep|rover starts the cleanup script, last reproducer env - not replicating issue anymore14:04
rlandyysandeep|rover: thanks - just talking with reldel14:05
ysandeep|roverrlandy, For Monday i have following plan if vexx doesn't solve the issue by then, validate-perf test (which only run on undercloud right now) - make it run on overcloud node as well so that we have some data to show to vexxhost.14:07
rlandyysandeep|rover: ok - let's touch base before you are EoD14:07
rlandyare you out now?14:07
ysandeep|roverI am on happy friday call atm, but can sync in few mins before logging off14:08
rlandyok14:10
rlandypls ping14:10
ysandeep|roverrlandy: meet.google.com/fny-kpzc-seh 14:14
soniya|ruckshall i join?14:14
soniya|ruckysandeep|rover, rlandy ^^14:16
rlandysoniya|ruck: hey - we just dropped14:34
rlandysoniya|ruck: ysandeep|rover will carry on with vexx investigation14:35
rlandysoniya|ruck: can you keep an eye on the component lines? 14:35
rlandythank you14:35
soniya|ruckrlandy, ack14:35
rlandyI am checking into the c8 image build failure14:35
soniya|ruckrlandy, sure14:35
rlandywe have a real bug in c8 image builds14:41
rcastillopackage libguestfs-1:1.44.0-6.module_el8.7.0+1140+ff0772f9.x86_64 requires libvirt-daemon-kvm >= 8.0.0, but none of the providers can be installed14:45
rcastillorlandy: we're excluding libvirt*14:45
rlandyrcastillo: ack - in process of testprojecting putting that back14:47
rlandywe excluded it because the last build was bad14:47
soniya|ruckrlandy, standalone rhel8 full tempest scenario rhos-17 is failing on tempest test and is a consistent issue14:49
soniya|rucki think we need to file a bug for it14:49
rlandysoniya|ruck: pls paste links and I'll check14:50
soniya|ruckhttp://pastebin.test.redhat.com/1050121 14:53
rlandysoniya|ruck: one sec - starting testproject - then will review14:54
soniya|ruckrlandy, okay14:55
rlandyrcastillo: trying https://review.rdoproject.org/r/c/testproject/+/36254 14:57
rlandysoniya|ruck: k- looking 14:57
*** ysandeep|rover is now known as ysandeep|out14:58
*** ysandeep|out is now known as ysandeep14:58
rlandysoniya|ruck: looking through the tests ... let's sync on this ...15:04
rlandyhttps://meet.google.com/sgv-aafe-tuz?pli=1&authuser=0 15:04
ysandeeprlandy, fyi.. https://user-images.githubusercontent.com/435815/166965207-7279a94e-3361-498d-a206-3f89e7be954f.png load on vexx compute going as high as 500 - I am almost certain its because of that.15:12
ysandeepto me looks like host is stealing vm cpu cycles15:12
*** ysandeep is now known as ysandeep|out15:18
rlandyysandeep|out: ack- thanks for collecting that 15:19
*** dviroel is now known as dviroel|lunch15:40
soniya|ruckrlandy, https://bugzilla.redhat.com/show_bug.cgi?id=2082632 16:10
rlandyrcastillo: soniya|ruck: https://review.rdoproject.org/r/c/testproject/+/36254 - depends on should fix wallaby c8 image builds 16:12
rlandytesting OVB jobs now with those images16:12
rlandyhttps://review.rdoproject.org/r/c/testproject/+/39960 16:13
rlandylet's see what that does16:13
soniya|ruckrlandy, ack16:14
rlandychecking your bug now16:15
rcastillorlandy: nice16:15
rcastillodo we not need the cloud init downgrade after all?16:15
rcastilloor testing that later?16:15
rlandysoniya|ruck: editing to include rhel 8 rhos-1716:16
rlandyrcastillo: no - sandeep merged a patch to upgrade the cloud image16:16
rcastillorlandy: ack16:16
rlandysoniya|ruck: looks good - pls send an email to CIX that16:18
soniya|ruckrlandy, sure16:18
soniya|ruckrlandy, done16:24
rlandysoniya|ruck: k - great - next step would be to compare the versions of the rpms in the tempest component16:28
rlandyin component-ci-testing16:28
rlandyvs tripleo-ci-testing16:28
rlandyand see what is different16:28
rlandymay help you find the root cause16:28
soniya|ruckrlandy, okay..digging it16:29
rlandysoniya|ruck++16:29
*** jpena is now known as jpena|off16:29
soniya|ruckrlandy, i see some failures in cinder component as well..will check after above investigation16:30
rlandysoniya|ruck: cinder component is probably what we logged earlier16:30
rlandyinstall dependency failure16:30
rlandypinged lon about tat16:30
rlandythat16:30
rlandyalan commented in bug16:30
soniya|ruckrlandy, okay16:30
soniya|ruckrlandy, these are only issues with 17 rhel816:31
soniya|ruckapart from that all good 16:32
*** dviroel|lunch is now known as dviroel16:42
soniya|ruckrlandy, can you pass the link for component-ci-testing repo?16:51
soniya|rucki am not getting it16:52
rlandyhttps://osp-trunk.hosted.upshift.rdu2.redhat.com/rhel8-osp17/component/tempest/component-ci-testing/ 17:00
rlandysoniya|ruck: ^^17:00
soniya|ruckrlandy, thanks17:06
soniya|ruckrlandy, 17 rhel9 component line seems good17:29
soniya|ruckrhel9 multinode-octavia-rhos-17 is failing because of ssh time_out issue but is inconsistent17:32
soniya|ruckrlandy, leaving for today :)17:39
*** soniya|ruck is now known as soniya|out17:39
rlandythanks, soniya17:41
* dasm stepping away for a bit. bbl18:07
dasmback19:22
*** dviroel is now known as dviroel|afk20:11
* dasm is calling it a week21:09
dasmtake care Team!21:09
*** dasm is now known as dasm|off21:09
* rcastillo is also leaving21:33
rcastilloI'll see you on monday o/21:33
*** dviroel|afk is now known as dviroel23:53
*** dviroel is now known as dviroel|out23:56
