Thursday, 2021-05-27

*** jmasud has joined #oooq00:29
*** jmasud has quit IRC00:45
*** jmasud has joined #oooq00:47
*** jmasud has quit IRC01:07
*** zbr has quit IRC01:30
*** zbr has joined #oooq01:43
*** apetrich has quit IRC02:09
*** jmasud has joined #oooq02:10
*** jmasud has quit IRC03:52
*** ykarel has joined #oooq04:15
*** jmasud has joined #oooq04:22
*** ratailor has joined #oooq04:45
*** ratailor has quit IRC04:55
*** ratailor has joined #oooq04:59
*** ysandeep|away is now known as ysandeep|ruck05:05
*** jfrancoa has joined #oooq05:11
*** marios has joined #oooq05:11
*** ratailor_ has joined #oooq05:16
*** ratailor has quit IRC05:19
*** udesale has joined #oooq05:34
*** udesale has quit IRC05:36
*** udesale has joined #oooq05:38
*** jpodivin has joined #oooq06:01
akahatbhagyashris, https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3385506:10
*** marios has quit IRC06:12
*** marios has joined #oooq06:13
*** marios has quit IRC06:21
*** marios has joined #oooq06:27
*** apetrich has joined #oooq06:46
*** ykarel has quit IRC06:49
*** ykarel has joined #oooq06:50
*** jpena|off is now known as jpena06:55
*** ratailor__ has joined #oooq07:06
*** ratailor_ has quit IRC07:09
*** tosky has joined #oooq07:18
zbrquick review on https://review.rdoproject.org/r/c/config/+/33870 needed please07:43
zbris just a linter bumping07:43
*** jmasud has quit IRC07:54
zbrmarios: bhagyashris ?08:29
marioszbr: make me08:32
*** bogdando has joined #oooq08:32
bogdandohi folks. The zuul reproducer on centos stream on libvirt behaves oddly for me... There is "Generate nodepool main configuration" failing for some subnode with AnsibleError: An unhandled exception occurred while running the lookup plugin 'pipe'. Error was a <class 'ansible.errors.AnsibleError'>, original message: lookup_plugin.pipe(ssh-keyscan -t ed25519 <ip>) returned 108:33
bogdandohas anyone else had that as well?08:33
bogdandothat's ansible 2.9.2208:33
bogdandowhile ssh-keyscan -t ed25519  <ip> works on console...08:36
bogdandolesigh08:36
bogdandosshnaidm: ^^ perchance08:36
bogdandook, nvm, the issue is that after a subnode reboot, it doesn't always take its ip via dhcp... logging in and poking dhclient eth0 fixes that. wtf really...08:49
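[Editor's note] The failure above comes down to `ssh-keyscan -t ed25519 <ip>` being run before the rebooted subnode has an address. A minimal sketch of polling until the key scan succeeds, assuming a plain Python helper; the names `keyscan` and `wait_for_host_key`, the attempt count, and the delay are all hypothetical and are not taken from the reproducer role:

```python
import subprocess
import time
from typing import Callable, Optional


def keyscan(host: str) -> Optional[str]:
    """Run `ssh-keyscan -t ed25519 <host>`; return its output, or None on failure."""
    proc = subprocess.run(
        ["ssh-keyscan", "-t", "ed25519", host],
        capture_output=True,
        text=True,
    )
    output = proc.stdout.strip()
    if proc.returncode != 0 or not output:
        return None
    return output


def wait_for_host_key(
    host: str,
    scan: Callable[[str], Optional[str]] = keyscan,
    attempts: int = 10,
    delay: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll `scan(host)` until it yields a key, sleeping between attempts."""
    for attempt in range(attempts):
        key = scan(host)
        if key is not None:
            return key
        if attempt < attempts - 1:
            sleep(delay)  # give the node time to pick up its DHCP lease
    raise TimeoutError(f"no ed25519 host key from {host} after {attempts} attempts")
```

Calling `wait_for_host_key("<ip>")` would block until the node answers or the attempts run out. This only covers a node that is slow to come up; it does not help a node that never requests a lease, which is the case where `dhclient eth0` had to be run by hand.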
*** ykarel is now known as ykarel|lunch08:57
bogdandoPTAL https://review.rdoproject.org/r/c/rdo-infra/ansible-role-tripleo-ci-reproducer/+/3387609:08
*** bogdando has left #oooq09:34
sshnaidmysandeep|ruck, I'll do a patch, let's try to catch this ovb error and hold the environment09:47
ysandeep|rucksshnaidm, ack o/ thanks!09:47
*** arxcruz|rover has joined #oooq09:54
arxcruz|roverysandeep|ruck: can you see me now ?09:55
ysandeep|ruckarxcruz|rover, yes o/09:55
*** jmasud has joined #oooq10:00
sshnaidmysandeep|ruck, https://review.rdoproject.org/r/c/testproject/+/28004/12/zuul.yaml10:00
sshnaidmlet's run  jobs together to increase the chance10:01
*** ykarel|lunch has quit IRC10:02
ysandeep|rucksshnaidm: ack o/ do you have privileges to hold these nodes if required?10:02
sshnaidmysandeep|ruck, yep, I set them for hold in case of failure10:03
sshnaidmysandeep|ruck, ask fbo in rdo to get the same10:04
ysandeep|rucksshnaidm, yes, I already requested a token today from fbo and mhu.10:05
*** jmasud has quit IRC10:10
*** ysandeep|ruck is now known as ysandeep|brb10:20
*** ysandeep|brb is now known as ysandeep|ruck10:35
sshnaidmysandeep|ruck, I see multiple attempts for ovb also, it might be a problem with creating ovb stacks. Worth looking at how many stacks failed to create today.10:37
ysandeep|rucklet me check if we have an easy way to tell that.. http://tripleo-cockpit.usersys.redhat.com/d/Z3PbeiuWz/vexxhost?viewPanel=8&orgId=1&from=now-24h&to=now only gives an instantaneous report..10:41
ysandeep|rucksshnaidm, based on an estimate from ^^, around 13 stacks failed to create today. (I calculated the difference whenever there was a change in the number of stacks which are in CREATE_FAILED)10:51
ysandeep|ruckin last 24 hours.10:51
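[Editor's note] The estimate described above (adding up the changes in the CREATE_FAILED count between samples) can be sketched as follows. This assumes the counts are read off the panel as a simple list of integers, one per sample. The function name and the sample values are hypothetical, and only increases are counted, which is one reading of "the difference whenever there was a change":

```python
from typing import Iterable


def estimate_new_failures(samples: Iterable[int]) -> int:
    """Sum the positive jumps between consecutive CREATE_FAILED counts."""
    total = 0
    previous = None
    for count in samples:
        # A rise means new stacks entered CREATE_FAILED since the last sample;
        # a drop is treated as cleanup and ignored.
        if previous is not None and count > previous:
            total += count - previous
        previous = count
    return total


# Hypothetical counts, one per sample (not real data from the dashboard).
samples = [2, 2, 5, 5, 1, 4, 4, 0, 2]
print(estimate_new_failures(samples))  # prints 8
```

The result is a lower bound: a failure and a cleanup that land between the same two samples cancel out and go uncounted.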
sshnaidmysandeep|ruck, yeah, not good..10:54
sshnaidmysandeep|ruck, I think dpawlik had these stats somewhere in kibana also10:55
sshnaidminteresting to have logs from one of them..10:55
ysandeep|rucksshnaidm, ack i will request dpawlik10:56
* dpawlik reading10:57
ysandeep|ruckdpawlik, hello, basically is there any kibana report, which can tell us how many ovb stacks failed to create today?10:57
dpawlikysandeep|ruck: there was some, let me find it10:58
*** sshnaidm is now known as sshnaidm|afk10:58
dpawlikhmm, can't find it. Let me make a new one11:00
dpawlikysandeep|ruck: you can try to check https://review.rdoproject.org/analytics/goto/0d4595903e954d30d756e467200248db dashboard11:01
dpawlikprobably I got a backup of ovb stack creation fail11:01
dpawlikysandeep|ruck: try https://review.rdoproject.org/analytics/goto/f966e62bdda69bf511421b6014497d8211:07
*** ykarel has joined #oooq11:10
ysandeep|ruckdpawlik: thanks! as per report, no stack failed today..11:15
ysandeep|rucksshnaidm|afk,  https://review.rdoproject.org/analytics/goto/f966e62bdda69bf511421b6014497d82 not much today as per logstash11:17
dpawliklet me finish one thing and I will check if  metrics are fine11:20
dpawlikit may be that, compared to the old Kibana, some information went missing when restoring into the new Kibana11:20
dpawlikgive me a few min11:21
ysandeep|rucksure11:21
*** jlarriba has joined #oooq11:27
zbri need a review on https://review.rdoproject.org/r/c/config/+/33869 - and i am happy to explain to others the benefits of pinning test deps using pip-compile.11:28
zbrin fact we already use pip-compile in tripleo health project, but I think it is time to extend its use, mainly to reduce pypi-injected surprises11:29
dpawlikysandeep|ruck: do you have some example log?11:36
zbrbhagyashris: any chance to switch your rdo config to "Set new changes to "work in progress" by default"? https://review.rdoproject.org/r/settings/#Profile11:36
*** udesale has quit IRC11:37
zbrthat is very useful on rdo in particular as otherwise rdo will add people to the CR as soon as you create it, even if you do not want that.11:37
*** udesale has joined #oooq11:38
*** rlandy has joined #oooq11:38
ysandeep|ruckdpawlik: we have a logging of number of heat stack in different after every 15 mins: http://tripleo-cockpit.usersys.redhat.com/d/Z3PbeiuWz/vexxhost?viewPanel=8&orgId=1&from=now-24h&to=now11:41
bhagyashriszbr, not sure let me check11:42
ysandeep|ruckdpawlik: and we run a cleanup script every 4 hours, whose output contains the names of stacks that failed to create/delete http://paste.openstack.org/show/805791/11:42
ysandeep|rucks/different/ different states11:43
dpawlikysandeep|ruck: I was filtering only by message: "RUN END RESULT_TIMED_OUT" or "\"attempts\": 18"11:44
ysandeep|ruckdpawlik, I guess we need to tune the search query which can capture all the reasons an ovb heat stack creation can fail with.11:49
dpawlikysandeep|ruck: yup + set uniq build_uuid to get correct results11:51
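[Editor's note] The filter discussed above (match either message pattern, then count each `build_uuid` once) can be sketched in plain Python. This assumes log records are available as dictionaries with `message` and `build_uuid` keys; that record shape, the function name, and the sample records are hypothetical and are not the actual Kibana query:

```python
from typing import Iterable, Mapping, Set

# The two message patterns quoted in the discussion above.
PATTERNS = ("RUN END RESULT_TIMED_OUT", '"attempts": 18')


def failed_builds(records: Iterable[Mapping[str, str]]) -> Set[str]:
    """Return the distinct build_uuid values whose message matches a pattern."""
    uuids: Set[str] = set()
    for record in records:
        uuid = record.get("build_uuid")
        message = record.get("message", "")
        # Skip records without a build_uuid; a set collapses repeated lines
        # from the same build into a single entry.
        if uuid and any(pattern in message for pattern in PATTERNS):
            uuids.add(uuid)
    return uuids


# Hypothetical records for illustration only.
records = [
    {"build_uuid": "aaa", "message": "RUN END RESULT_TIMED_OUT: [untrusted]"},
    {"build_uuid": "aaa", "message": 'fatal: {"attempts": 18, "changed": false}'},
    {"build_uuid": "bbb", "message": "RUN END RESULT_NORMAL"},
    {"build_uuid": "ccc", "message": 'fatal: {"attempts": 18}'},
]
print(len(failed_builds(records)))  # prints 2
```

Counting distinct builds rather than matching lines keeps one build that logs both patterns from being counted twice. As noted in the chat, the pattern list would still need extending to cover every way an OVB stack creation can fail.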
dpawliklet me fix gerrit user creation issue, then I will check the visualization11:51
ysandeep|ruckdpawlik: sure thanks for help!11:52
rlandypojadhav: do you want to meet?12:02
rlandyotherwise we can cancel12:02
pojadhavwe can12:02
rlandymarios: hey - is there work around updates/upgrades or is that mostly on hold?12:10
weshay|ruckysandeep|ruck, UGH.. https://review.opendev.org/c/openstack/tripleo-quickstart/+/79314512:10
weshay|ruckthis is going to be a slow patch to merge12:10
rlandymarios: chatting with pojadhav about taking on a task12:10
ysandeep|ruckweshay|ruck: yeah i have been doing recheck dance \o/ since morning to merge these patches.12:11
mariosrlandy: o/12:12
mariosrlandy: what do you mean work around updates/upgrades?12:12
mariosrlandy: for our sprint you mean?12:12
mariosrlandy: i have started working on https://projects.engineering.redhat.com/browse/TRIPLEOCI-496 & https://review.opendev.org/q/topic:wallaby-upgrade-jobs12:13
rlandymarios: ^^ yep that task12:13
mariosrlandy: mainly looking at re-adding master undercloud upgrade voting and also wallaby undercloud/minor update12:13
rlandymarios: is there work for another person there?12:14
mariosrlandy: pojadhav: feel free to pick up any of the other tasks?12:14
mariosrlandy: there are a whole lot of tasks12:14
weshay|ruckysandeep|ruck, arxcruz|rover let's sync in 15min12:14
mariosrlandy: pojadhav: e.g. 38912:14
ysandeep|ruckweshay|ruck, ack12:14
weshay|ruckysandeep|ruck, arxcruz|rover want to show you something cool12:14
rlandymarios: can you join https://meet.google.com/hfu-icdo-kew?authuser=0 for a few?12:14
mariosrlandy: pojadhav: or 39012:14
mariosrlandy: sure12:15
weshay|ruckarxcruz|rover, you avail?12:18
arxcruz|roverweshay|ruck: yes, sorry12:18
weshay|ruckcool.. sync in 12 min.. you'll like this12:18
arxcruz|roverweshay|ruck: ysandeep|ruck let's sync whenever you guys want12:18
ysandeep|ruckarxcruz|rover, in 12 mins, weshay|ruck sent an invite12:19
ykarelysandeep|ruck, is the cause for overcloud deploy already known https://bugs.launchpad.net/tripleo/+bug/1929745 ?12:25
openstackLaunchpad bug 1929745 in tripleo "Unchecked MSR access error - overcloud deploy "timed out waiting for ping module test" [Critical,Triaged]12:25
ykarelnoticed in a test patch and reached to the bug12:26
*** sshnaidm|afk is now known as sshnaidm12:28
ysandeep|ruckykarel: nope, so far we noticed the failing node is using a different BIOS version; probably compute nodes in the vexx cloud have different versions of libvirt/kvm. sshnaidm|afk is running a testproject with autohold to debug that.12:29
ysandeep|ruckykarel, more discussion happened on #tripleo around this http://eavesdrop.openstack.org/irclogs/%23tripleo/%23tripleo.2021-05-27.log.html#t2021-05-27T09:54:2012:29
ykarelok /me check logs12:29
ykarelhave we already checked in #rhos-ops?12:30
ykarelI saw a discussion related to upgrade, maybe it was for this env only12:30
ysandeep|ruckykarel, not yet12:30
ysandeep|ruckarxcruz|rover, meet.google.com/dfz-xszy-pdo12:31
ykarelack12:31
rlandyysandeep|ruck: arxcruz|rover: weshay|ruck: ok to merge https://review.rdoproject.org/r/c/rdo-jobs/+/33861 Re-enable image builds for OVB check jobs?12:34
ysandeep|ruckrlandy, +112:35
weshay|ruckysandeep|ruck, arxcruz|rover https://opendev.org/openstack/ansible-role-collect-logs/src/branch/master/roles/collect_logs/defaults/main.yml#L36812:42
weshay|ruckhttp://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_040/793154/3/gate/tripleo-ci-centos-8-content-provider/0406160/logs/undercloud/var/log/extra/logstash.txt12:43
dpawlikysandeep|ruck: checking logs, would add some new filters12:48
mariosscrum cancelled today?13:01
mariosbhagyashris: ?13:01
*** jbadiapa|away is now known as jbadiapa13:02
*** jfrancoa has quit IRC13:03
*** jfrancoa has joined #oooq13:04
dviroeli'll need to restart here, screen is not responding13:05
*** pojadhav is now known as pojadhav|brb13:05
*** jpodivin has quit IRC13:07
*** jpodivin has joined #oooq13:08
*** jfrancoa has quit IRC13:09
*** jfrancoa has joined #oooq13:11
weshay|ruckarxcruz|rover++13:12
weshay|ruckysandeep|ruck++13:12
*** ratailor__ has quit IRC13:15
weshay|ruckysandeep|ruck, keep an eye on the tripleo component for 16.214:04
weshay|ruckwe have some folks getting itchy for promotions14:04
* weshay|ruck looking14:04
ysandeep|ruckweshay|ruck, it was blocked on something, let me recall14:05
ysandeep|ruckweshay|ruck, yeah infrared job was failing with bug: https://bugzilla.redhat.com/show_bug.cgi?id=196261714:06
openstackbugzilla.redhat.com bug 1962617 in distribution "16.2 Couldn't install tripleo-ansible-0.7.1-2.20210513023001.3c614f9.el8ost.noarch: nothing provides ansible-collection-ansible-netcommon /-ansible-posix/ -community-general / -containers-podman." [Urgent,Closed: notabug] - Assigned to rhos-maint14:06
ysandeep|rucki think sagi reverted those patches.. ^^ i think we might be in better position.14:07
pojadhav|brbweshay|ruck, ysandeep|ruck : https://code.engineering.redhat.com/gerrit/c/openstack/rrcockpit/+/24338814:07
pojadhav|brbweshay|ruck, rlandy : pls approve https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3386814:08
*** pojadhav|brb is now known as pojadhav14:08
weshay|ruckrlandy, https://review.rdoproject.org/r/c/config/+/3386914:11
weshay|ruckpojadhav, thanks.. please start including a link to a screenshot14:30
weshay|ruckpojadhav, https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/33868 comment14:34
weshay|ruckysandeep|ruck, dang it.. https://review.opendev.org/c/openstack/tripleo-quickstart/+/793145 is not going into the gate.. due to https://review.opendev.org/c/openstack/tripleo-quickstart/+/793098/14:41
weshay|ruckthose did not need to be rebased on top of each other14:42
ysandeep|ruckweshay|ruck, yes, they need it, i.e. a rebase or Depends-On, https://review.opendev.org/c/openstack/tripleo-quickstart/+/793145 - containers-multinode branched jobs will fail otherwise.14:49
*** jmasud has joined #oooq14:53
weshay|ruckoh.. because of openvswitch14:56
weshay|ruckk14:56
weshay|ruckfair enough.. I'll watch the patches through14:56
zbrFYI: https://github.com/containers/podman/issues/10469 -- occasion to provide some feedback if you want to improve podman UX14:58
zbrweshay|ruck: does https://review.rdoproject.org/r/c/config/+/33869/6/README.md explain it?14:59
weshay|ruckzbr, thanks.. I think we can merge that15:04
* weshay|ruck adjusting irc/znc15:05
weshay|ruckmay bounce around15:05
*** weshay|ruck has quit IRC15:05
zbri will try to get few more eyes on it especially as this would be used as a model for other repos where we have the same need, i have the molecule ones in mind in particular.15:07
rlandyysandeep|ruck: are we still blocked on content providers?16:01
rlandyhttps://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/77257116:02
ysandeep|ruckrlandy, yes16:02
rlandytrying to get that review through16:02
rlandyysandeep|ruck: pls can you post the review to fix so I can depends-on or rebase16:02
*** marios has quit IRC16:02
ysandeep|ruckrlandy, sure sec16:02
rlandythank you16:03
ysandeep|ruckrlandy, https://review.opendev.org/c/openstack/tripleo-quickstart/+/793098/16:04
*** ykarel has quit IRC16:09
*** jmasud has quit IRC16:10
rlandycool16:13
*** jpodivin has quit IRC16:19
*** jmasud has joined #oooq16:20
*** udesale has quit IRC16:20
*** jpena is now known as jpena|off16:24
*** jmasud has quit IRC16:31
*** jmasud has joined #oooq16:32
*** amoralej is now known as amoralej|off16:32
*** ysandeep|ruck is now known as ysandeep|away16:36
ysandeep|awayarxcruz|rover, see you on monday o/16:37
*** jmasud has quit IRC16:57
*** Tengu has quit IRC17:00
*** jmasud has joined #oooq17:21
*** jmasud has quit IRC17:29
*** jmasud has joined #oooq18:09
*** jmasud_ has joined #oooq18:13
*** jmasud has quit IRC18:16
*** jmasud_ has quit IRC18:29
*** jmasud has joined #oooq18:52
*** jmasud has quit IRC18:58
*** jmasud has joined #oooq19:01
*** jfrancoa has quit IRC19:03
*** slaweq has quit IRC20:32
*** jbadiapa has quit IRC20:45
*** dsneddon has quit IRC20:58
*** jmasud has quit IRC21:04
*** jmasud has joined #oooq21:04
*** jmasud has quit IRC21:06
*** jmasud has joined #oooq21:29
*** jmasud has quit IRC21:35
*** slaweq has joined #oooq21:48
*** jmasud has joined #oooq21:52
*** jmasud has quit IRC22:09
*** slaweq has quit IRC22:24
*** dsneddon has joined #oooq22:49
*** tosky has quit IRC23:12
