Monday, 2019-12-30

*** ysandeep has joined #oooq01:08
*** ykarel has joined #oooq03:49
*** epoojad1 has joined #oooq04:39
*** udesale has joined #oooq04:46
*** ykarel has quit IRC05:13
*** epoojad1 has quit IRC05:15
*** ykarel has joined #oooq05:28
*** ratailor has joined #oooq05:39
*** epoojad1 has joined #oooq05:42
*** bhagyashris has joined #oooq05:46
*** bhagyashris_ has joined #oooq05:49
*** bhagyashris has quit IRC05:52
*** surpatil has joined #oooq06:18
*** marios has joined #oooq06:23
*** bhagyashris_ is now known as bhagyashris06:59
*** saneax has joined #oooq07:02
*** SurajPatil has joined #oooq07:12
*** marios has quit IRC07:13
*** surpatil has quit IRC07:15
*** marios has joined #oooq07:16
*** surpatil has joined #oooq07:33
*** SurajPatil has quit IRC07:36
*** SurajPatil has joined #oooq07:42
*** surpatil has quit IRC07:44
*** ykarel is now known as ykarel|lunch07:48
*** surpatil has joined #oooq07:50
*** SurajPatil has quit IRC07:52
*** saneax has quit IRC08:22
*** ratailor_ has joined #oooq08:45
*** ratailor has quit IRC08:48
*** ykarel|lunch is now known as ykarel08:55
*** SurajPatil has joined #oooq09:09
*** surpatil has quit IRC09:11
*** surpatil has joined #oooq09:28
*** SurajPatil has quit IRC09:31
*** ysandeep has quit IRC09:32
zbr|ruckmorning09:44
marioso/09:44
marioszbr|ruck: hi i filed that fyi (didn't know you were ruck) https://bugs.launchpad.net/tripleo/+bug/1857884 & bump timeout at https://review.opendev.org/#/c/700764/09:44
openstackLaunchpad bug 1857884 in tripleo "centos-7-master-containers-build-push time out pushing containers" [Critical,Triaged]09:44
marioszbr|ruck: blocks the master promotion looks like it has timed out for over a week09:44
zbr|ruckmarios: my guess is that removing timeout will decrease its value!09:46
zbr|ruckit was there to increase it, not to decrease it.09:46
*** saneax has joined #oooq09:46
marioszbr|ruck: no it defaults to 3 hours from base09:46
marioszbr|ruck: but please sanity check if you think that is wrong09:48
zbr|ruckmarios: why would we want to put 2h in that case?09:48
zbr|ruckto fail faster? the reality is that if we don't build containers in 2h, we will not be able to do other stuff. and the total job duration on zuul is enforced.09:48
marioszbr|ruck: i can't remember why but i guess at the time it was taken from the same value used downstream https://github.com/openstack/tripleo-ci/commit/4ffc30df03aca056f798d47f4bdfde43f331301a09:48
marioszbr|ruck: but it isn't failing to build09:48
marioszbr|ruck: it is timing out during the push (see bug)09:48
zbr|ruckwell, no push means failure to deliver09:49
zbr|ruckmarios: now the funny part, I can only add +1 to it.09:49
zbr|rucki have no extra permissions.09:50
marioszbr|ruck: ok so since master is REALLY red now i suggest we increase that timeout even if we end up reverting it because we find some better way09:50
marioszbr|ruck: ack thanks for checking the review09:50
zbr|ruckmarios: sure, I am ok to merge it, but I cannot do it.09:50
marioszbr|ruck: ack np we can wait a bit more09:50
zbr|ruckmarios: at some point I lost hope of getting core on tripleo.09:51
*** sanjayu_ has joined #oooq09:52
marioszbr|ruck: i added it to https://etherpad.openstack.org/p/ruckroversprint19 (are you using a different etherpad?)09:53
marioszbr|ruck: do you have rover or are you lone ranger?09:53
ykarelmarios, /me commented on the review, as it looks an actual issue to me09:54
ykareland increasing timeout that much doesn't looks good09:55
ykarelas other working/pushing too better09:55
*** saneax has quit IRC09:55
mariosykarel: ack thanks... indeed train/stein are fine at 2 hours... and it looks like it started ~ 23rd... but still even if temporary we should bump it so we can promote master09:58
ykarelmarios, /me added what i suspect and testing it to confirm10:01
ykareland just for promotion we can use testproject10:01
ykarelalso there was a good hash from 26th which passed all jobs accept fs02010:01
ykarels/accept/except10:01
mariosykarel: ack but not that simple we need the whole pipeline (i have one there if needed https://review.rdoproject.org/r/#/c/24299/ )10:02
ykarelso fs020 is also a promotin  blocker10:02
ykarelyes agree with that ^^, my point was mainly that there is a way to promote10:03
ykarelso if you can debug fs020 issue it will be good, meanwhile i checking container build10:03
marioszbr|ruck: ^^ fyi fs20 master10:03
mariosykarel: i didn't see fs20 since if container build fails nothing else runs10:04
mariosykarel: must have failed on earlier run?10:04
ykarelmarios, u can check build history10:04
ykarelyes it's timing out from last couple of days10:04
mariosykarel: ack thanks10:04
zbr|ruckmarios: i am alone10:04
marioszbr|ruck: k will try help you a bit... looking at stein now (train looks good promoted today already)10:05
zbr|rucki would be cool to have a soft-timeout option, failing the job when is reached but not killing it. this would allow us to see if a small bump is needed, or we are beyond help.10:07
*** ykarel is now known as ykarel|afk10:19
*** ykarel|afk has quit IRC10:25
*** bogdando has joined #oooq10:42
*** bogdando has quit IRC10:42
*** ykarel|afk has joined #oooq11:09
*** ykarel|afk is now known as ykarel11:10
ykarelmarios, my test resulted good https://review.rdoproject.org/r/#/c/24321/11:10
ykarelso next master run in 1 hour should not hit timeout in container build11:11
mariosykarel: ack thanks but what did you change?11:11
mariosykarel: i don't see depends-on at that test11:12
ykarelmarios, as said earlier issue is in infra side11:12
ykarelthat happened due to ppc jobs tags and component promotion pipeline in master11:12
mariosykarel: but its pretty consistent. it has been timing out for almost a week now11:12
marioshttps://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-centos-7-master-containers-build-push11:13
ykarelmarios, so there were too many tags, i cleaned up tags older than 4 days for master11:13
ykarelmarios, if u notice timings ^^, u will find timings are increasing11:13
ykarelfrom day by day11:13
mariosykarel: ah ok, was there a change we can point to (for tags cleanup) or this is manual thing11:13
ykarelmarios, it's happens automatically daily, but due to too much tags push due to ppc and component promotion, this cleanup went insufficient11:14
ykarelmarios, /me will post the findings on bug itself in some time11:15
ykarelafter the next periodic run11:15
mariosykarel: ack ok11:15
ykarelthere are multiple things to taken care though, like dlrn api for component pipeline is different11:15
ykarelwill post everything that i found on bug, and then we will see how to take care11:16
ykarelboth temporary and permanent measure11:16
*** epoojad1 is now known as epoojad1|afk11:35
*** epoojad1|afk has quit IRC11:40
*** bhagyashris has quit IRC12:35
*** ratailor_ has quit IRC12:38
*** udesale has quit IRC12:44
*** sanjayu_ has quit IRC13:17
ykarelkopecmartin, can u check https://bugs.launchpad.net/tripleo/+bug/185736513:19
openstackLaunchpad bug 1857365 in tripleo "Queens, Rocky fs020 tempest test are failing ServersOnMultiNodesTest and TestSecurityGroupsBasicOps" [Critical,Triaged]13:19
ykarelpossibly related to tempestconf bump in queens/rocky13:19
ykarelspecificly relate to commit that added min_compute_nodes param13:19
*** rfolco has joined #oooq14:02
*** saneax has joined #oooq14:12
*** surpatil has quit IRC14:18
*** epoojad1 has joined #oooq14:33
ykarelmarios, added comment re. container build issue https://bugs.launchpad.net/tripleo/+bug/1857884/comments/214:59
openstackLaunchpad bug 1857884 in tripleo "centos-7-master-containers-build-push time out pushing containers" [Critical,Triaged]14:59
ykarellet me know in case something is not clear14:59
mariosykarel: thanks checking zbr|ruck fyi that master build tags issue see ^^15:02
zbr|ruckdogfooding docker registry should come with some warnings about chocking with some bones15:04
*** ykarel is now known as ykarel|away15:21
*** epoojad1 has quit IRC15:35
weshaymarios++ ykarel++15:49
marioso/ weshay15:54
weshayhey brotha15:55
mariosweshay: np noticed this morning that master was dying for a week now and that push job was timing out for days15:55
weshayaye.. looks like a registry cleanup fixed it?15:55
mariosweshay: initially prposed a timeout bump but ykarel|away helped with the reg cleanup15:55
mariosweshay: well he proved it in test project but we didn't see a periodic yet15:55
weshayalso have other rdo issues.. jobs hitting retry_limit15:56
mariosweshay: not sure if there is something else wrong there though...15:56
mariosright15:56
mariosweshay: i mean i thought it would have run already15:56
mariosbut15:56
marioshttps://review.rdoproject.org/zuul/builds?pipeline=openstack-periodic-master15:56
mariosonly once today :/15:56
mariosso something else is up15:56
weshaynot much we can do if infra isn't working quite right atm15:56
weshayI'll write something up later today.. I have a quick doctors app this morning.. will be back in about an hour15:57
weshaymarios, thanks for the tip re: the visa15:57
mariosweshay: ack i am going to go in a bit ... back on friday... maybe on thursday not sure yet :D depends how crazy the kids make me15:58
weshayaye :)15:58
weshaythanks man15:58
mariosweshay: np doing my job :) but I'll take the thanks15:58
weshaytrain is promoting often.. which is awesome.. to be expected that master blows up15:58
mariosweshay: yes, i started typing that and stopped to not jinx it15:59
mariosthanks weshay !15:59
marios;)15:59
weshayheh16:02
rfolcozbr|ruck, quick question... assert: that: is not working, do you know what is wrong ? I tried without {{ }} and with it. http://logs.rdoproject.org/99/24199/5/check/periodic-tripleo-centos-7-master-component-compute-promote-to-current-tripleo/d2ef778/job-output.txt16:13
zbr|ruckrfolco: i did not use assert myself, i used fail with when.16:13
rfolcozbr|ruck, good idea16:14
zbr|rucki can only test in an isolated playbook to see how it works.16:14
rfolcozbr|ruck, my local test works though, with and without {{ }}16:14
zbr|ruckversion of ansible different?16:14
rfolcomaybe16:14
zbr|ruckrfolco: documentation is clear16:15
zbr|rucktry to use only a list16:15
zbr|ruckmaybe is a bug which evaluates from a non list16:15
zbr|rucki personally do not fancy seeing conditions inside strings, so i would rather prefer writing a list with one element16:16
rfolcozbr|ruck, problem is that I only know if works if I merge the change, coz local works16:16
rfolcozbr|ruck, any issues with that: - reported_jobs.stdout | from_json | json_query(query) != []16:17
zbr|ruckwithout testing I would not be able to say, add a debug before16:17
rfolco(it wasn't in a list btw)16:17
*** ykarel|away has quit IRC16:18
zbr|ruckrfolco: that condition is prone to false-positives16:19
rfolcozbr|ruck, how?16:20
zbr|ruckif jinja evaluates as "null" condition would be true. i suspect assert would evaluate to true too often?16:20
*** saneax has quit IRC16:20
*** saneax has joined #oooq16:20
zbr|ruckbut the best way to find out is to add a debug line before that and log the full evaluation, so we know what happens.16:20
rfolconull is covered in a previous fail when check16:20
zbr|ruckrfolco: add a debug line and lets merge it16:21
rfolcook16:21
*** ykarel|away has joined #oooq16:28
marioshappy new year folks see you later this week16:39
marioso/ rfolco missed you today :D16:39
rfolcoo/16:40
rfolco?16:40
mariosrfolco: just mean i thought you were out and it was just me and zbr|ruck16:40
rfolcomarios, atypical day16:41
mariosrfolco: yeah ... grafana has 97% pass rate for upstream jobs... so yeah ;)16:41
mariosvery quiet16:41
mariosanyway have a good rest of week if you're working16:42
marioshappy new year either way !16:42
marios\o/16:42
* marios out16:42
rfolcohappy new year marios16:42
rfolcozbr|ruck, https://review.rdoproject.org/r/2432516:50
rfolcoplease ?16:50
*** marios has quit IRC16:51
zbr|ruckcommented16:54
zbr|ruckprinting only stdout may prove not-enough16:54
zbr|ruckin fact stdout is likely already saved in zuul json file.16:54
rfolcohow about now17:12
rfolcohttps://review.rdoproject.org/r/#/c/24325/17:12
rfolcozbr|ruck, ^17:12
zbr|ruckrfolco: ouch, that looks really ugly. i would probably want to dump criteria value to a file and test outside.17:14
rfolcozbr|ruck, dump to a file?17:16
rfolcothis? json_query(query)17:16
zbr|ruckrfolco: can we test this before merging? i cannot do a proper review just by looking at that code.17:17
rfolcozbr|ruck, I can paste the local test if you like17:17
zbr|ruckAFAIK you do not need credentials to read from DLRN, which means that this playbook could be encapsulated in a test, right?17:19
rfolcozbr|ruck, my local test emulates the dlrnapi call w/ a sample json output... 3 test cases there: empty, [], and valid json reponse.17:26
rfolcozbr|ruck, see http://paste.openstack.org/show/787971/17:26
*** ykarel|away has quit IRC17:54
*** sanjayu_ has joined #oooq18:10
*** saneax has quit IRC18:13
*** rfolco has quit IRC19:54
*** d0ugal has joined #oooq20:32
*** sanjayu_ has quit IRC20:32
*** tosky has joined #oooq20:33
*** d0ugal has quit IRC21:49
*** tosky has quit IRC23:17

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!