Monday, 2020-02-03

*** holser has quit IRC00:56
*** holser has joined #oooq01:06
*** jmasud has joined #oooq01:14
*** holser has quit IRC01:29
*** jmasud has quit IRC01:45
*** jmasud has joined #oooq01:46
*** jmasud has quit IRC01:58
*** jmasud has joined #oooq02:00
*** jmasud has quit IRC02:21
*** jmasud has joined #oooq02:22
*** jmasud has quit IRC03:06
*** ykarel|away is now known as ykarel04:07
*** raukadah is now known as chkumar|rover04:55
*** jmasud has joined #oooq05:18
*** soniya29 has joined #oooq05:45
*** skramaja has joined #oooq05:49
*** udesale has joined #oooq05:49
* chkumar|rover headed to wework06:04
*** sanjayu_ has joined #oooq06:12
*** marios has joined #oooq06:14
*** ratailor has joined #oooq06:17
*** sanjayu__ has joined #oooq06:22
*** sanjayu_ has quit IRC06:25
*** jfrancoa has joined #oooq06:52
*** sanjayu__ has quit IRC06:56
*** saneax has joined #oooq06:58
*** dtantsur|afk is now known as dtantsur07:20
*** jbadiapa has joined #oooq07:39
chkumar|roverzbr, morning, do we still have missing patches to fix ovb logs? https://logserver.rdoproject.org/openstack-periodic-master/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-7-ovb-1ctlr_2comp-featureset020-master/7a86edf/logs/ logs are not getting collected07:40
zbrnope, but i hope sagi will look into it. i had the -vvvv removal which was blocked.07:44
*** ykarel is now known as ykarel|lunch07:51
*** kopecmartin has joined #oooq08:00
*** jmasud has quit IRC08:13
*** jmasud has joined #oooq08:14
*** tesseract has joined #oooq08:14
*** apetrich has joined #oooq08:16
zbri think someone brought up isort last week. in short, isort is good but we need to wait about a month until they finish addressing conflicts with black.08:25
*** tosky has joined #oooq08:29
arxcruzchkumar|rover: wondering why fs021 is running on tempest project...08:58
chkumar|roverarxcruz, need to do a git blame there08:58
arxcruzmarios: so, I was able to run full tempest on fs020 with 4:30 min increasing the cpu and the concurrency to 408:59
mariosarxcruz: ack i recall you saying that on friday09:00
mariosarxcruz: maybe add note on https://tree.taiga.io/project/tripleo-ci-board/task/1383 and/or comment on the reviews? we can discuss this afternoon on calls?09:00
*** yolanda has joined #oooq09:03
*** d0ugal has quit IRC09:29
*** ykarel|lunch is now known as ykarel09:29
*** sshnaidm|off is now known as sshnaidm09:30
*** d0ugal has joined #oooq09:31
sshnaidmzbr, the logs problem is not related to -vvvv, it started with last gzipping, I hope you'll check it. Actually w/o debug we won't be able to find the problem09:31
zbrsshnaidm: i am not going to check it, i already pointed to the dangerous piece of code, which I consider a "time-bomb": it has zero protection against becoming a DDOS.09:35
sshnaidmzbr, what are you talking about? I'm talking about logs are not collected09:35
zbrif anyone wants, they can put a gz to that -vvvv line.09:35
zbri am ready to bet that ansible dies writing to that debug log file.09:36
sshnaidmzbr, debug file worked last 2 years, instead of blaming long working code I think it's worth to check last changes09:37
zbrsshnaidm: sorry but i will let you and wes deal with that aspect, i am with alex side on this.09:38
sshnaidmzbr, we are talking about logs are not collected09:38
zbri am not saying that is the only issue, but I consider this one a blocker.09:38
*** derekh has joined #oooq09:39
sshnaidmzbr, there are no sides here, there are logs not collected09:39
zbrthe code that fails to run to finish is the one with -vvvv09:39
sshnaidmzbr, I'm not sure you're serious now..09:39
sshnaidmzbr, here's the code that ran without -vvvv, where are the logs? https://logserver.rdoproject.org/59/705059/2/openstack-check/tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset001/53209e9/logs/09:41
sshnaidmzbr, maybe you can leave this debug alone and we can start working on a real problem?09:41
sshnaidmchkumar|rover, do you know if we had changed podman recently?09:42
*** bogdando has joined #oooq09:43
*** Tengu has quit IRC09:46
*** Tengu has joined #oooq09:48
*** Tengu has quit IRC09:54
*** Tengu has joined #oooq09:55
*** jaosorior has joined #oooq10:01
zbrchkumar|rover: should we install missing tools during collection? like lsof, lvm2, netstat, lspci, pstree?10:02
*** Tengu has quit IRC10:03
*** skramaja has quit IRC10:03
*** Tengu has joined #oooq10:03
zbrcurrently these are not installed and the collecting commands are likely failing. we have two options: a) install them, or b) make collection command run only when the tools are installed.10:04
zbri would personally install lsof all the time, but lvm2 does not make much sense for containers for example.10:05
zbrso maybe adding only lsof and netstat and pstree, which makes sense in all use-cases, and the other two "hw" specific, to run only when already installed.10:05
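Option b) above (run a collection command only when its tool is installed) can be sketched roughly as follows. This is a minimal Python illustration, not the collect-logs role itself: the COLLECT_COMMANDS mapping and the runnable_commands helper are hypothetical names, and the real command list is not shown in this log.

```python
import shutil

# Hypothetical mapping of tool name -> collection command; the actual
# commands used during log collection are an assumption here.
COLLECT_COMMANDS = {
    "lsof": ["lsof", "-P", "-n"],
    "pstree": ["pstree", "-p"],
    "lspci": ["lspci"],
}


def runnable_commands(commands, which=shutil.which):
    """Return only the commands whose tool is found on PATH.

    `which` is injectable so the lookup can be replaced in tests.
    """
    return {tool: argv for tool, argv in commands.items() if which(tool)}


if __name__ == "__main__":
    for tool, argv in runnable_commands(COLLECT_COMMANDS).items():
        print(tool, "->", " ".join(argv))
```

Filtering up front keeps a missing binary from failing the whole collection step, at the cost of silently skipping data when a tool is absent.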
*** holser has joined #oooq10:07
*** Tengu has quit IRC10:08
sshnaidmzbr, you have "ss" instead of netstat10:09
sshnaidmlvm2 doesn't make sense in upstream, but might be useful in infrared case I think10:09
zbrsshnaidm: my question was re https://review.opendev.org/#/c/705175/10:10
zbrmaybe we should not add any new rpm yet, and delay that for other changes.10:10
sshnaidmwe can see in logs that collection stops on the containers part, I think copying files from podman containers may be stuck forever10:10
zbrfirst we need to move the package list somewhere where it is configurable.10:10
zbrouch.. doing cp with podman was a known issue.10:11
sshnaidmzbr, sorry, I'm focusing now on getting logs back, I don't care in which form10:11
zbrsure. makes sense.10:11
zbrwe would have a big laugh if we discover that this was a bug introduced by newer podman.10:11
zbri think they upgraded it recently.10:11
zbrmaybe we should try to switch molecule to use podman when testing this job, we may have a revelation.10:12
*** Tengu has joined #oooq10:13
*** Tengu_ has joined #oooq10:14
*** Tengu_ has quit IRC10:15
chkumar|roversshnaidm, nope10:30
chkumar|roversshnaidm, Any issue you are seeing?10:30
sshnaidmchkumar|rover, I don't know yet, maybe issues with copying files from containers10:31
sshnaidmzbr, I don't understand your statement here https://review.opendev.org/#/c/705347/3/tasks/collect/container.yml10:39
sshnaidmzbr, what is the issue you see?10:39
zbrthere is no need to load container engine into an ansible fact.10:40
zbrmainly you explode one shell task into 3 tasks w/o good reasons.10:40
zbrputting that one big shell script in-line was an old mistake, how about moving that script into a standalone file?10:41
sshnaidmzbr, I think it's reasonable improvement10:41
zbrit would be easier for us to edit it, lint it, maintain it.10:42
zbrit would be one standalone piece of bash10:42
zbrthe best part is that it doesn't even have to be a template, it can be pure bash.10:42
zbrinline shell in ansible is good until you reach ~15 lines, after it can become a liability.10:43
zbrin fact you could have sorted the original bug with a very simple hack10:44
sshnaidmzbr, what is the issue you see in this patch?10:44
zbrengine=`command -v podman docker|head -n1`10:44
zbri doubt we have any single production place with both engines used.10:44
sshnaidmzbr, no, that won't work10:45
zbrwhy?10:45
sshnaidmzbr, that's the problem, we have always both engines10:45
sshnaidmzbr, did you look at output of this task in jobs?10:46
zbri know we have both on molecule jobs, but i was not aware about doing this trick on other places.10:46
zbrdo you want me to try to make that shell a script and combine your change?10:47
zbrhappy to help, and keep the detection logic proposed10:47
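The objection to the one-liner above is that picking the first engine found on PATH always yields podman when both engines are installed. A rough Python sketch of the alternative, listing every installed engine so the caller can decide which ones to query, is shown here. The installed_engines name is hypothetical and this is not the detection logic from the patch under review, which is not quoted in this log.

```python
import shutil


def installed_engines(candidates=("podman", "docker"), which=shutil.which):
    """Return every candidate engine found on PATH, in preference order.

    Unlike taking only the first match, this keeps docker visible on
    hosts where both engines are installed.
    """
    return [name for name in candidates if which(name)]


if __name__ == "__main__":
    print(installed_engines())
```

Whether an engine is merely installed or actually running containers is a separate question that this sketch does not answer.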
*** jbadiapa has quit IRC10:49
*** holser has quit IRC11:00
sshnaidmzbr, currently I'd like to understand on which stage the collection is stuck11:01
sshnaidmzbr, although I still don't understand what is the issue you see in this patch11:02
sshnaidmzbr, if you think it deserves -1, please write exact well based reasons behind it11:03
zbrdoes it always happen or is random? i would personally create a change to add `| gzip` to that -vvv line and see what happens. at least we rule-out the outofdisk/time case. big log also means that rsync could fail trying to move it.11:04
sshnaidmzbr, https://docs.openstack.org/project-team-guide/review-the-openstack-way.html#code-review-minus-111:05
zbryep, this reminds that i wanted to ask something on infra.11:06
*** whoami-rajat is now known as whoami-rajat|lun11:07
*** whoami-rajat|lun is now known as whoami-rajat11:07
zbrit is a common assumption that any change should be covered by a test that prevents regression, but that is not mentioned in the guidelines.11:08
zbrif not needed, i will remove my -1, and just leave a comment.11:08
zbrbut experience told me that comments are ignored.11:08
sshnaidmzbr, also wrt this patch https://review.opendev.org/#/c/705335/11:10
sshnaidmzbr, you shouldn't set -1 just to leave your comment, please read a code review guide I posted above11:11
sshnaidmzbr, you can actually set +1 with a comment, not +211:11
ykarelsshnaidm, me noticed in some of my patches where logs were not collected was container collect info stuck on compute node, i checked only few logs so not sure though11:11
sshnaidmzbr, we don't have yet unittests for sova code, only functional11:11
sshnaidmykarel, yeah, it's exactly what I saw too11:12
ykarelsshnaidm, ack then it's related :)11:12
sshnaidmykarel, that's why I wondered if it's a new podman or something like that11:12
zbradd some, writing the first is less than 10 lines!11:12
sshnaidmzbr, not in this patch, definitely11:12
sshnaidmzbr, you can work on it, you're also part of Ci11:12
ykarelsshnaidm,new podman causing timeout?11:12
ykarelat least it's not updated in last few days11:13
sshnaidmykarel, well, you say it's not a new11:13
ykareli think month11:13
ykarelyes it's not updated for many days11:13
sshnaidmykarel, yeah, and this started from something like a week ago, with all these gzips11:13
zbri bet that the same was said about all the previous patches, not in this one. i offer to write one to break that habit.11:13
ykarelsshnaidm, last update for podman was in september11:14
sshnaidmzbr, because patches should not mix things11:15
ykarelyes possible it's related to gzip patches11:15
sshnaidmykarel, I see.. I wonder if it's related to gzipping11:15
ykarelme not aware of any logs missing11:15
ykarelbefore that11:15
sshnaidmykarel, maybe worth to do containers task as async and add a timeout11:16
sshnaidmgonna try it11:16
ykarelsshnaidm, for debugging i think could be done11:16
ykarelbut not permanent if logs are missing11:16
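The idea of bounding the containers task with a timeout can be illustrated outside Ansible. The following is a minimal Python sketch, assuming only the standard library; run_with_timeout is a hypothetical helper, and the actual change would more likely rely on Ansible's own async handling rather than on code like this.

```python
import subprocess
import sys


def run_with_timeout(argv, timeout_s):
    """Run argv and return its stdout, or None if it exceeds timeout_s.

    subprocess.run kills the child process when the timeout expires,
    so a hung command cannot block the rest of the collection.
    """
    try:
        result = subprocess.run(
            argv, capture_output=True, text=True, timeout=timeout_s
        )
    except subprocess.TimeoutExpired:
        return None
    return result.stdout


if __name__ == "__main__":
    # Stand-in command; a real collector would pass the container
    # engine command line here instead.
    print(run_with_timeout([sys.executable, "-c", "print('ok')"], 10))
```

As noted in the discussion, a timeout is a debugging aid: it lets the remaining logs be collected, but the data from the command that hung is still lost.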
mariossshnaidm: arxcruz: zbr: chkumar|rover: panda: * reviews please when you next have time thank you "Refresh start_named_hashes after promotion to prevent false positive" https://review.rdoproject.org/r/#/c/24665/11:19
arxcruzmarios: i'm not familiar with the code, so i can only review from the point of view of the logic11:20
mariosarxcruz: cool thanks sure just review what you can, review the python11:20
marioschkumar|rover: thanks for checking added pointer to where the tasks are executed by molecule https://review.rdoproject.org/r/#/c/24771/11:24
sshnaidmzbr, I think you're much better in all related linters and tests, so why wouldn't you contribute from your experience and write some tests for these files in followups?11:28
ykarelsshnaidm, other thing i noticed is it's affecting only master11:37
ykarelhttps://review.rdoproject.org/zuul/builds?job_name=tripleo-ci-centos-7-ovb-3ctlr_1comp-featureset035&branch=master11:38
ykarelsimilar can be seen for other jobs11:38
ykareland it started since 30th Jan11:41
ykarelinfact 29th11:42
mariosack sshnaidm checking11:45
marios(was meant for tripleo ^ sshnaidm )11:45
sshnaidmykarel, 035 in check is running on vexxhost11:47
chkumar|roversshnaidm, https://review.opendev.org/#/c/705007/2 is good to go11:47
chkumar|rovermarios, https://review.opendev.org/#/c/704805/ needs +w on this, thanks :-)11:48
ykarelsshnaidm, so it's affecting both rdo and vexx, so cloud thing shouldn't be related i think11:48
sshnaidmykarel, yeah, shouldn't11:48
sshnaidmykarel, how do you see the problem from builds table?11:48
ykarelsshnaidm, from job timing11:49
ykarel13k+11:49
sshnaidmoh, right11:49
ykarelthat's not strict but could be used11:49
marioschkumar|rover: ack11:51
sshnaidmykarel, are you sure it's only master? Because if so, it shouldn't be related to collect-logs..11:53
ykarelsshnaidm, from what i saw it's master, but please cross check11:54
*** holser has joined #oooq11:54
sshnaidmchkumar|rover, do you know why we don't have train job in 3rd party? https://review.opendev.org/#/c/702844/11:55
chkumar|roversshnaidm, checking11:56
chkumar|roversshnaidm, I think it got missed adding it right now https://github.com/rdo-infra/review.rdoproject.org-config/blob/master/zuul.d/tripleo.yaml#L7211:57
sshnaidmchkumar|rover, I think it should be in  ovb-branchless template11:58
sshnaidmchkumar|rover, thanks!11:58
*** jbadiapa has joined #oooq12:03
chkumar|roversshnaidm, under ovb branchless template it is there https://github.com/rdo-infra/rdo-jobs/blob/master/zuul.d/project-templates.yaml#L62712:04
mariossshnaidm: chkumar|rover: can you please check https://review.rdoproject.org/r/#/c/24771/ & https://review.rdoproject.org/r/#/c/24665/12:04
sshnaidmchkumar|rover, hmm.. then why doesn't it run, maybe problem with anchor?12:08
chkumar|rovermarios, regarding this one https://review.rdoproject.org/r/#/c/24665/ and I think I am watching the correct logs http://logs.rdoproject.org/65/24665/4/check/tripleo-ci-promotion-staging/2137100/logs/promoter_logs/centos7_master.log and http://localhost:58080/api/civotes_detail.html?commit_hash=360d335e94246d7095672c5aa92b59afa380a059&distro_hash=9e5988125e88f803ba20743be7aa99079dd275f212:19
chkumar|rover? sorry for the confusion12:19
weshaychkumar|rover, :) things look a little greener this morning eh?12:21
marioschkumar|rover: yes12:21
chkumar|rovermarios, thanks, done12:22
chkumar|roverweshay, yes, with few hiccups12:22
chkumar|roverweshay, in few master, ovb jobs logs went missing12:22
marioschkumar|rover: i'll comment on the review too https://review.rdoproject.org/r/#/c/24665/ but where did you get that one //localhost:58080/api/civotes_detail.html?commit_hash=360d335e94246d7095672c5aa92b59afa380a059&distro_hash=9e5988125e88f803ba20743be7aa99079dd275f212:22
marioschkumar|rover: thanks12:23
chkumar|rovermarios, http://logs.rdoproject.org/65/24665/4/check/tripleo-ci-promotion-staging/2137100/logs/promoter_logs/centos7_master.log just go down the logs12:24
*** rfolco has joined #oooq12:24
marioschkumar|rover: i see, i thought you were pointing to some other file12:24
marioschkumar|rover: ack yes12:24
chkumar|roverweshay, https://review.opendev.org/#/c/705007/ good to go, one less bug to worry12:25
weshaychkumar|rover, go ahead and wf12:25
weshaykopecmartin, mtg12:32
weshayhttps://projects.engineering.redhat.com/browse/RHOSINFRA-295412:32
*** soniya29 has quit IRC12:41
chkumar|roverweshay, see ya directly during CIX, heading home12:51
*** udesale_ has joined #oooq12:55
*** udesale has quit IRC12:57
*** ratailor has quit IRC12:57
*** ratailor has joined #oooq12:58
*** marios is now known as marios|call13:01
zbrykarel: am I correct to assume that on https://review.opendev.org/#/c/705378/1 you are trying to address the issue of having a html file gzipped?13:01
*** rlandy has joined #oooq13:01
ykarelzbr, nope13:01
ykarelzbr, me trying to fix log footer which contains wrong links when artcl_gzip: false13:02
rfolcoweshay, joining scrum ?13:02
ykarelzbr, see https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_fa0/705378/1/check/tripleo-ci-centos-7-undercloud-containers/fa06d68/logs/README.html without that patch13:03
ykarelzbr, see https://f36f3fd4b8223f0221b9-992fbb76369d370eec805eb398c9de6e.ssl.cf2.rackcdn.com/705061/1/check/tripleo-ci-centos-7-standalone/31ec969/logs/README.html without the patch13:03
ykarelin upstream logs are not gzipped but links there point to non-existent .gz files13:04
zbrykarel: ok, so the bug is in the generation of the readme file13:05
zbrthere are two approaches here13:06
zbra) generate the readme file after the archival runs, so it has right files13:06
zbrb) alter log server configuration to transparently return .gz files when these exist, also known as pre-archived static file serving (which was even working at some point if I remember well)13:07
zbrmainly the UI (readme file) should not need to know that the file is kept compressed on the log server.13:08
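Approach a) above (generate the readme after archival so it only links to files that exist) can be sketched as a small lookup. This is a hedged Python illustration: resolve_link is a hypothetical helper, and how the readme is actually produced by the collect-logs role is not shown in this log.

```python
import os
import tempfile


def resolve_link(log_root, rel_path):
    """Return rel_path or rel_path + '.gz', whichever exists, else None.

    Intended to run after archival, so the readme links to the file
    as it really is on disk, gzipped or not.
    """
    for candidate in (rel_path, rel_path + ".gz"):
        if os.path.isfile(os.path.join(log_root, candidate)):
            return candidate
    return None


if __name__ == "__main__":
    # Throwaway directory with one gzipped and one plain file.
    with tempfile.TemporaryDirectory() as root:
        for name in ("errors.txt.gz", "deploy.log"):
            open(os.path.join(root, name), "w").close()
        for wanted in ("errors.txt", "deploy.log", "missing.log"):
            print(wanted, "->", resolve_link(root, wanted))
```

Entries that resolve to None could simply be left out of the readme, which would also drop links to files that were never produced in a given run.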
ykarelzbr, readme file is static13:09
zbrmaybe it should not be, i do not find it hard to make it more dynamic and include it in artcl13:10
ykarelzbr, ack if you want to convert to dynamic u can get that, no issue from my side13:11
zbrwhich could also avoid noise in the file like stuff that do not exist in current execution13:11
zbrykarel: I can give it a try, i bet it would require less maintenance than one that is updated with sed.13:12
ykarelzbr, ack go for it13:12
ykarelfor me the important thing is to fix that, if there's a better way it's fine13:12
zbrykarel: send me link on current static copy and I will create a change today13:13
marios|callhttps://code.engineering.redhat.com/gerrit/#/q/topic:17-standup+status:open13:13
zbrykarel: sure, I do see its use.13:13
ykarelzbr, https://review.opendev.org/#/c/705379/ separate review depends on the other one13:13
ykarelsshnaidm, i see docker got updated on 28th, what do u think if it can be related?13:20
*** ratailor has quit IRC13:21
sshnaidmykarel, idk, but we used docker for logs collection before together with podman, I have a patch to separate them and not touch docker if we use podman13:21
ykarelsshnaidm, but docker is still used in jobs13:22
ykarelin jobs where pacemaker is there13:22
sshnaidmykarel, yes, and it should be used in logs collection in those jobs only13:23
sshnaidmykarel, I mean we ran both in every job in logs collection, no matter what is used13:23
weshaychkumar|rover, may be an issue w/ fs001 rhel813:23
weshaymaster13:23
ykarelsshnaidm, ack iirc we used to run containers with both docker/podman in same job, not sure about recent though13:24
chkumar|roverweshay, 500 no valid host13:24
ykarelsshnaidm, example was ceph with docker others with podman13:24
marios|callchkumar|rover: any idea molecule-container-push NODE_FAILURE in 0s https://review.rdoproject.org/r/#/c/24705/13:24
*** derekh has quit IRC13:24
marios|callrechecking13:24
ykarelbut that was long ago13:24
sshnaidmykarel, I hope we don't. Otherwise need to manage containers lists for both engines13:25
chkumar|roverykarel, on undercloud do we have both podman and docker?13:25
ykarelchkumar|rover, undercloud i think only one13:26
ykarelthe thing is both used to be there in overcloud in ceph jobs13:26
ykarelbut now i think everything ported to podman13:26
marios|callhttps://bugs.launchpad.net/tripleo/+bug/1861342  https://review.rdoproject.org/r/2477113:27
openstackLaunchpad bug 1861342 in tripleo "tripleo-ci promotion failing on "pull ppc64le tagged containers"" [Critical,Triaged] - Assigned to Marios Andreou (marios-b)13:27
chkumar|roversshnaidm, please comment on the latest update on this https://trello.com/c/0pT1zkSe/1316-cixlp1861378tripleociproa-multiple-postfailure-on-master-periodic-pipeline13:28
chkumar|roversshnaidm, since we still have logs missing on few master ovb jobs13:29
ykarelsshnaidm, me now checks running jobs where it's stuck13:30
sshnaidmykarel, we don't have it on rhel8 ovb master, btw13:30
ykarelsshnaidm, hmm i saw that13:31
ykarelsshnaidm, so it can be related to docker update imo13:31
ykarelas that's only centos13:31
sshnaidmykarel, or their combination with podman..13:31
ykarelsshnaidm, yes13:31
sshnaidmykarel, this has errors, but it set engine for podman only: https://review.opendev.org/#/c/705347/ let's see if it helps13:32
sshnaidmykarel, at least it worked once13:33
ykarelsshnaidm, i logged in to a running job13:33
ykarelsshnaidm, i see /usr/bin/docker-current stats --all --no-stream is stuck for 5 minutes in novacompute13:33
sshnaidmykarel, aha!13:33
ykarelsshnaidm, share your keys, i have to leave now, i have a meeting in next half hour13:34
ykarelsshnaidm, will add your keys so u can check more before node get destroyed13:34
sshnaidmykarel, https://github.com/sshnaidm.keys13:34
sshnaidmykarel, I think we have the same meeting :)13:34
ykarelsshnaidm, i have PCD one13:34
ykareli will not be able to join bootcamp one today13:35
sshnaidmack13:35
ykarelsshnaidm, try zuul@38.145.35.8013:35
ykarelsshnaidm, and then ssh heat-admin@192.168.24.1613:35
*** ykarel is now known as ykarel|afk13:36
ykarel|afki have installed strace there, i see it's stuck at some place13:36
ykarel|afki see FUTEX_WAIT13:37
sshnaidmykarel|afk, manually it's stuck too.. but works w/o --stream13:38
ykarel|afksshnaidm, ack /me leaving will be back after some time13:39
*** ykarel|afk is now known as ykarel|away13:39
sshnaidmykarel|away, ack, thanks for the node13:39
*** Goneri has joined #oooq13:40
rlandyhttps://review.opendev.org/70505213:42
rlandyrfolco: ^^13:44
sshnaidmrlandy, I'm fine to merge it if you can please have the followup with changes, the current way looks unsafe to me and should be changed13:45
rlandysshnaidm: I have no problem making your suggested change13:45
sshnaidmrlandy, cool, thanks13:45
*** marios|call is now known as marios13:45
rlandyI'd just like everyone to take one last look so we can merge it13:45
rlandywe are holding steve at this point13:46
chkumar|roverweshay, sshnaidm https://review.opendev.org/#/q/topic:cellv2+(status:open+OR+status:merged)  I workflowed it, cellv2 patches13:49
chkumar|roverweshay, sshnaidm do we want to run fs062 in periodic also?13:50
sshnaidmchkumar|rover, yeah, would be great13:51
sshnaidmchkumar|rover, thanks13:51
weshaymarios, please cancel todays boot camp sync13:52
* weshay needs to chat w/ phill13:52
mariosweshay: ack doing13:53
weshaymarios, thanks13:53
mariosweshay: postpone or just cancel?13:53
arxcruzhave a doctor appointment, be back in 1 hour13:53
weshaymarios, not sure :)13:53
weshaywill let you know13:54
mariosweshay: no i mean the sync13:54
weshaymarios, updated our 1-113:54
mariosweshay: ack ok i'lll cancel it we can re-setup a call np13:54
weshayya.. we'll kick it later this week13:54
weshayif we're still on13:54
sshnaidmrlandy, commented https://review.opendev.org/#/c/705052/13:55
weshaychkumar|rover, updating https://bugs.launchpad.net/tripleo/+bug/1856016 to triaged13:57
openstackLaunchpad bug 1856016 in tripleo "Tempest basic ops test_cross_tenant_traffic tests failed on fs020 train with Timed out waiting for 10.0.0.102 to become reachable from 10.0.0.117" [Critical,Triaged]13:57
rlandysshnaidm: updated13:58
chkumar|roverrlandy, small nit pick on metalsmith patch13:58
weshaychkumar|rover, did you see lines 98 - 111 on  https://etherpad.openstack.org/p/ruckroversprint2113:59
rlandychkumar|rover: patch is updated14:00
chkumar|roverweshay, yes, need to ping cgoncalves for the same14:00
weshaychkumar|rover, looks like we should bug it14:00
weshaychkumar|rover, I've seen it twice in the gates over the last few weeks14:00
chkumar|roverweshay, file it sir :-)14:01
weshayk.. will do14:01
chkumar|roverweshay, going to generate some ansible related tasks for next sprint related to os_tempest14:01
chkumar|roveroctavia and ironic support in os_tempest14:01
weshayk14:01
*** holser has quit IRC14:02
chkumar|roverweshay, rlandy merge these patches in morning https://review.opendev.org/#/q/topic:unskipvolume+(status:open+OR+status:merged)14:03
chkumar|roversorry in your evening14:03
rlandyok14:03
chkumar|roverit might increase the time of fs02014:03
chkumar|roverso many tests getting unskipped14:03
*** ykarel|away is now known as ykarel14:04
chkumar|roverweshay, we need to keep an eye on these patches https://review.opendev.org/#/q/topic:mistral_to_ansible+(status:open+OR+status:merged)+status:open+label:verified%253D%252B1%252Cuser%253Dzuul they might break periodic master jobs14:04
rlandychkumar|rover: looking for test run here14:04
weshaychkumar|rover, what is your additional concern?14:05
weshayre: greater than the normal potential for breakage?14:05
chkumar|roverweshay, nothing14:06
weshaychkumar|rover, this cleared 3rd party https://review.opendev.org/#/c/705323/14:06
rlandyhttps://review.opendev.org/#/c/70491914:06
rlandynot through gate either14:06
rlandychkumar|rover: ^^14:06
weshaychkumar|rover, /me more concerned about the recent fs001 rhel 8 failures14:07
* weshay will dig in14:07
chkumar|roverweshay, you mean this one https://logserver.rdoproject.org/79/24779/1/check/periodic-tripleo-ci-rhel-8-ovb-3ctlr_1comp-featureset001-master/941e294/logs/undercloud/home/zuul/overcloud_deploy.log 500 error14:08
weshaychkumar|rover, https://bugs.launchpad.net/tripleo/+bug/186168514:11
openstackLaunchpad bug 1861685 in tripleo "scenario10 tempest random tempest failures in check / gate, cloud related" [High,Triaged]14:11
weshayput notes there14:11
chkumar|roverweshay, sure14:12
*** holser has joined #oooq14:14
*** derekh has joined #oooq14:17
*** ykarel is now known as ykarel|mtg14:19
rfolcopanda, what am I missing to run my delegated molecule test here https://review.rdoproject.org/r/#/c/24762/14:20
rfolcopanda, tox.ini has molecule_delegated env, why is it not finding my new test14:20
mariospanda: please check when you next have reviews time thanks https://review.rdoproject.org/r/#/c/24771 (and as discussed added there new task  https://tree.taiga.io/project/tripleo-ci-board/task/1510 under story 1493 consolidate tests14:25
*** marios is now known as marios|call14:32
*** TrevorV has joined #oooq14:33
chkumar|roverweshay, anything more needed on my side, I will be logging off now14:37
chkumar|roverfeel free to drop emails14:37
chkumar|roversee ya, Have a nice day and evening ahead14:37
*** chkumar|rover is now known as raukadah14:37
weshayraukadah++14:38
pandarfolco: tox.ini ignores delegated tests, you need to add another job14:44
pandarfolco: and non-delegated tests are broken right now.14:44
rfolcopanda, ok I realized that looking at the other delegated ones at zuul.d/jobs.yaml14:45
rfolcopanda, thanks14:47
*** marios|call is now known as marios14:49
*** dtantsur is now known as dtantsur|brb14:50
*** ykarel|mtg is now known as ykarel14:52
*** holser has quit IRC14:57
*** jbadiapa has quit IRC15:02
*** apetrich has quit IRC15:07
pandamarios: do you have the powers of merge on https://review.rdoproject.org/r/#/c/24771 ? You can merge at your discretion.15:13
mariosack thanks panda. reviews please @  https://review.rdoproject.org/r/#/c/24771 when you next have time cc rfolco weshay sshnaidm ** i will merge it if it's still around without -1 tomorrow morning15:16
mariospanda: in fact, it won't do anything until we post to re-enable it weshay (revert revert https://review.rdoproject.org/r/#/c/2475015:17
ykarelsshnaidm, zbr revert temp patch https://review.rdoproject.org/r/#/c/24778/15:26
*** zbr has quit IRC15:30
arxcruzback15:31
*** zbr has joined #oooq15:33
*** zbr has quit IRC15:33
*** zbr has joined #oooq15:34
raukadahrlandy, sshnaidm, please have a look at this one https://review.opendev.org/#/c/705175/2 when free15:36
*** rfolco is now known as rfolco|eats15:43
*** ykarel is now known as ykarel|away15:44
mariossshnaidm: do you have something to point me at for the 'no logs in ovb' we discussed earlier please15:47
mariossshnaidm: like bug or review or taiga15:47
sshnaidmmarios, https://bugs.launchpad.net/tripleo/+bug/186169415:47
openstackLaunchpad bug 1861694 in tripleo "Nonstop restarting ovn_metadata_haproxy container" [Critical,Triaged]15:47
mariossshnaidm: thank you15:48
sshnaidmmarios, review: https://review.opendev.org/70544615:48
mariossshnaidm: thanks15:49
*** holser has joined #oooq15:57
*** jbadiapa has joined #oooq16:03
*** dtantsur|brb is now known as dtantsur16:05
weshayzbr, ping16:24
weshayre: logging16:24
zbro/16:24
zbri was now working to fix the readme generation, i will have a change ready for review before tomorrow16:25
zbrPOC already worked.16:25
zbrit will be kickass readme.16:25
weshayzbr, what's broken re: the readme?16:26
zbryatin tried to fix the broken .gz links in the readme.16:26
zbrnew one will only have working links. w/ or w/o .gz based on the case.16:27
*** sshnaidm is now known as sshnaidm|afk16:27
weshayoh this link undercloud/var/log/tripleo-container-image-prepare.log.txt.gz - the container download, container update and provision log16:27
weshayand this one https://storage.gra.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_db1/705397/1/gate/tripleo-ci-centos-7-scenario000-multinode-oooq-container-updates/db10d98/logs/undercloud/var/log/extra/errors.txt.txt.gz16:28
weshayzbr, let's chat about this16:28
weshayhttps://meet.google.com/otr-pmkh-rer?authuser=116:29
weshayzbr, you avail?16:32
*** udesale_ has quit IRC16:32
zbrsure. joining.16:32
* marios home time16:44
*** marios is now known as marios|out16:50
*** rfolco|eats is now known as rfolco16:51
*** marios|out has quit IRC16:58
*** bogdando has quit IRC16:59
weshayzbr, https://opendev.org/openstack/tripleo-quickstart-extras/src/branch/master/roles/validate-tempest/vars17:02
rlandyweshay: ever hit this error when running standalone on your VM setup?  "reset failed: reset: standard error: Inappropriate ioctl for device"17:30
weshayhrm... not that I recall.. sec17:31
*** jbadiapa has quit IRC17:40
*** derekh has quit IRC18:00
weshayrlandy, which part is that failing on?18:14
weshayvirt-customize?18:14
rlandyweshay: nvm - I got by it following https://bugs.launchpad.net/tripleo/+bug/184267718:14
openstackLaunchpad bug 1842677 in tripleo "reset failed: reset: standard error: Inappropriate ioctl for device" [Low,Fix released] - Assigned to Alex Schultz (alex-schultz)18:14
rlandyI hacked that part not to reset the console18:14
rlandynot needed18:14
rlandyon other errors now18:14
raukadahweshay, regarding tempest 23.0.0 I will drop an email tomorrow so that downstream ci can keep an eye18:19
weshayraukadah, that merged for us? the bump to 23.0?18:19
raukadahweshay, nope, still in testing phase18:19
raukadahweshay, we are now good to go to merge the stuff18:20
weshayk.. and not related at all to multicell?18:20
raukadahweshay, nope18:20
weshayk k..18:21
weshaythanks18:21
weshayraukadah, go to bed18:21
raukadahweshay, multinode failure is unrelated , need to keep an eye18:21
weshayraukadah, we do http://dashboard-ci.tripleo.org/d/jobs/jobs-exploration?orgId=1&fullscreen&panelId=16&from=now-90d&to=now18:23
weshaystill living on the edge18:24
*** weshay is now known as weshay|ruck18:30
*** holser has quit IRC18:36
*** dtantsur is now known as dtantsur|afk19:53
*** tesseract has quit IRC20:01
*** irclogbot_2 has quit IRC20:05
*** irclogbot_3 has joined #oooq20:05
*** jmasud has quit IRC20:12
*** jmasud has joined #oooq20:13
weshay|ruckrlandy, when you have a sec https://review.opendev.org/#/c/705549/20:39
rlandylooking20:40
weshay|ruckrfolco, did we get a new count on centos-8?20:48
rfolcoweshay|ruck, no, I'm retrying with the new patches merged and the new skips21:12
rfolcoweshay|ruck, ceph patch merged21:12
*** jfrancoa has quit IRC21:14
weshay|ruckk21:15
*** Goneri has quit IRC21:16
weshay|ruckrfolco, has this executed any where? https://review.rdoproject.org/r/#/c/24775/21:18
rfolcoweshay|ruck, no I don't know whats going on with marios patch21:19
rfolcoweshay|ruck, need to sync with him tomorrow21:19
weshay|ruckrfolco, think we just need to update zuul config no?21:20
rfolcoweshay|ruck, https://review.opendev.org/#/c/701937/ zuul -121:20
weshay|ruckrfolco, wasn't so hard ;) https://review.rdoproject.org/r/#/c/24775/21:28
weshay|ruckah crud maybe it is so hard21:28
weshay|ruckhrm21:28
weshay|ruckrlandy, what are we missing here to trigger build containers on a change to zuul.d/build-containers.yaml file21:35
rlandy?21:35
weshay|ruckoh.. sorry21:35
rlandyrlandy or rfolco?21:35
weshay|ruckhttps://review.rdoproject.org/r/#/c/24775/5/zuul.d/build-containers.yaml21:35
weshay|ruckrlandy,21:35
weshay|ruckrlandy, need to define in projects.yaml?21:36
rlandyweshay|ruck: it should work .. have you tried changing the /build-containers.yaml21:38
rlandyfile21:38
rlandyso usually I would add the files to where the job is called21:39
rlandyweshay|ruck: may I edit that?21:41
weshay|ruckrlandy, latest patch is up now21:42
weshay|ruckrlandy, there it goes21:42
weshay|ruckrlandy, you can't add a template there I guess?21:43
weshay|ruckrfolco, it's running man21:43
weshay|ruckrfolco, https://review.rdoproject.org/zuul/status 2477521:43
rlandyweshay|ruck: k - so while we are talking ... getting somewhere with the tls standalone ... question ...21:43
weshay|ruckrlandy, aye21:43
rlandystandalone deploy fails with ERROR! the role 'ipaclient' was not found in ...21:43
rlandythe ipaclient is in https://github.com/freeipa/ansible-freeipa/tree/master/roles/ipaclient21:44
rlandyhow do I get THT to pick that up?21:44
rlandyadd it to the role path for the deploy?21:44
weshay|ruckthat's a good question...21:45
weshay|ruckI could poke at this with you but I don't remember offhand, or I may never have known21:45
weshay|rucksagi would know21:45
weshay|ruckbut happy to look at it w/ you21:46
rlandyI'll dig a bit more - if no results, I'll ping you21:46
weshay|ruckk21:46
*** saneax has quit IRC22:25
*** rfolco has quit IRC22:36
weshay|ruckrlandy, fwiw.. pretty sure the ansible path is set by tripleo-common cloudnull could help22:42
rlandyweshay|ruck: sorry - found ot22:42
rlandyit22:42
weshay|ruck:)22:42
rlandy /usr/share/ansible/roles22:42
*** TrevorV has quit IRC22:53
*** jaosorior has quit IRC23:34
*** jaosorior has joined #oooq23:50
