*** ykarel|away is now known as ykarel | 04:46 | |
*** marios is now known as marios|ruck | 05:11 | |
*** bhagyashris_ is now known as bhagyashris | 05:39 | |
*** jpena|off is now known as jpena | 07:32 | |
marios|ruck | soniya29|rover: quick errand biab | 08:18 |
---|---|---|
marios|ruck | thanks soniya29|rover | 09:03 |
akahat | marios|ruck, chandankumar hey.. i've come up with small idea about promoter logging solutions: https://paste.opendev.org/show/808005/ | 09:21 |
akahat | if there are no promotions then it will write centos8_master_last_run.log file. This will avoid creating promotion log files | 09:22 |
akahat | and if we got promotion then we will log thatin centos8_master_<timestamp>.log | 09:22 |
akahat | bhagyashris, ^^ | 09:22 |
marios|ruck | akahat: ish... ideally we'd have a logfile per day for the last few days | 09:24 |
marios|ruck | akahat: can still captured a 'promoted' version when that happens as well as the normal log | 09:25 |
marios|ruck | akahat: so you mean in your proposal we only keep 2 files? now and 'everything else'? | 09:28 |
marios|ruck | akahat: i think it would make the 'everything else' file pretty big | 09:28 |
akahat | marios|ruck, no.. that last_run.log file will get purged.. and updated with current run. | 09:30 |
akahat | and for promotion we have file with the timestamp. | 09:31 |
marios|ruck | akahat: but then i cant check yesterday logs? | 09:31 |
marios|ruck | akahat: like i want to see if there is something we can promote eg few missing jobs | 09:31 |
akahat | marios|ruck, yes. you wont | 09:31 |
akahat | okay. then something else i need to think. | 09:32 |
akahat | suggestions welcome. | 09:32 |
marios|ruck | akahat: why don't you like the 'daily' approach | 09:32 |
marios|ruck | akahat: and we keep n days like 3/? 5? | 09:32 |
marios|ruck | akahat: AND if there is a promotion you create a new file called 'master_promoted_timestamp.txt' | 09:33 |
marios|ruck | as well as continue the daily log | 09:33 |
akahat | marios|ruck, okay.. you are saying we can remove the logs older than 4-5 days. | 09:33 |
marios|ruck | and there should be a rainbow | 09:33 |
marios|ruck | akahat: yeah i think so like some configurable number maybe but few days i don't think is useful after that | 09:33 |
marios|ruck | akahat: bring it to the design/planning scrum tomorrow for more opinions though this is just mine i think rlandy agrees i think weshay has other ideas | 09:34 |
akahat | marios|ruck, okay.. got your point. | 09:34 |
akahat | yeah.. sure. | 09:34 |
akahat | marios|ruck, thank you for suggestions. :) | 09:35 |
bhagyashris | akahat, ack | 09:42 |
bhagyashris | folks kindly add in your review lust https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34706 | 09:42 |
marios|ruck | ack added to list bhagyashris | 09:42 |
zbr | marios|ruck: i think i found another problem with get_hash. apparently Ansiballz does not include non .py files when sending them to remote host, so our get_hash is unable to find config.yaml file. | 09:51 |
zbr | solution is to make it a python file. | 09:51 |
marios|ruck | zbr: fantastic | 09:54 |
marios|ruck | zbr: but lets solve/focus on the initial issue of not finding the module first | 09:54 |
marios|ruck | zbr: before starting to dig into that | 09:54 |
zbr | that was sorted few minutes ago | 09:54 |
zbr | and while sorting it, i found that issue with config. | 09:55 |
bhagyashris | marios|ruck, thanks :) | 10:04 |
soniya29|rover | chandankumar, arxcruz, kopecmartin, please edit/add today's agenda for tempest's meeting - https://hackmd.io/fIOKlEBHQfeTZjZmrUaEYQ | 10:37 |
arxcruz | soniya29|rover: we have retro today, should we also have tempest meeting? (althoug isn't overlapping) | 10:38 |
soniya29|rover | arxcruz, we have moved tempest meeting before retro meeting | 10:38 |
arxcruz | yup, i know | 10:39 |
arxcruz | just wondering | 10:39 |
arxcruz | that's fine | 10:39 |
chandankumar | soniya29|rover: I donot have any agenda to discuss , I think hackmd needs cleanup | 10:40 |
soniya29|rover | chandankumar, cleanup? | 10:42 |
zbr | marios|ruck: not glad to report an IncompleteRead with mirror.bhs1.ovh.opendev.org, i did recheck. | 10:46 |
chandankumar | soniya29|rover: I have removed unnecessary items which we already discussed in last meeting | 10:47 |
chandankumar | from hackmd | 10:47 |
marios|ruck | thanks zbr | 10:54 |
soniya29|rover | chandankumar, okay | 11:03 |
marios|ruck | zbr: have the link? | 11:14 |
akahat | Review request: https://review.opendev.org/q/topic:%22utilize-tripleo-operator%22+(status:open%20OR%20status:merged) | 11:23 |
akahat | Review Request: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34334 | 11:24 |
*** dviroel|out is now known as dviroel | 11:32 | |
*** jpena is now known as jpena|lunch | 11:35 | |
*** rlandy is now known as rlandy|ruck | 11:47 | |
rlandy|ruck | marios|ruck: soniya29|rover: hey | 11:48 |
soniya29|rover | rlandy|ruck, hello | 11:48 |
rlandy|ruck | marios|ruck: soniya29|rover: I have a clashing meeting with the first half of the program call | 11:49 |
marios|ruck | zbr: rlandy|ruck: ack np | 11:49 |
rlandy|ruck | marios|ruck: soniya29|rover: I updated the doc | 11:49 |
marios|ruck | rlandy|ruck: thanks i saw | 11:49 |
rlandy|ruck | marios|ruck: soniya29|rover: so I'll miss it but sure you both have it under control | 11:49 |
rlandy|ruck | ping if there are questions on downstream | 11:50 |
marios|ruck | rlandy|ruck: might not even comment may just ask if there are questions since its all 'green' | 11:50 |
marios|ruck | rlandy|ruck: ack np | 11:50 |
weshay|ruck | rlandy|ruck, did you like my cheer? | 11:50 |
rlandy|ruck | marios|ruck: yep - fly under the radar | 11:50 |
soniya29|rover | rlandy|ruck, marios|ruck, we have tempest meeting as well | 11:50 |
rlandy|ruck | weshay|ruck: loving it!! | 11:50 |
soniya29|rover | marios|ruck, shall I join program call or go on with tempest meeting? | 11:53 |
weshay|ruck | marios|ruck, I just realized we should probably report on wallaby too since that is officially imported | 11:58 |
weshay|ruck | mind if I add it? | 11:58 |
soniya29|rover | marios|ruck, weshay|ruck, ^^? | 11:58 |
weshay|ruck | soniya29|rover, go to tempest | 11:58 |
marios|ruck | soniya29|rover: up to you | 11:58 |
marios|ruck | weshay|ruck: sure also green 2 daysold but the fs35 cix is not helping us | 11:59 |
soniya29|rover | weshay|ruck, marios|ruck, ack | 11:59 |
weshay|ruck | marios|ruck, this is the tempest time out issue? | 11:59 |
weshay|ruck | soniya29|rover, have you gotten on a node and looked at why tempest is timing out on us? | 12:00 |
weshay|ruck | soniya29|rover, this one? https://trello.com/c/U1bKNUuu/2051-cixlp1939023tripleociproa-periodic-featureset-35-wallaby-times-out-running-tempest-2-hours | 12:00 |
marios|ruck | weshay|ruck: yeah that one | 12:02 |
marios|ruck | weshay|ruck: see upstream bug has info on the timings | 12:02 |
marios|ruck | weshay|ruck: i have poked at it but can't see why it takes 2x as long run same tests as it did before 3rd august | 12:02 |
weshay|ruck | marios|ruck, have we held an environment? | 12:03 |
marios|ruck | weshay|ruck: not yet | 12:03 |
soniya29|rover | weshay|ruck, tempest meeting? | 12:03 |
rlandy|ruck | weshay|ruck: adding the ephemeral heat settings did not help | 12:03 |
weshay|ruck | soniya29|rover, once we get an environment held.. we'll need your help to understand why tempest is inconsistently timing out | 12:03 |
soniya29|rover | weshay|ruck, sure | 12:03 |
weshay|ruck | rlandy|ruck, aye | 12:03 |
rlandy|ruck | weshay|ruck: its did reproduce the baremetal error | 12:03 |
rlandy|ruck | though | 12:03 |
rlandy|ruck | so there is some combination of settings that is off | 12:04 |
rlandy|ruck | http://osp-trunk.hosted.upshift.rdu2.redhat.com/api-rhel8-osp16-2/api/civotes_agg_detail.html?ref_hash=e55b584d3cad08c6e6cd850c986ada42 | 12:04 |
rlandy|ruck | marios|ruck: weshay|ruck: ^^ going to promote that hash for 16.2 now | 12:04 |
rlandy|ruck | qe jobs looks stuck | 12:04 |
weshay|ruck | marios|ruck, if you can.. getting an environment.. especially for timeouts.. is a great way to diagnose | 12:04 |
rlandy|ruck | https://rhos-ci-jenkins.lab.eng.tlv2.redhat.com/view/pipeline/job/pipeline_integration-pcci-16.2_dlrn-rhel-8.4-virthost-3cont_2comp_3ceph-ipv6-geneve-ceph/ | 12:05 |
weshay|ruck | rlandy|ruck, go 4 it | 12:05 |
weshay|ruck | sounds like imports are not getting turned on until sept. | 12:05 |
marios|ruck | weshay|ruck: sure but also sounds like a good task for soniya29|rover perhaps | 12:05 |
marios|ruck | weshay|ruck: and reaching out to hold the node etc | 12:05 |
marios|ruck | weshay|ruck: can talk on the call after this one | 12:05 |
zbr | marios|ruck: happens again, same place: that is infra issue https://zuul.opendev.org/t/openstack/build/efc4a37164974cce98b128e203871abc | 12:15 |
bhagyashris | arxcruz, zbr, sshnaidm, rlandy|ruck , marios|ruck , ysandeep, bhagyashris, svyas, soniya29|rover , pojadhav, akahat, weshay|ruck , chandankumar, frenzy_friday, dviroel, | 12:16 |
bhagyashris | TripleO CI Retrospective meeting in 14 mins | 12:16 |
bhagyashris | https://miro.com/app/board/o9J_l2p9CCA=/ | 12:17 |
bhagyashris | https://meet.google.com/kkp-bejs-vvo?authuser=0 | 12:17 |
zbr | marios|ruck: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3AIncompleteRead | 12:28 |
zbr | you have the pleasure to announce it to infra folks, sic. | 12:28 |
*** jpena|lunch is now known as jpena|off | 12:28 | |
zbr | and they want to ditch logstash,.... i wonder how they will find stuff like this. | 12:29 |
bhagyashris | dviroel, rlandy|ruck retro time | 12:31 |
dviroel | joining | 12:32 |
weshay|ruck | rlandy|ruck, https://miro.com/app/board/o9J_l2p9CCA=/ | 12:39 |
bhagyashris | https://miro.com/app/board/o9J_l2p9CCA=/ | 12:39 |
weshay|ruck | arxcruz, ! | 12:45 |
arxcruz | weshay|ruck: ? | 12:46 |
weshay|ruck | zbr, dry.. do not repeat.. meh. bad joke | 12:52 |
weshay|ruck | marios|ruck, soniya29|rover if you folks can get that last thing.. re: tempest and fs035 that would be awesome | 13:20 |
soniya29|rover | weshay|ruck, i had discussed it in tempest meeting | 13:20 |
soniya29|rover | weshay|ruck, i and arx cruz will be following that issue | 13:20 |
weshay|ruck | ++ | 13:20 |
weshay|ruck | thank you!! | 13:21 |
soniya29|rover | weshay|ruck, marios|ruck, need to go out for an hour | 13:21 |
marios|ruck | weshay|ruck: define 'get that last thing' :) | 13:22 |
marios|ruck | weshay|ruck: so i've been trying to dig there over last few days and added the findings in the bug | 13:23 |
marios|ruck | weshay|ruck: tempest takes 2x as long as used to | 13:23 |
*** ykarel is now known as ykarel|away | 13:23 | |
marios|ruck | weshay|ruck: i've tried to get soniya29|rover|brb to check before 'cos tempest' so glad the tempest folks will check it | 13:23 |
weshay|ruck | k | 13:23 |
sshnaidm | chandankumar, we don't use puppet-tempest somewhere for tripleo, right? | 13:23 |
weshay|ruck | timeouts suck.. and almost impossible to figure out from just logs | 13:23 |
weshay|ruck | sshnaidm, no.. that is packstack | 13:24 |
chandankumar | sshnaidm: yes, we donot use it, it is used only in packstack and puppet-openstack-integration | 13:24 |
sshnaidm | weshay|ruck, packstack? is it alive? | 13:24 |
marios|ruck | weshay|ruck: but its pretty clear in this case i mean ~2 hour mark the deployment is done and tempest starts... used to cmplete in 1 hour so ~3 total, now timeout after 2 hours so 4 total | 13:24 |
chandankumar | sshnaidm: yes, rdo team have weirdo jobs on that | 13:25 |
sshnaidm | chandankumar, ack | 13:25 |
bhagyashris | folks plz add into your review list https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34688 https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34706 which will help to proceed both upstream and downstream promoter work | 13:27 |
bhagyashris | thanks | 13:27 |
rlandy|ruck | weshay|ruck: now that 16.2 and 17 promoted, I can pick up the fs035 timeout | 13:28 |
rlandy|ruck | marios|ruck: ^^ | 13:28 |
weshay|ruck | dviroel, ping me later for training :) when you have time | 13:28 |
weshay|ruck | rlandy|ruck, we just need an env.. to hand off to soniya29|rover|brb to find which tempest test(s) are messing w/ us | 13:28 |
rlandy|ruck | weshay|ruck: on it | 13:29 |
dviroel | weshay|ruck: ack | 13:29 |
chandankumar | weshay|ruck: rlandy|ruck we can tempest_run file | 13:30 |
chandankumar | in any timedout job | 13:30 |
chandankumar | that will give some idea | 13:30 |
rlandy|ruck | chandankumar: once the node is held? | 13:32 |
chandankumar | rlandy|ruck: can you pass the timedout job link? | 13:32 |
rlandy|ruck | chandankumar: yep - getting | 13:32 |
rlandy|ruck | https://trello.com/c/U1bKNUuu/2051-cixlp1939023tripleociproa-periodic-featureset-35-wallaby-times-out-running-tempest-2-hours | 13:33 |
rlandy|ruck | https://bugs.launchpad.net/tripleo/+bug/1939023 | 13:34 |
rlandy|ruck | chandankumar: ^^ | 13:34 |
rlandy|ruck | also any fs035 job | 13:34 |
chandankumar | rlandy|ruck: https://logserver.rdoproject.org/openstack-periodic-integration-stable1/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset035-wallaby/193b290/logs/undercloud/var/log/tempest/tempest_run.log.txt.gz | 13:34 |
chandankumar | it might give some idea | 13:35 |
* chandankumar reading the whole bug | 13:36 | |
rlandy|ruck | chandankumar: also getting a node | 13:39 |
weshay|ruck | zbr, help? | 13:41 |
weshay|ruck | have a min? | 13:41 |
weshay|ruck | re: that patch | 13:41 |
marios|ruck | thanks rlandy|ruck | 13:42 |
rlandy|ruck | chandankumar: holding https://review.rdoproject.org/r/c/testproject/+/24995 node | 13:43 |
rlandy|ruck | eventually ovb will tear down | 13:44 |
rlandy|ruck | but tempest takes ages | 13:44 |
chandankumar | rlandy|ruck: ack | 13:44 |
zbr | weshay|ruck: send me link about click one and I will find the code you are looking for. i have ameeting in ten mins but i will do it today. | 13:50 |
weshay|ruck | zbr, k.. just need to know how to get an args object for the passed args | 13:50 |
rlandy|ruck | chandankumar: looks like your keys are on 38.102.83.114 | 13:51 |
rlandy|ruck | so you can get on it | 13:51 |
rlandy|ruck | chandankumar: also there is a job in that state in the openstack-periodic-integration-stable1 queue right now | 13:51 |
rlandy|ruck | you can get on that node and look | 13:51 |
rlandy|ruck | weshay|ruck: shocker https://review.rdoproject.org/r/c/testproject/+/18953 centos 9 container builds still passing | 13:52 |
weshay|ruck | rlandy|ruck, ok.. perhaps we just add that one job to the os-$next line? | 13:53 |
rlandy|ruck | forget that | 13:54 |
rlandy|ruck | node didn't kick | 13:55 |
weshay|ruck | heh | 13:55 |
rlandy|ruck | chandankumar: do you have the access you need | 14:03 |
rlandy|ruck | there are actually a bunch of fs035 jobs | 14:04 |
rlandy|ruck | in action in rdo zuul now | 14:04 |
rlandy|ruck | you could access any one | 14:04 |
pojadhav | zbr, do you have any idea why "mol-tripleo_common_integration" job is failing consistently : https://review.rdoproject.org/zuul/builds?job_name=mol-tripleo_common_integration | 14:14 |
pojadhav | this blocking my 2 patches : https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34633 and https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/34572 | 14:14 |
pojadhav | weshay|ruck, ^^ | 14:15 |
weshay|ruck | pojadhav, I added that to the ruck / rover tasks | 14:16 |
weshay|ruck | pojadhav, https://review.rdoproject.org/zuul/builds?job_name=mol-tripleo_common_integration | 14:16 |
weshay|ruck | pojadhav, it's not voting. | 14:16 |
weshay|ruck | shouldn't block | 14:16 |
pojadhav | weshay|ruck, yup.. is it voting in gate? | 14:17 |
weshay|ruck | no | 14:17 |
weshay|ruck | pojadhav, look again | 14:17 |
pojadhav | ok.. then i sould put a recheck | 14:17 |
pojadhav | thanks ! | 14:17 |
marios|ruck | weshay|ruck: sorry i missed the cix call i updated some things earlier that can be close dout | 14:52 |
marios|ruck | weshay|ruck: i guess you didnt need me | 14:52 |
marios|ruck | sorry :D | 14:52 |
weshay|ruck | no worries | 14:53 |
weshay|ruck | not much on the board | 14:53 |
marios|ruck | weshay|ruck: ack thanks | 14:54 |
rlandy|ruck | marios|ruck: was fine | 14:57 |
rlandy|ruck | you did a great job of keeping the board up | 14:57 |
*** jpena|off is now known as jpena | 14:57 | |
rlandy|ruck | marios|ruck: other than fs035 - anything else you want to hand off> | 14:58 |
weshay|ruck | we may not even need a sync / hand off mtg | 14:59 |
weshay|ruck | things are cooking pretty well.. minus 035 | 14:59 |
marios|ruck | rlandy|ruck: nah was hoping to get lucky with 2 hashes for that fs35 https://review.rdoproject.org/r/c/testproject/+/34907 https://review.rdoproject.org/r/c/testproject/+/34916 still didn't report hoping one of them may pass fs35 it sometimes does just within the 4 hours itmeout | 14:59 |
weshay|ruck | dviroel, FYI.. normally previous ruck/rovers meet w/ new ruck/rovers to live xfer work | 15:00 |
weshay|ruck | may skip that this time | 15:00 |
weshay|ruck | but good practice | 15:00 |
marios|ruck | rlandy|ruck: they didn't report yet so maybe give them another spin if they are bad , if either of them passes then wallaby will promote that hash | 15:00 |
marios|ruck | rlandy|ruck: trying to keep wllaby alive despite that timeout | 15:00 |
marios|ruck | rlandy|ruck: its how we've had some wallaby promotions lately :) luck ! | 15:01 |
marios|ruck | rlandy|ruck: i.e. https://bugs.launchpad.net/tripleo/+bug/1939023/comments/3 | 15:01 |
rlandy|ruck | marios|ruck: ack | 15:01 |
rlandy|ruck | throw enough spaghetti at the wall - something might stick | 15:02 |
rlandy|ruck | lovely approach | 15:02 |
marios|ruck | rlandy|ruck: right .. thank you ! | 15:02 |
dviroel | weshay|ruck: ok :) | 15:12 |
*** sshnaidm is now known as sshnaidm|afk | 15:35 | |
*** jpena is now known as jpena|off | 15:42 | |
marios|ruck | o/ weshay|ruck rlandy|ruck off in a couple mins | 15:54 |
marios|ruck | soniya|rover: o/ congrats you made it ;) | 15:54 |
rlandy|ruck | marios|ruck: enjoy the rest | 15:54 |
marios|ruck | rlandy|ruck: are you becoming foreverruck like weshay|ruck :/ | 15:55 |
rlandy|ruck | it's life sentence | 15:55 |
soniya|rover | marios|ruck, congrats to you as well :) | 15:55 |
marios|ruck | rlandy|ruck: tshirt maybe? ;) 'tripleo-ci' 18:55 < rlandy|ruck> it's life sentence | 15:56 |
marios|ruck | bosu oclock | 15:57 |
*** dviroel is now known as dviroel|away | 16:19 | |
zbr | marios|ruck: update: get_hash passed on some jobs but failed on others,... due to No module named 'requests'. | 16:23 |
*** ykarel is now known as ykarel|away | 16:33 | |
*** rlandy|ruck is now known as rlandy|ruck|afk | 16:51 | |
*** rlandy|ruck|afk is now known as rlandy|ruck | 18:59 | |
*** dviroel|away is now known as dviroel | 19:09 | |
dviroel | weshay|ruck: o/ ready when you are | 19:13 |
*** ssamal is now known as ssamal|afk | 19:29 | |
weshay|ruck | dviroel, ah.. still avail? | 19:41 |
dviroel | yes | 19:41 |
weshay|ruck | meet.google.com/rbo-nvyt-rvb | 19:41 |
weshay|ruck | dviroel, ci-config/ci-scripts/infra-setup/roles/rrcockpit/files | 19:58 |
weshay|ruck | dviroel, | 20:05 |
weshay|ruck | cockpit-bridge-249-1.fc33.x86_64 | 20:05 |
weshay|ruck | cockpit-system-249-1.fc33.noarch | 20:05 |
weshay|ruck | cockpit-ws-249-1.fc33.x86_64 | 20:05 |
weshay|ruck | cockpit-networkmanager-249-1.fc33.noarch | 20:05 |
weshay|ruck | cockpit-storaged-249-1.fc33.noarch | 20:05 |
weshay|ruck | cockpit-packagekit-249-1.fc33.noarch | 20:05 |
weshay|ruck | cockpit-249-1.fc33.x86_64 | 20:05 |
weshay|ruck | dviroel, http://localhost:9090/system/terminal | 20:05 |
weshay|ruck | dviroel, https://launchpad.net/~tripleo | 20:18 |
weshay|ruck | dviroel, https://launchpad.net/tripleo | 20:19 |
weshay|ruck | dviroel, https://hackmd.io/07z0xroHTFi2IbX93P5ZfQ | 20:29 |
*** dviroel is now known as dviroel|ruck | 20:35 | |
*** dviroel|ruck is now known as dviroel|ruck|out | 21:46 | |
*** ssamal|afk is now known as ssamal | 22:25 | |
rlandy|ruck | weshay|ruck: we were meant to create a new sprint board | 22:31 |
rlandy|ruck | after tomorrow? | 22:31 |
rlandy|ruck | after planning? | 22:31 |
weshay|ruck | rlandy|ruck, ya.. after planning | 23:21 |
rlandy|ruck | weshay|ruck: ok | 23:21 |
rlandy|ruck | weshay|ruck: left a comment re fs35 failure https://bugs.launchpad.net/tripleo/+bug/1939023/ | 23:40 |
rlandy|ruck | chandankumar: ^^ https://bugs.launchpad.net/tripleo/+bug/1939023/comments/6 pls see what you think | 23:41 |
rlandy|ruck | I think our node went down already | 23:41 |
rlandy|ruck | you can reclaim one in your morning | 23:41 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!