Monday, 2021-07-05

*** ysandeep- is now known as ysandeep06:12
*** amoralej|off is now known as amoralej06:36
ysandeepbhagyashris_, hey o/ chandhan is on sick leave, please let me know if you need a hand with ruck/rovering07:38
bhagyashris_ysandeep, sure thanks .. CI pretty much stable. only Victoria is blocked with TIMED_OUT issue on fs035 it's getting timed out at execute tempest test, rerunning the the job in test project patch07:40
bhagyashris_let see07:40
*** bhagyashris_ is now known as bhagyashris|ruck07:40
ysandeepbhagyashris_, that's good news o/ 07:41
akahatbhagyashris_, i'm also here if you need any help :)08:34
*** sshnaidm|afk is now known as sshnaidm08:35
*** jpena|off is now known as jpena08:35
*** dmellado_ is now known as dmellado08:43
*** ykarel is now known as ykarel|lunch09:05
frenzy_fridayhey bhagyashris_ , could you pls give me access to https://docs.google.com/spreadsheets/d/16rqgaSSoQrYNjsI4q0YInJxrOOJL3xyP3t3D6ZpluFY/edit?skip_itp2_check=true#gid=84689389209:28
bhagyashris_frenzy_friday, Done09:39
frenzy_fridaybhagyashris_, thanks09:39
bhagyashris_akahat, thanks 09:39
mariosneeds votes please when you next have time https://review.opendev.org/c/openstack/tripleo-ci/+/793120 would be good to merge that one to avoid fruther rebases ... just template definitions for new jobs so won't break anything and won't run until it is wired up in different patches https://review.opendev.org/q/topic:wallaby-upgrade-jobs09:45
ysandeepmarios, ack checking09:49
*** pojadhav- is now known as pojadhav10:03
*** bhagyashris_ is now known as bhagyashris|ruck10:08
bhagyashris|ruckarxcruz, marios ysandeep plz add in your review list https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/79943210:37
ysandeepbhagyashris|ruck, already voted10:38
bhagyashris|ruckthanks10:38
bhagyashris|ruckohh thanks arxcruz ysandeep 10:38
mariosbhagyashris|ruck: ack 10:44
*** ykarel|lunch is now known as ykarel10:45
bhagyashris|ruckmarios, that patch got merged 10:54
mariosbhagyashris|ruck: ? 10:55
bhagyashris|ruckmarios, this patch got merge https://review.opendev.org/c/openstack/openstack-tempest-skiplist/+/799432 10:56
mariosbhagyashris|ruck: yeah i had a look and voted before it merged10:56
mariosbhagyashris|ruck: whats up10:56
bhagyashris|ruckmarios, thanks 10:56
bhagyashris|ruckmarios, i have one thing for you , master promotion is blocked due to https://bugs.launchpad.net/tripleo/+bug/1933639 . object storage tempest tests are failing consistently on fs001 master periodic 10:57
bhagyashris|ruckany updates on it10:58
bhagyashris|rucki mean see the comments so is Christian working on it or your are going to debug this ^10:59
mariosbhagyashris|ruck: let me add a comment, no christian is waiting for us to give him a node to debug10:59
bhagyashris|ruckmarios, ok11:00
mariosbhagyashris|ruck: he already checked it and didn't find something in the logs11:00
bhagyashris|ruckok 11:00
*** poojajadhav is now known as pojadhav11:01
frenzy_fridaysshnaidm, zbr, we have a problem in tripleo health queries with the length of filename generated by sova. When you get some time pls review https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/79945611:08
frenzy_fridayzbr, pls add https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/799453 as well to your review list 11:10
zbrfrenzy_friday: that opens a new question: why we use filenames for text messages? they are not reliable. I am personally not pleased with that design.11:11
sshnaidmfrenzy_friday, do you have an example when it's big length?11:11
zbri prefer simple and predictable: put all errors found inside an errors.txt file, one that is not created if nothing is found.11:11
frenzy_fridaysshnaidm, in usual cases no. But in https://opendev.org/openstack/tripleo-ci-health-queries/src/branch/master/samples/errors-testing.err we are trying to have a sample string for all the regex we track11:12
sshnaidmzbr, it creates file so that person that looks in logs will know immediately what is the problem, like "Overcloud_stack__FAILED.log" in https://logserver.rdoproject.org/50/798950/4/openstack-check/tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001/6156d19/logs/_Overcloud_stack__FAILED.log11:12
frenzy_fridayAs a unit test for the regexes11:12
sshnaidmwithout opening files11:12
zbrIMHO, that is achieve by the use of zuul_return11:13
sshnaidmfrenzy_friday, is it some regexp that takes a whole line or a lot of reasons for failure?11:13
sshnaidmzbr, probably, if it works - no need to create it11:13
frenzy_fridaysshnaidm, lots of failure reasons. 11:13
sshnaidmfrenzy_friday, any example?11:14
frenzy_fridayI think it is good to have the failure reasons in the filename itself - helpful while ruckrovering. But I think there should be a error handling in case the file name gets too long11:15
mariosbhagyashris|ruck: https://bugs.launchpad.net/tripleo/+bug/1933639/comments/1411:15
zbrin fact we could easily make use of zuul warnings or file comments11:15
zbrhttps://zuul-ci.org/docs/zuul/reference/jobs.html#leaving-warnings11:16
frenzy_fridaysshnaidm, example - https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/796007/3/src/data/queries.yml Here if you add any more items it would go out of length11:16
sshnaidmzbr, yeah, that would be cool11:17
zbrconsidering that the reason is still inside the filename, you still have to click it, so I am not sure which is the real benefit. Exposing what failed can be done as warning and included directly into the zuul message.11:17
zbrsshnaidm: i do understand the need that caused the original approach, is just that i am not convinced it scales the best.11:17
mariosbhagyashris|ruck: let me know if you need any clarification11:18
sshnaidmzuul "warnings:" seems suitable11:18
zbrwe could look into, i am not proposing to drop file naming without having an viable alternative11:18
bhagyashris|ruckmarios, ack thanks 11:18
sshnaidmzbr, the sooner the person can know the failure reason, the better, zuul warnings/comments seems as a best solution for me11:19
sshnaidmI'm not happy about long filenames as well11:19
zbrwe should make a card for that and experiment11:19
frenzy_fridayyes , ++ that11:19
sshnaidmfrenzy_friday, another option - just cut text[:100]11:19
zbryep, limit size of text for the moment.11:20
sshnaidmif filename is messed up, anyway need to look inside11:20
frenzy_fridaysshnaidm, ok, updating the patch11:20
frenzy_fridaysshnaidm, zbr Updated https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/79945611:22
sshnaidmfrenzy_friday, just a comment, different patterns can point to same problem, for example ""ReadTimeoutError: HTTPConnectionPool" and "yum download error" both point to Yum download error from mirrors, we need to use same "message" for them11:24
mariosbhagyashris|ruck: fyi https://bugs.launchpad.net/tripleo/+bug/1934658/comments/2 cos it will show up on the CIX board 11:25
frenzy_fridaysshnaidm, yes, the converter which generates the sova patterns json is messed up. Right now everything gets added to "console" with different messages. /me checking how to update it 11:26
bhagyashris|ruckmarios, ack11:29
*** dviroel|out is now known as dviroel11:30
bhagyashris|ruckykarel, hey can you please hold node for this job https://review.rdoproject.org/r/c/testproject/+/3432111:30
*** jpena is now known as jpena|lunch11:33
dviroelmarios: o/ hi there, there was a minor update on Readme for this change https://review.opendev.org/c/openstack/tripleo-repos/+/795171 - please take a look when you have some time11:33
mariosack dviroel adding to list 11:39
*** ykarel_ is now known as ykarel11:56
*** jpena|lunch is now known as jpena12:28
mariosneeds reviews please add to your queue tripleo-ci o/ thank you https://review.opendev.org/c/openstack/tripleo-repos/+/795333 12:29
*** jpena is now known as jpena|away12:36
*** amoralej is now known as amoralej|lunch12:44
ysandeepykarel++12:54
akahatReview request: https://review.rdoproject.org/r/c/rdo-infra/ci-config/+/3267913:03
zbrfrenzy_friday: what is the endpoint using wrong certificate?13:08
frenzy_fridayzbr, "https://kibana:<passwd>@elk.review.rdoproject.org:9200/logstash-*/_search" <- this is the elasticsearch url which needs cert13:10
frenzy_fridaywe have another url - the kibana url https://kibana:<passwd>@review.rdoproject.org/elasticsearch/logstash-*/_search - this doesnt need certificate to connect, but pyelasticsearch client doesnt recognize it13:10
frenzy_fridayhttps://hackmd.io/CqKnNTOVRhC6K83sBZrbrQ?both#What-is-failing-now13:11
sshnaidmhttps://review.rdoproject.org/r/c/rdo-jobs/+/34360  https://review.opendev.org/c/openstack/tripleo-quickstart-extras/+/79895013:15
bhagyashris|ruckReviews on https://review.opendev.org/c/openstack/tripleo-ci/+/797891 please, this is already 7 days old :p13:19
bhagyashris|ruckhttps://review.opendev.org/c/openstack/tripleo-repos/+/79480113:19
bhagyashris|ruckhttps://review.opendev.org/c/openstack/tripleo-repos/+/79517113:19
bhagyashris|ruckhttps://review.opendev.org/c/openstack/tripleo-ci/+/79312013:19
bhagyashris|ruckhttps://review.opendev.org/q/topic:wallaby-upgrade-jobs13:19
bhagyashris|ruckhttps://review.opendev.org/c/openstack/tripleo-repos/+/79533313:19
bhagyashris|ruckhttps://review.opendev.org/c/openstack/tripleo-quickstart/+/79148613:20
bhagyashris|ruckhttps://review.rdoproject.org/r/q/topic:%22refactor_job_names%22+(status:open%20OR%20status:merged)13:20
bhagyashris|ruckhttps://review.opendev.org/c/openstack/patrole/+/79554813:20
zbrmarios: tq is using a specific commit from the linter, no wonder you ended up with problems.13:20
bhagyashris|ruckfolks plz add in your review list13:20
zbrsorry my language but WTF is quickstart-usb? is anyone using it?13:22
marioszbr: grateful for any comments there and i will update ... but feel free to update if you want to test something 13:22
marioszbr: let me know if/when you are poking there so i don't please? 13:22
marioszbr: thanks for checking13:22
zbrI see stuff like `ci-scripts/usbkey/RUN_ME.sh:32:28:                --playbook quickstart-usb.yml` but the file is not a playbook! is a tasks file!13:22
marioszbr: yeah noticed that before too i think we can probably nuke it13:23
marioszbr: i think it was some kind of idea like the original tripleo 'seed' node so you can deploy from a usb key 13:23
marioszbr: wouldn't expect that to be working right now i think 13:23
zbri am sure it never worked, so I prefer to remove it unless someone knows it as working13:23
marios;)13:23
zbrprevents linter from being upgrades, as newer version is "smarter"13:23
sshnaidmmarios, commented https://review.opendev.org/c/openstack/tripleo-ci/+/79339313:26
mariosack thanks for review sshnaidm will check your comments and reply 13:29
*** amoralej|lunch is now known as amoralej13:33
* zbr got bit scared observing github autopilot writing the exact code I was supposed to type.13:35
frenzy_fridayzbr, sshnaidm https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/799456 - can we pl merge this one?13:59
zbrsure +214:00
sshnaidmfrenzy_friday, done14:00
zbrusually i would have asked for tests and using a better filename-sanitizer, but I can understand the time-pressure.14:01
*** ysandeep is now known as ysandeep|afk14:08
frenzy_fridaythanks14:16
*** jpena|away is now known as jpena14:29
*** ykarel is now known as ykarel|away14:34
*** ysandeep|afk is now known as ysandeep14:58
*** amoralej is now known as amoralej|off15:39
frenzy_fridayrlandy, https://review.opendev.org/c/openstack/tripleo-ci-health-queries/+/798958 will pass when https://review.opendev.org/c/openstack/ansible-role-collect-logs/+/799456 is merged. We are installing the collection from the git repo, I think that is the reason Depends-On is not working15:40
ysandeepsshnaidm: https://code.engineering.redhat.com/gerrit/c/openstack/tripleo-ci-internal-jobs/+/25270115:55
*** marios is now known as marios|out15:56
rlandyfrenzy_friday: thank you16:09
*** ysandeep is now known as ysandeep|away16:17
*** jpena is now known as jpena|off16:33
* dviroel ☕19:10
* dviroel 💻19:29

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!