Thursday, 2019-07-04

openstackgerritJames E. Blair proposed zuul/zuul master: Fix multi-tenant caching of extra config files  https://review.opendev.org/66900800:36
*** rlandy|bbl is now known as rlandy00:48
*** igordc has quit IRC00:49
*** saneax has joined #zuul00:55
*** rlandy has quit IRC01:09
*** bhavikdbavishi has joined #zuul01:39
openstackgerritJames E. Blair proposed zuul/zuul master: Fix multi-tenant caching of extra config files  https://review.opendev.org/66900801:43
*** swest has quit IRC01:45
*** bhavikdbavishi has quit IRC01:52
*** swest has joined #zuul01:59
*** bhavikdbavishi has joined #zuul03:12
*** bhavikdbavishi has quit IRC03:19
*** bhavikdbavishi has joined #zuul03:25
*** swest has quit IRC04:35
*** altlogbot_0 has quit IRC04:57
*** altlogbot_0 has joined #zuul04:59
*** swest has joined #zuul05:29
*** jamesmcarthur has joined #zuul05:46
*** saneax has quit IRC05:50
*** jamesmcarthur has quit IRC05:50
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: DNM: Test whether jobs run  https://review.opendev.org/66905606:08
*** bhavikdbavishi has quit IRC07:07
*** pcaruana has joined #zuul07:28
*** themroc has joined #zuul07:30
*** bhavikdbavishi has joined #zuul07:32
*** wxy-xiyuan has joined #zuul07:32
*** tosky has joined #zuul07:44
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806107:46
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876707:46
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806108:02
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876708:02
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806108:05
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876708:05
*** sshnaidm|afk is now known as sshnaidm|ruck08:11
*** jangutter has joined #zuul08:13
AJaegercorvus, I'm puzzled why the zuul-cloner test fails in 668061, let me do one more test...08:17
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806108:19
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876708:19
*** pcaruana has quit IRC08:28
*** saneax has joined #zuul08:31
*** pcaruana has joined #zuul09:04
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806109:06
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876709:06
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806109:16
AJaegercorvus: I think I found it - you really need the use-cached-repos role...09:17
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806109:21
*** hashar has joined #zuul09:30
AJaegercorvus: that fixes it ^ - please check whether that is what you intented ;)09:31
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876709:32
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876709:36
*** hwangbo has quit IRC09:40
AJaegercorvus: and the node-failures come from using as labels fedora-latest (valid nodeset but not label) etc, fix coming09:44
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add a script to make platform-specific versions of jobs  https://review.opendev.org/66895509:46
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806109:46
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876709:46
AJaegercorvus: let's try again ;)09:47
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add a script to make platform-specific versions of jobs  https://review.opendev.org/66895509:53
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806109:53
AJaegerwhat fun ;(09:53
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876709:53
AJaegercorvus: this should fix now finally the node-failures09:56
*** hashar has quit IRC09:57
*** pcaruana has quit IRC10:12
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876710:14
*** bhavikdbavishi has quit IRC10:50
AJaegercorvus: full stack passes now. Exception is 668767 where the ubuntu-trusty job failed.11:00
AJaegerAnd that failure is a mirror failure, hope recheck will succeed11:01
AJaegerzuul-jobs maintainers, config-core, please review stack starting at https://review.opendev.org/66895511:01
*** hashar has joined #zuul11:09
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add Gentoo integration tests  https://review.opendev.org/66914711:22
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add Gentoo integration tests  https://review.opendev.org/66914711:26
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: DNM: Trigger gentoo runs  https://review.opendev.org/66914811:26
*** bhavikdbavishi has joined #zuul12:03
*** saneax has quit IRC12:13
*** saneax has joined #zuul12:14
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add a script to make platform-specific versions of jobs  https://review.opendev.org/66895512:16
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806112:16
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876712:16
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add Gentoo integration tests  https://review.opendev.org/66914712:16
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: DNM: Trigger gentoo runs  https://review.opendev.org/66914812:16
*** rfolco has joined #zuul12:28
*** rlandy has joined #zuul12:30
*** bkorren has joined #zuul12:55
bkorrenhi there - is there a way to make a jos not show up in the report zuul send to gerrit?12:56
bkorrens/jos/job12:56
*** bhavikdbavishi has quit IRC13:13
AJaegerbkorren: I'm not aware of that - why would you want to do so?13:13
bkorrenAJaeger, well the job is doing some background setup task that I don't want my users to see13:14
*** bhavikdbavishi has joined #zuul13:14
bkorrenAJaeger, its should have been a 'pre' playbook, but issues with the static notepool provider forced me to make it into a separate job13:15
AJaegerbkorren: and what if it fails?13:17
bkorrenAJaeger, highly unlikely - and I'll get an email13:18
AJaeger;)13:19
AJaegerbkorren: just double checked - and couldn't find any job variables for this.13:19
AJaegerso, cannot help further myself13:20
bkorrenAJaeger, ok, thnaks anyway13:20
*** bhavikdbavishi has quit IRC13:33
AJaegercorvus, config-core, stack starting at 668955 for zuul-jobs passes \o/13:54
AJaegerbut enjoy 4th of July, no urgency on the stack ;)13:54
openstackgerritMonty Taylor proposed zuul/zuul master: Spec: Add a Kubernetes Operator for Zuul  https://review.opendev.org/65918014:20
*** bkorren has quit IRC14:32
pabelangerIs there any way a running job, could know if autohold has been requested?14:33
pabelangergiven the cleanup-run phase, was thinking of maybe auto add of ssh key14:33
*** hashar has quit IRC15:07
*** tosky has quit IRC15:14
flaper87ofosos: you should be able to ommit it, if not there may be a bug15:28
flaper87ofosos: you can also try setting it to something that is not valid, it'll be ignored. But ideally, you should be able to omit that setting15:29
openstackgerritMonty Taylor proposed zuul/zuul master: Spec: Add a Kubernetes Operator for Zuul  https://review.opendev.org/65918015:33
*** chandankumar is now known as raukadah15:39
*** egustafson has quit IRC16:10
*** themroc has quit IRC16:46
SpamapSpabelanger:maybe go the other way. Let a trusted job request autohold. Then you can have a post job that checks for things that suggest the node should be held.17:56
*** saneax has quit IRC18:58
*** tosky has joined #zuul19:13
*** dmyrhorodskyi has joined #zuul19:32
*** dmyrhorodskyi has quit IRC19:43
*** gtema_ has joined #zuul19:49
*** EmilienM is now known as EvilienM19:54
*** EvilienM is now known as EmilienM19:56
*** dmyrhorodskyi has joined #zuul20:03
dmyrhorodskyi Hi, I'm running Zuul for image testing purposes with upstream playbooks and gate scripts. We currently have an issue where playbook defined in job stage RUN is executed two times at the same worker. And runs in parallel. Since these playbooks do the same scripts they usually fail at some point. I've taken some time to investigate this issue but co20:06
dmyrhorodskyiuld not find a solution. So far we have K8s deployed Zuul 3.9.1.dev63. And we can see the same playbook is running in two separate ssh connections as you can see in attached snippet. This problem persist in 90% of runs but some times it is absent. Could someone please help investigate this problem?20:06
dmyrhorodskyi   14:22   0:00  \_ sshd: zuul [priv]20:06
dmyrhorodskyi20:06
dmyrhorodskyi20:06
dmyrhorodskyi            \_ /bin/bash ./tools/deployment/osh-infra-logging/020-ceph.sh20:06
dmyrhorodskyi  0.0  0.0   7924   776 ?        S    14:39   0:00  |                           \_ sleep 520:06
dmyrhorodskyi20:06
dmyrhorodskyiELM_ARGS='' OSH_PATH=../openstack-helm/ OSH_INFRA_PATH=../openstack-helm-infra/ OPENSTACK_RELEASE=newton20:06
dmyrhorodskyi        S    14:32   0:00  |               \_ /bin/sh -c set -xe;  ./tools/deployment/osh-infra-logging/020-ceph.sh20:06
dmyrhorodskyi20:06
dmyrhorodskyi20:06
*** dmyrhorodskyi has quit IRC20:13
fungijust a guess, but could your node be listed twice in the ansible inventory for some reason?20:15
fungior are your nodes using the static driver and somehow ending up running more than one build concurrently?20:16
*** kkalina has joined #zuul20:30
kkalinafungi, hi @dmytri and me are doing the same thing. I have checked inventory multiple times during the execution, since it was my first guess, that it adds 2 same hosts to inventory, but no, inventory has 1 host only.20:37
kkalinawe are using openstack driver, and using ephemeral instances, i have double checked that we are not using the same instance twice20:38
fungithat's definitely strange... does the job have a parent which also includes the same playbook?20:40
fungibut that wouldn't explain why it only happens 10% of the time so guessing not20:40
kkalinasorry, if i have misled you, it happens 90% of the time20:41
fungiahh20:41
fungipossible i misread20:41
kkalinawell maybe not 90% but most of the time20:41
fungithe executor's debug log entries for one of the impacted builds might yield clues20:41
fungiprobably named something like /var/log/zuul/executor-debug.log20:42
kkalinait doesn't have parent, other than base jobs, that just prepare workspace etc, from zuul-jobs repo20:42
kkalinai will double check the logs, we are running executor in k8s currently. using latest docker image zuul/zuul-executor, as this is kind of POC. I can upload whole bunch of logs so you can see it, but there are no errors, other than stream_log connection resets. but as i understand that is websocket connection that shouldn't effect anything20:46
*** gtema_ has quit IRC20:47
kkalinaand we also have a bunch of `defunct` child processes of the zuul executor appearing and disapearing in `ps` output20:49
fungiyou should be able to filter the executor log for a particular build uuid and see the sequence of actions it's taking20:51
mordredfungi, kkalina: I feel like we OCCASIONALLY saw something that sounds similar like a year ago or something. I'm not sure we every fully tracked it down ... pabelanger or clarkb might remember21:33
kkalinawhenever this happens, i can see message from zuul 2019-07-04 21:32:31.422228 | [Zuul] Log Stream did not terminate```21:35
kkalinawhenever this happens, i can see message from zuul 2019-07-04 21:32:31.422228 | [Zuul] Log Stream did not terminate21:35
kkalinain the log steam, at web component, i can see that it emits logs from the first playbook run, but when logs are downloaded with upload-logs role, they are different, and output of `ps auxf`, for example contains two entries of the same playbook belonging to different ssh connections, with difference of 5 minutes.21:41
pabelangerif 2 jobs are running, on the same node, do they have the same build uuid?22:09
*** rlandy is now known as rlandy|bbl22:16
*** tosky has quit IRC22:56
*** sshnaidm|ruck is now known as sshnaidm|off23:14

Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!