Tuesday, 2016-08-23

*** david-lyle has joined #openstack-infra00:00
*** baoli has quit IRC00:00
*** david-lyle_ has joined #openstack-infra00:03
docaedoI was looking at a really unimportant bug (https://bugs.launchpad.net/app-catalog/+bug/1553572) and happened to have just spun up a new "test" server for app-catalog stuff, and found the issue00:05
openstackLaunchpad bug 1553572 in Community App Catalog "Last modified date missing from api/v1/assets" [Medium,Triaged] - Assigned to Christopher Aedo (docaedo)00:05
pabelangerclarkb: ack00:05
*** david-lyle has quit IRC00:05
*** eggshell has quit IRC00:06
*** tphummel has quit IRC00:07
*** david-lyle_ has quit IRC00:09
*** edtubill has joined #openstack-infra00:11
*** sdake has joined #openstack-infra00:16
*** Swami has quit IRC00:21
openstackgerritChristopher Aedo proposed openstack-infra/puppet-apps_site: Include package python-dateutils  https://review.openstack.org/35890900:23
*** Jeffrey4l_ has joined #openstack-infra00:23
*** pvaneck has quit IRC00:24
*** sputnik13 has quit IRC00:24
*** tonytan4ever has joined #openstack-infra00:25
*** nwkarsten has quit IRC00:26
*** nwkarsten has joined #openstack-infra00:27
*** zeroDivisible has quit IRC00:27
*** mdrabe has quit IRC00:29
*** zeroDivisible has joined #openstack-infra00:29
*** tonytan4ever has quit IRC00:31
*** nwkarsten has quit IRC00:31
*** Julien-zte has joined #openstack-infra00:34
openstackgerritMerged openstack-infra/irc-meetings: Create a new meeting for WOS-mentoring  https://review.openstack.org/35646700:36
*** caowei has quit IRC00:43
openstackgerritMerged openstack-infra/project-config: Increase packaging-deb timeouts  https://review.openstack.org/35885700:43
docaedoIf any infra cores would like to review a really exciting patch, it would be much appreciated.  https://review.openstack.org/358909 will make the Community App Catalog *gloriously* dynamic (by updating the recently added apps section based on actual dates, vs. current randomness)00:43
ianwfyi, as discussed with rcarrillocruz i'm going to see what i can do about this new review-dev -> http://eavesdrop.openstack.org/meetings/infra/2016/infra.2016-08-16-19.02.log.html#l-19400:48
*** gouthamr has joined #openstack-infra00:52
*** piet has quit IRC00:53
*** nwkarsten has joined #openstack-infra00:56
*** tqtran has quit IRC00:56
*** fguillot is now known as fguillot_afk00:57
*** fguillot_afk has quit IRC00:57
*** Apoorva has quit IRC00:59
*** fguillot has joined #openstack-infra01:00
*** nwkarsten has quit IRC01:00
*** chem has quit IRC01:02
*** Hal1 has joined #openstack-infra01:04
*** nwkarsten has joined #openstack-infra01:04
*** zxiiro-away has quit IRC01:04
*** rockstar has quit IRC01:05
*** rockstar has joined #openstack-infra01:05
*** esberglu has joined #openstack-infra01:06
*** sdake_ has joined #openstack-infra01:06
*** gyee has quit IRC01:07
*** zxiiro-away has joined #openstack-infra01:07
*** Hal has quit IRC01:07
*** thorst_ has quit IRC01:07
*** thorst has joined #openstack-infra01:08
*** sdake has quit IRC01:10
*** jamielennox is now known as jamielennox|away01:11
*** chem has joined #openstack-infra01:12
*** zhurong has joined #openstack-infra01:13
*** yanyanhu has joined #openstack-infra01:13
*** eranrom has joined #openstack-infra01:15
*** salv-orl_ has joined #openstack-infra01:15
*** jamielennox|away is now known as jamielennox01:16
*** thorst has quit IRC01:16
*** salv-orlando has quit IRC01:18
*** sdake_ has quit IRC01:19
*** eranrom has quit IRC01:19
*** sdake has joined #openstack-infra01:22
*** nwkarsten has quit IRC01:22
*** yanyanhu has quit IRC01:23
openstackgerritMerged openstack-infra/tripleo-ci: Add memory to overcloud vms up to 6144  https://review.openstack.org/35753201:25
*** zhurong has quit IRC01:25
*** salv-orl_ has quit IRC01:26
*** zhurong has joined #openstack-infra01:26
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: POC: WIP: oooq undercloud install  https://review.openstack.org/35891901:26
clarkbubuntu xenial image after resetting the cache is 7.3GB qcow201:28
clarkbso got almost a gigabyte back01:28
*** baoli has joined #openstack-infra01:28
openstackgerritDoug Wiegley proposed openstack-infra/devstack-gate: Remove q-lbaas from tempest pre-installed stuff.  https://review.openstack.org/35825801:28
clarkband the vhd is 18GB so about 3 GB saved there01:28
clarkbI have started image uploads for the new xenial image in all clouds. Goal is to get ntpdate'd xenial images everywhere so we can update d-g with that revert01:31
*** hongbin has joined #openstack-infra01:35
*** hockeynut has quit IRC01:36
*** raunak has quit IRC01:37
clarkbit would be really neat if we could read only mount a single volume on many hosts01:38
clarkbthen instead of 18GB images we could have 600MB images with a 17GB cinder volume01:39
*** coolsvap has quit IRC01:39
*** andymaier has joined #openstack-infra01:41
*** markvoelker has joined #openstack-infra01:44
*** markvoelker_ has joined #openstack-infra01:46
*** xarses has joined #openstack-infra01:48
*** markvoelker has quit IRC01:50
*** caowei has joined #openstack-infra01:52
pabelangerclarkb: wow, so, why the drop in size? Stale packages getting pulled into the images?01:54
*** nwkarsten has joined #openstack-infra01:54
clarkbpabelanger: I think our git cache bloats over time01:56
clarkbI didn't delete the old cache, just moved it aside so we can compare01:56
*** mtanin___ has quit IRC02:07
*** annegentle has joined #openstack-infra02:07
*** yuanying has quit IRC02:10
*** zhurong has quit IRC02:13
*** annegentle has quit IRC02:14
*** thorst has joined #openstack-infra02:14
*** baoli has quit IRC02:15
*** yamahata has quit IRC02:15
*** zhurong has joined #openstack-infra02:15
*** shashank_hegde has quit IRC02:16
*** rfolco has quit IRC02:18
*** thorst has quit IRC02:22
*** raunak has joined #openstack-infra02:28
*** ramishra has quit IRC02:40
*** esberglu has quit IRC02:43
*** roxanagh_ has joined #openstack-infra02:47
*** jamielennox is now known as jamielennox|away02:49
adriantHey, any reason why a new project I'm trying to get added to the openstack gerrit is taking forever?02:51
*** roxanagh_ has quit IRC02:52
adriantI resolved the only comment posted on it, but haven't gotten any updates since.02:52
adriantpatch in question: https://review.openstack.org/#/c/353818/02:53
*** tqtran has joined #openstack-infra02:54
*** admcleod has joined #openstack-infra02:57
*** gouthamr_ has joined #openstack-infra02:57
*** admcleod_ has quit IRC02:57
*** gouthamr has quit IRC02:58
*** tqtran has quit IRC03:00
*** gouthamr_ is now known as gouthamr03:01
*** andymaier has quit IRC03:03
*** nwkarsten has quit IRC03:03
*** jamielennox|away is now known as jamielennox03:06
*** fguillot has quit IRC03:06
*** ianychoi has joined #openstack-infra03:07
*** thorst has joined #openstack-infra03:20
*** vinaypotluri has quit IRC03:21
*** nwkarsten has joined #openstack-infra03:23
*** thorst has quit IRC03:26
*** salv-orlando has joined #openstack-infra03:30
*** Goneri has quit IRC03:31
*** vikrant has joined #openstack-infra03:34
*** vikrant is now known as vikrant|brb03:34
openstackgerritDonovan Jones proposed openstack-infra/shade: Allow object storage endpoint to return 404 for missing /info endpoint  https://review.openstack.org/35893703:34
armaxhi infra wizards, there’s a change in the gate queue (358753) that’s meant to alleviate some pressure on the gate03:36
armaxif that could be bumped up, that would help get rid of the recent failures like this one03:37
armaxtempest.exceptions.BuildErrorException: Server 04fe97b1-e46e-4bea-a378-afc6ec04fb7d failed to build and is in ERROR status03:37
armax2016-08-23 02:30:43.168929 |     Details: {u'created': u'2016-08-23T01:49:28Z', u'message': u'Build of instance 04fe97b1-e46e-4bea-a378-afc6ec04fb7d aborted: Failed to allocate the network(s), not rescheduling.', u'code': 500}03:37
*** salv-orlando has quit IRC03:38
*** asselin_ has joined #openstack-infra03:38
*** yamahata has joined #openstack-infra03:40
*** zul has quit IRC03:41
*** asselin has quit IRC03:42
*** gouthamr has quit IRC03:44
*** shashank_hegde has joined #openstack-infra03:45
*** zul has joined #openstack-infra03:46
*** vikrant|brb is now known as vikrant03:47
*** roxanagh_ has joined #openstack-infra03:48
*** aeng has quit IRC03:51
*** roxanagh_ has quit IRC03:52
*** vinaypotluri has joined #openstack-infra03:54
*** nwkarsten has quit IRC03:57
*** yuanying has joined #openstack-infra03:59
*** M-docaedo_vector has quit IRC04:00
*** sflanigan has quit IRC04:02
*** hongbin has quit IRC04:03
*** aeng has joined #openstack-infra04:07
*** Jaison has joined #openstack-infra04:15
*** timello has quit IRC04:15
*** markvoelker has joined #openstack-infra04:21
*** ilyashakhat has joined #openstack-infra04:22
*** markvoelker_ has quit IRC04:22
*** sarob has joined #openstack-infra04:23
*** thorst has joined #openstack-infra04:24
*** Jaison has quit IRC04:24
*** nwkarsten has joined #openstack-infra04:25
*** jraju has joined #openstack-infra04:26
*** baoli has joined #openstack-infra04:27
*** sarob has quit IRC04:27
*** markvoelker has quit IRC04:28
*** timello has joined #openstack-infra04:29
*** thorst has quit IRC04:31
*** baoli has quit IRC04:31
*** M-docaedo_vector has joined #openstack-infra04:32
*** _nadya_ has joined #openstack-infra04:36
*** jerryz has joined #openstack-infra04:37
*** salv-orlando has joined #openstack-infra04:37
*** AJaeger has joined #openstack-infra04:39
*** jtomasek has quit IRC04:42
*** salv-orlando has quit IRC04:49
*** jamielennox is now known as jamielennox|away04:49
*** edtubill has quit IRC04:50
*** kzaitsev_mb has joined #openstack-infra04:51
*** tqtran has joined #openstack-infra04:57
*** armax has quit IRC04:57
*** salv-orlando has joined #openstack-infra04:58
*** tqtran has quit IRC05:01
*** Hal1 has quit IRC05:01
*** Hal has joined #openstack-infra05:02
*** claudiub has joined #openstack-infra05:02
*** roxanagh_ has joined #openstack-infra05:02
*** roxanagh_ has quit IRC05:03
*** Sukhdev has joined #openstack-infra05:05
*** Sukhdev has quit IRC05:07
*** Sukhdev has joined #openstack-infra05:07
*** jaosorior has joined #openstack-infra05:10
*** yanyanhu has joined #openstack-infra05:11
*** tphummel has joined #openstack-infra05:12
*** kzaitsev_mb has quit IRC05:12
*** senk has joined #openstack-infra05:14
*** eranrom has joined #openstack-infra05:16
*** _nadya_ has quit IRC05:21
*** eranrom has quit IRC05:21
*** sdake_ has joined #openstack-infra05:21
*** jamielennox|away is now known as jamielennox05:23
*** sdake has quit IRC05:24
*** markvoelker has joined #openstack-infra05:29
openstackgerritGuido Günther proposed openstack-infra/jenkins-job-builder: Fix logparser for 2.0 module  https://review.openstack.org/35895605:29
*** thorst has joined #openstack-infra05:30
*** ilyashakhat has quit IRC05:31
*** raunak has quit IRC05:35
*** raunak has joined #openstack-infra05:36
*** thorst has quit IRC05:37
*** sandanar has joined #openstack-infra05:37
*** senk has quit IRC05:39
*** markvoelker has quit IRC05:39
*** ilyashakhat has joined #openstack-infra05:40
*** sdake_ has quit IRC05:41
*** tphummel has quit IRC05:41
*** raunak has quit IRC05:42
*** AJaeger has quit IRC05:42
openstackgerritSteve Martinelli proposed openstack-infra/shade: test commit for osc3.0.1  https://review.openstack.org/35896705:47
*** Sukhdev has quit IRC05:48
*** AnarchyAo has joined #openstack-infra05:48
*** ilyashakhat has quit IRC05:50
*** AnarchyAo has quit IRC05:50
*** r-mibu has quit IRC05:52
*** dstufft has quit IRC05:53
*** dstufft has joined #openstack-infra05:54
*** nwkarsten has quit IRC05:58
openstackgerritMerged openstack-infra/system-config: Set iLO/public/provisioning addresses and metadata for compute043.vanilla  https://review.openstack.org/35859805:59
*** roxanagh_ has joined #openstack-infra06:04
*** asselin__ has joined #openstack-infra06:04
openstackgerritGuido Günther proposed openstack-infra/jenkins-job-builder: Fix logparser for 2.0 module  https://review.openstack.org/35895606:04
*** senk has joined #openstack-infra06:04
*** wcriswell has quit IRC06:05
*** _oanson has joined #openstack-infra06:07
*** asselin_ has quit IRC06:07
openstackgerritMerged openstack-infra/system-config: Enable compute005.vanilla and set all IPs and metadata  https://review.openstack.org/35863106:08
*** roxanagh_ has quit IRC06:08
*** jamielennox is now known as jamielennox|away06:11
*** r-mibu has joined #openstack-infra06:12
*** pcaruana has joined #openstack-infra06:14
*** woodster_ has quit IRC06:19
*** esikachev has joined #openstack-infra06:22
*** shashank_hegde has quit IRC06:24
*** pt_15 has joined #openstack-infra06:25
*** AJaeger has joined #openstack-infra06:33
*** thorst has joined #openstack-infra06:34
*** markvoelker has joined #openstack-infra06:36
*** abregman has joined #openstack-infra06:37
*** aeng has quit IRC06:37
*** AJaeger has quit IRC06:38
*** wcriswell has joined #openstack-infra06:41
*** thorst has quit IRC06:42
*** markvoelker has quit IRC06:42
*** andreas_s has joined #openstack-infra06:43
openstackgerritMerged openstack-infra/system-config: Set compute038.vanilla IPs and metadata  https://review.openstack.org/35863806:47
*** aeng has joined #openstack-infra06:50
*** eranrom has joined #openstack-infra06:51
*** AJaeger has joined #openstack-infra06:51
*** florianf has joined #openstack-infra06:55
*** yolanda has quit IRC06:56
*** sflanigan has joined #openstack-infra06:56
openstackgerritMadhuri Kumari proposed openstack-infra/project-config: Rename Zun gate tests.  https://review.openstack.org/35898806:57
*** nwkarsten has joined #openstack-infra06:59
*** tqtran has joined #openstack-infra06:59
*** tqtran has quit IRC07:03
*** nwkarsten has quit IRC07:03
*** yolanda has joined #openstack-infra07:04
yolandagood morning07:04
AJaegergood morning, yolanda !07:05
yolandahi, AJaeger , back from holiday?07:06
yolandadid you have a good time?07:06
*** fmccrthy has quit IRC07:06
*** rackertom has quit IRC07:06
AJaegeryolanda: was great, thanks! Really relaxing - and my kids were happy ;)07:06
*** watersoul has joined #openstack-infra07:06
*** zubchick has quit IRC07:07
*** rackertom has joined #openstack-infra07:07
*** zubchick has joined #openstack-infra07:08
*** watersoul_ has quit IRC07:08
*** fmccrthy has joined #openstack-infra07:08
yolandaAJaeger, i'm going on holiday next week07:09
*** esikachev has quit IRC07:12
AJaegeryolanda: where are you going?07:12
yolandai'll stay in Spain, but a bit more in the south, to the beach07:13
yolandanear a town called Torrevieja07:13
*** penguinolog has joined #openstack-infra07:13
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/system-config: Change hpuswest for vanilla on controller and compute node definitions  https://review.openstack.org/35899207:15
AJaegeryolanda: I wish you a great vacation!07:16
*** salv-orl_ has joined #openstack-infra07:16
yolandathanks! where have you gone?07:16
AJaegerTo the north - an island in the Baltic Sea. A little bit colder than I expect it will be for you ;)07:17
AJaegerStill in Germany - so nice temperature, sea and lots to do ...07:17
jaosoriorAJaeger: so what jobs is bindep actually used for?07:18
*** tesseract- has joined #openstack-infra07:18
*** andymaier has joined #openstack-infra07:18
AJaegerjaosorior: did you read http://lists.openstack.org/pipermail/openstack-dev/2016-August/101590.html ?07:18
*** salv-orlando has quit IRC07:19
jaosoriorAJaeger: I didn't. I went for the documentation.07:19
AJaegerproject-config cores, could you review https://review.openstack.org/358446 https://review.openstack.org/354861 (already +2 by yolanda) and https://review.openstack.org/35876907:19
*** Na3iL has joined #openstack-infra07:20
AJaegerjaosorior: So, let's update documentation to not confuse you - do you want to give it a go? Or do you have still questions after reading that email?07:20
*** salv-orl_ has quit IRC07:21
openstackgerritMerged openstack-infra/system-config: Correct iLO IP and rack number for compute19.chocolate  https://review.openstack.org/35867507:21
*** vinaypotluri has quit IRC07:21
jaosoriorAJaeger: it seems clearer now. Thanks. So, is there another way of managing dependencies for devstack based tests?07:22
AJaegerjaosorior: let me dig out a link for you...07:22
jaosoriorAJaeger: sorry for the extra work; I've actually had a hard time digging out where to do this kind of thing. And even got the wrong impression of bindep.07:23
AJaegerjaosorior: http://docs.openstack.org/infra/manual/drivers.html#package-requirements - contains a link to http://git.openstack.org/cgit/openstack-dev/devstack/tree/files07:24
AJaegerjaosorior: sorry to hear that - I would really appreciate if you could help the next person on this journey.07:24
AJaegerjaosorior: So, do you want to patch - or explain to me what confused you and I'll try changing it?07:25
*** salv-orlando has joined #openstack-infra07:25
jaosoriorAJaeger: Would be nice to have a more explicit explanation of how bindep is used in openstack (also to specify that it's not used in devstack based jobs). And also some examples on how to use it would be nice. There is some explanation of the syntax, but one has to dig into the openstack projects to concretely see how it's used.07:26
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/system-config: Change hpuswest for vanilla on controller and compute node definitions  https://review.openstack.org/35899207:27
*** jpich has joined #openstack-infra07:27
AJaegerjaosorior: for examples, here's one: https://review.openstack.org/#/c/358811/07:28
AJaegerjaosorior: I'll write a section...07:28
*** matrohon has joined #openstack-infra07:30
jaosoriorAJaeger: yeah, pabelanger passed me some examples. Which was useful. Though it would be useful for the reader to get some examples in the documentation.07:30
AJaegerjaosorior: could you review 358811, please?07:30
AJaegerSuggestions on what else to add are welcome ;)07:31
*** Na3iL has quit IRC07:31
jaosoriorAJaeger: now I'm not sure if adding the devstack comment is necessary, since you did pass a link where that's mentioned. So I guess it's fine07:33
*** asettle has joined #openstack-infra07:33
openstackgerritAndreas Jaeger proposed openstack-infra/bindep: Document OpenStack usage  https://review.openstack.org/35899807:33
*** yaume has joined #openstack-infra07:34
AJaegerjaosorior: Brief documentation here ^07:34
*** vincentll has joined #openstack-infra07:34
AJaegerthanks for review.07:37
jaosoriorAJaeger: thanks for the commits and the explanation07:37
AJaegerjaosorior: so, add your bindep.txt file in barbican and leave the devstack change out - and talk to the QA team on how to add the dependencies in the best way for your plugin. That should be a separate change IMHO07:38
*** yamamoto has quit IRC07:38
*** markvoelker has joined #openstack-infra07:38
openstackgerritMerged openstack-infra/project-config: Add check-requirements to openstack-ansible-specs  https://review.openstack.org/35841107:38
jaosoriorAJaeger: will do. Thanks07:39
openstackgerritMerged openstack-infra/project-config: Add os_watcher to OpenStack-Ansible  https://review.openstack.org/35888307:39
*** thorst has joined #openstack-infra07:39
*** DrifterZA has joined #openstack-infra07:40
*** ifarkas_afk is now known as ifarkas07:42
*** markvoelker has quit IRC07:43
*** matthewbodkin has joined #openstack-infra07:43
*** e0ne has joined #openstack-infra07:45
*** thorst has quit IRC07:46
*** sshnaidm|afk is now known as sshnaidm07:48
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: POC: WIP: oooq undercloud install  https://review.openstack.org/35891907:48
*** roxanagh_ has joined #openstack-infra07:52
*** yaume has quit IRC07:53
openstackgerritMerged openstack-infra/project-config: Remove DocBook XML publishing for trove  https://review.openstack.org/35844607:53
*** yaume has joined #openstack-infra07:53
*** DrifterZA has quit IRC07:53
*** adriant_ has joined #openstack-infra07:54
*** sleviim has joined #openstack-infra07:55
*** rcernin has quit IRC07:56
*** roxanagh_ has quit IRC07:56
*** esikachev has joined #openstack-infra07:58
openstackgerritMerged openstack-infra/system-config: Change hpuswest for vanilla on controller and compute node definitions  https://review.openstack.org/35899207:58
*** zzzeek has quit IRC08:00
openstackgerritMerged openstack-infra/project-config: Add Install Guide Jobs to Barbican  https://review.openstack.org/35876908:00
sshnaidmthe zuul status page shows half of IPs as IPv6, how can I connect to telnet://2001:4800:1ae1:18:f816:3eff:fe45:326f:19885 ?? I don't have IPv6 with my ISP08:00
*** zzzeek has joined #openstack-infra08:01
openstackgerritYuval Brik proposed openstack-infra/project-config: Karbor (Smaug) Fullstack Path Fix  https://review.openstack.org/35901908:01
AJaegersshnaidm: don't you have a place you can login that has IPv6? If not, you really have to wait - we do not have public IPv4 addresses for each of our test nodes.08:01
sshnaidmAJaeger, can't think of such a place..08:02
*** _oanson is now known as oanson08:02
*** esikachev has quit IRC08:03
*** ggnel_t has joined #openstack-infra08:03
*** openstackgerrit has quit IRC08:03
sshnaidmAJaeger, in my country no ISP has IPv6 or even plans to offer it, as in many other countries btw08:04
*** openstackgerrit has joined #openstack-infra08:04
*** yaume has quit IRC08:05
*** yuanying has quit IRC08:05
*** vsaienko2 has left #openstack-infra08:06
*** hashar has joined #openstack-infra08:06
sshnaidmNAT is our everything08:07
*** adriant_ has quit IRC08:07
*** jtomasek has joined #openstack-infra08:07
AJaegersshnaidm: in that case you have to wait until the job has completed, the log files will be available as usual from logs.openstack.org...08:07
sleviimhi anteaya, how are you?08:08
*** adriant__ has joined #openstack-infra08:11
Jokke_sshnaidm: https://tunnelbroker.net/08:11
Jokke_that might help for cases like these08:12
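A note on the console address quoted above: with an unbracketed IPv6 literal, only the text after the last colon is the port. The sketch below is a minimal illustration of that split, assuming the telnet:// form shown on the status page; the helper names split_console_url and is_ipv6 are hypothetical and not part of zuul or any infra tooling.

```python
import ipaddress


def split_console_url(url):
    """Split 'telnet://HOST:PORT' into (host, port).

    For an unbracketed IPv6 literal the port is whatever follows the
    last colon; the bracketed form '[addr]:port' is accepted as well.
    """
    rest = url.split("://", 1)[-1]         # drop the scheme if present
    host, _, port = rest.rpartition(":")   # split on the LAST colon only
    return host.strip("[]"), int(port)


def is_ipv6(host):
    """Return True if host is an IPv6 address literal."""
    try:
        return ipaddress.ip_address(host).version == 6
    except ValueError:
        return False  # a hostname, not an address literal


host, port = split_console_url(
    "telnet://2001:4800:1ae1:18:f816:3eff:fe45:326f:19885"
)
print(host, port, is_ipv6(host))
```

From a machine that does have IPv6 connectivity (for example through a tunnel), passing the resulting pair to socket.create_connection((host, port)) should open the stream; without an IPv6 route the connection attempt simply fails.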
*** Goneri has joined #openstack-infra08:13
openstackgerritMerged openstack-infra/project-config: Add Node Launches to nodepool dashboard  https://review.openstack.org/35869908:13
openstackgerritMerged openstack-infra/project-config: Remove q-lbaas from this tempest list, as it is being removed  https://review.openstack.org/35825908:13
adriant__AJaeger: any update on this review: https://review.openstack.org/#/c/353818/08:14
sshnaidmJokke_, thanks, will try08:15
*** lucas-dinner is now known as lucasagomes08:15
*** yamamoto has joined #openstack-infra08:16
sleviimanteaya: it seems like it works :)08:17
*** derekh has joined #openstack-infra08:18
*** esikachev has joined #openstack-infra08:18
*** apetrich has quit IRC08:19
*** apetrich has joined #openstack-infra08:21
*** pgadiya has joined #openstack-infra08:22
AJaegeradriant__: it needs two core reviewers to look at it, it's on my long list after vacation and I'll review eventually - if no other project-config core beats me to it ;)08:23
*** esikachev has quit IRC08:23
*** Na3iL has joined #openstack-infra08:24
*** sarob has joined #openstack-infra08:24
openstackgerritsandhya proposed openstack/diskimage-builder: Add support for building images capable of UEFI  https://review.openstack.org/28778408:24
*** coolsvap has joined #openstack-infra08:25
*** sarob has quit IRC08:28
openstackgerritBartosz Kupidura proposed openstack-infra/puppet-apps_site: [wip] Glare support for app-catalog  https://review.openstack.org/35902908:31
*** dingyichen has quit IRC08:33
mrmartinclarkb, anteaya: I'll check the event duplication on groups.o.o; sometimes it happens with the events imported through the meetup.com api.08:34
openstackgerritCarlos Camacho proposed openstack-infra/tripleo-ci: Adding a 1GB swap file to the undercloud.  https://review.openstack.org/35903508:35
*** yaume has joined #openstack-infra08:35
*** pt_15 has quit IRC08:36
*** dizquierdo has joined #openstack-infra08:39
*** esikachev has joined #openstack-infra08:39
*** markvoelker has joined #openstack-infra08:39
*** yamamoto has quit IRC08:40
*** ifarkas has quit IRC08:41
openstackgerritOpenStack Proposal Bot proposed openstack-infra/project-config: Normalize projects.yaml  https://review.openstack.org/35904108:41
*** ifarkas has joined #openstack-infra08:42
*** esikachev has quit IRC08:43
*** salv-orl_ has joined #openstack-infra08:44
*** markvoelker has quit IRC08:44
*** salv-orlando has quit IRC08:45
*** thorst has joined #openstack-infra08:45
*** vincentll has quit IRC08:46
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/system-config: Fix vlan on vanilla controller and compute machines  https://review.openstack.org/35904608:46
*** vincentll has joined #openstack-infra08:48
*** yamamoto has joined #openstack-infra08:48
*** yaume has quit IRC08:48
*** salv-orl_ has quit IRC08:49
*** salv-orlando has joined #openstack-infra08:50
*** nwkarsten has joined #openstack-infra08:51
*** thorst has quit IRC08:52
*** kzaitsev_mb has joined #openstack-infra08:52
*** adriant__ has quit IRC08:53
*** nwkarsten has quit IRC08:55
*** yaume has joined #openstack-infra08:57
*** gongysh has joined #openstack-infra08:57
*** electrofelix has joined #openstack-infra09:00
*** eranrom has quit IRC09:00
*** eranrom has joined #openstack-infra09:01
openstackgerritGiulio Fidente proposed openstack-infra/tripleo-ci: [NO MERGE] Test write performances  https://review.openstack.org/35905409:01
*** ifarkas_ has joined #openstack-infra09:02
*** eranrom has quit IRC09:03
openstackgerritMerged openstack-infra/tripleo-ci: Use separated SSL endpoint environment file  https://review.openstack.org/35648809:09
*** kaisers_ has joined #openstack-infra09:10
*** _nadya_ has joined #openstack-infra09:12
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/system-config: Correct vanilla Neutron ranges  https://review.openstack.org/35905809:12
*** sambetts|afk is now known as sambetts09:14
*** d0ugal has quit IRC09:15
*** d0ugal has joined #openstack-infra09:16
*** gongysh has quit IRC09:18
*** aarefiev_ is now known as aarefiev09:19
*** caowei has quit IRC09:20
*** salv-orlando has quit IRC09:21
*** salv-orlando has joined #openstack-infra09:21
openstackgerritMerged openstack-infra/system-config: Fix vlan on vanilla controller and compute machines  https://review.openstack.org/35904609:22
*** caowei has joined #openstack-infra09:22
*** berendt has joined #openstack-infra09:23
*** AnarchyAo has joined #openstack-infra09:23
*** AnarchyAo has quit IRC09:23
*** AnarchyAo has joined #openstack-infra09:23
*** AnarchyAo has quit IRC09:23
*** AnarchyAo has joined #openstack-infra09:23
*** AnarchyAo has quit IRC09:24
*** AnarchyAo has joined #openstack-infra09:24
*** AnarchyAo has quit IRC09:24
*** AnarchyAo has joined #openstack-infra09:24
*** AnarchyAo has quit IRC09:24
*** nwkarsten has joined #openstack-infra09:27
AJaegeradriant: I commented on your review. Once I have an answer to that I can +2.09:28
*** Goneri has quit IRC09:30
*** nwkarsten has quit IRC09:31
*** javeriak has joined #openstack-infra09:33
*** pgadiya_ has joined #openstack-infra09:33
*** Goneri has joined #openstack-infra09:33
*** kzaitsev_mb has quit IRC09:33
*** pgadiya has quit IRC09:34
*** lucasagomes is now known as lucas-afk09:40
*** nwkarsten has joined #openstack-infra09:40
*** roxanagh_ has joined #openstack-infra09:40
*** amotoki has joined #openstack-infra09:40
*** markvoelker has joined #openstack-infra09:40
*** tosky has joined #openstack-infra09:41
*** dtantsur|afk is now known as dtantsur09:42
*** jerryz has quit IRC09:43
*** markvoelker has quit IRC09:44
*** nwkarsten has quit IRC09:44
*** roxanagh_ has quit IRC09:45
zigoAJaeger: Hi there!09:47
zigoAJaeger: Regarding your comment, in which file should I put deb-python-fixtures so that it's officially in packaging-deb?09:47
zigoI forgot which file.09:47
openstackgerritVolodymyr Stoiko proposed openstack-infra/project-config: Add fuel-plugin-rally project  https://review.openstack.org/35907609:51
*** amotoki has quit IRC09:51
*** yanyanhu has quit IRC09:52
*** ifarkas has quit IRC09:54
*** ifarkas_ is now known as ifarkas09:54
AJaegerzigo: I gave a link in my review to our fine manual, please read and follow it.09:55
* zigo hides behind his desk...09:55
AJaegerNo need to hide, I don't plan throwing anything through IRC ;)09:56
openstackgerritGraham Hayes proposed openstack-infra/project-config: Do not run all tempest tests on designate grenade job  https://review.openstack.org/35908009:59
*** zhurong has quit IRC09:59
*** tqtran has joined #openstack-infra10:01
openstackgerritThomas Goirand proposed openstack-infra/project-config: Add deb-python-fixtures to packaging-deb  https://review.openstack.org/35881910:01
*** yaume_ has joined #openstack-infra10:02
*** caowei has quit IRC10:04
*** tqtran has quit IRC10:05
*** yaume has quit IRC10:05
openstackgerritMerged openstack-infra/project-config: Normalize projects.yaml  https://review.openstack.org/35904110:08
*** mikelk has joined #openstack-infra10:12
openstackgerritfumihiko kakuma proposed openstack-infra/project-config: Use ovs-interface-nondefault instead of ovs-native job  https://review.openstack.org/33894410:13
*** mikelk has quit IRC10:14
*** mikelk has joined #openstack-infra10:15
*** ccamacho is now known as ccamacho|afk10:15
*** Julien-zte has quit IRC10:18
*** _degorenko|afk is now known as degorenko10:18
*** mikelk has quit IRC10:20
*** mikelk has joined #openstack-infra10:20
*** sarob has joined #openstack-infra10:25
*** sarob has quit IRC10:29
odyssey4meHi everyone. Now that https://review.openstack.org/358883 has merged, can you please add me to https://review.openstack.org/#/admin/groups/1538,members ?10:30
rcarrillocruzodyssey4me: done10:31
odyssey4methanks rcarrillocruz10:32
*** kzaitsev_mb has joined #openstack-infra10:33
openstackgerritMerged openstack-infra/tripleo-ci: Pass TRIPLEO_ROOT directory to heat_deploy_times.sh  https://review.openstack.org/35694610:35
*** yaume has joined #openstack-infra10:36
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/system-config: Replace hpuswest naming for vanilla on hiera keys  https://review.openstack.org/35910310:37
*** javeriak has quit IRC10:38
*** ramishra has joined #openstack-infra10:39
*** kzaitsev_mb has quit IRC10:39
*** yaume_ has quit IRC10:39
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/system-config: Replace hpuswest for vanilla on certfile hiera key  https://review.openstack.org/35910710:41
*** amotoki has joined #openstack-infra10:42
AJaegerrcarrillocruz: could you review https://review.openstack.org/#/c/345441/ , please? The dependency has merged...10:42
*** javeriak has joined #openstack-infra10:45
*** thorst has joined #openstack-infra10:47
*** thorst has quit IRC10:52
*** kzaitsev_mb has joined #openstack-infra10:52
*** coolsvap is now known as coolsvap_10:53
*** amotoki has quit IRC10:53
openstackgerritMerged openstack-infra/project-config: Test api-ref theming with openstackdocstheme  https://review.openstack.org/34544110:54
*** rodrigods has quit IRC10:59
*** rodrigods has joined #openstack-infra10:59
*** oanson has quit IRC10:59
*** icey has joined #openstack-infra11:00
openstackgerritMatthew Bodkin proposed openstack-infra/storyboard-webclient: Move 'Save' button up in 'Preferences' page  https://review.openstack.org/35911911:00
*** Na3iL has quit IRC11:01
*** dizquierdo is now known as dizquierdo_afk11:01
*** jkilpatr has quit IRC11:02
*** thorst has joined #openstack-infra11:04
openstackgerritshizhihui proposed openstack-infra/project-config: Make py35 voting for Horizon  https://review.openstack.org/35912311:05
openstackgerritVolodymyr Stoiko proposed openstack-infra/project-config: Add fuel-plugin-rally project  https://review.openstack.org/35907611:08
*** ccamacho|afk is now known as ccamacho11:09
*** markvoelker has joined #openstack-infra11:09
openstackgerrityolanda.robla proposed openstack-infra/puppet-infracloud: Add management of /etc/nova/ssl/private directory  https://review.openstack.org/35866811:15
openstackgerritMerged openstack-infra/system-config: Temporarily add rabbit keys to hiera  https://review.openstack.org/35702111:15
*** ramishra has quit IRC11:20
openstackgerritJuan Antonio Osorio Robles proposed openstack-infra/tripleo-ci: Enable SSL for undercloud-only job  https://review.openstack.org/35913111:20
*** markvoelker has quit IRC11:20
* AJaeger just got "Could not connect to mirror.regionone.osic-cloud1.openstack.org:80" in http://logs.openstack.org/22/359122/1/check/gate-openstackdocstheme-releasenotes/136d9b8/console.html11:21
*** ramishra has joined #openstack-infra11:22
*** thiagolib has quit IRC11:23
*** sarob has joined #openstack-infra11:24
*** thiagolib has joined #openstack-infra11:24
*** roxanagh_ has joined #openstack-infra11:28
*** sarob has quit IRC11:28
*** dprince has joined #openstack-infra11:29
*** dizquierdo_afk is now known as dizquierdo11:30
*** roxanagh_ has quit IRC11:32
*** jtomasek has quit IRC11:37
*** jkilpatr has joined #openstack-infra11:37
*** ldnunes has joined #openstack-infra11:37
*** ccamacho is now known as ccamacho|lunch11:41
*** nwkarsten has joined #openstack-infra11:42
*** amotoki has joined #openstack-infra11:42
*** rcernin has joined #openstack-infra11:43
*** rfolco has joined #openstack-infra11:47
*** nwkarsten has quit IRC11:47
openstackgerritMerged openstack-infra/system-config: Replace hpuswest naming for vanilla on hiera keys  https://review.openstack.org/35910311:48
*** jtomasek has joined #openstack-infra11:49
openstackgerritMerged openstack-infra/system-config: Replace hpuswest for vanilla on certfile hiera key  https://review.openstack.org/35910711:49
openstackgerritVolodymyr Stoiko proposed openstack-infra/project-config: Add fuel-plugin-rally project  https://review.openstack.org/35907611:49
*** jaosorior has quit IRC11:50
*** asettle has quit IRC11:51
*** jaosorior has joined #openstack-infra11:51
openstackgerritMarton Kiss proposed openstack-infra/groups: Security update for Panelizer module  https://review.openstack.org/35915511:52
openstackgerritMatthew Bodkin proposed openstack-infra/storyboard-webclient: Add a margin to the bottom of all pages  https://review.openstack.org/35911911:56
*** Goneri has quit IRC11:57
*** Na3iL has joined #openstack-infra11:57
*** yamahata has quit IRC12:00
*** pgadiya_ is now known as pgadiya12:03
*** Goneri has joined #openstack-infra12:04
*** xyang1 has joined #openstack-infra12:05
*** ansmith has joined #openstack-infra12:07
*** tpsilva has joined #openstack-infra12:08
openstackgerritIlya Shakhat proposed openstack-infra/project-config: Add new project "os-failures"  https://review.openstack.org/35581912:08
*** mordred has quit IRC12:10
*** kaisers_ has quit IRC12:11
*** Shrews has quit IRC12:11
*** zhurong has joined #openstack-infra12:11
openstackgerritVolodymyr Stoiko proposed openstack-infra/project-config: Add fuel-plugin-rally project  https://review.openstack.org/35907612:12
*** andymaier has quit IRC12:13
*** phschwartz has quit IRC12:14
*** mordred has joined #openstack-infra12:15
*** Shrews has joined #openstack-infra12:16
*** annegentle has joined #openstack-infra12:16
*** esikachev has joined #openstack-infra12:18
*** javeriak has quit IRC12:20
*** rcernin has quit IRC12:23
*** kgiusti has joined #openstack-infra12:24
*** andymaier has joined #openstack-infra12:25
*** gouthamr has joined #openstack-infra12:26
openstackgerritRamana Raja proposed openstack-infra/project-config: remove manila's glusterfs xenial jobs  https://review.openstack.org/35916712:27
*** vikasc has left #openstack-infra12:27
*** rhallisey_ has joined #openstack-infra12:28
*** rcernin has joined #openstack-infra12:28
*** abregman has quit IRC12:28
*** kushal has joined #openstack-infra12:30
*** dtardivel has joined #openstack-infra12:31
*** jed56 has joined #openstack-infra12:31
*** jcoufal has joined #openstack-infra12:32
*** coolsvap_ is now known as coolsvap12:36
openstackgerritMerged openstack-infra/system-config: Correct vanilla Neutron ranges  https://review.openstack.org/35905812:36
*** phschwartz has joined #openstack-infra12:38
openstackgerritMonty Taylor proposed openstack-infra/shade: Allow image and flavor by name for create_server  https://review.openstack.org/35525112:39
*** mdrabe has joined #openstack-infra12:40
*** edmondsw has joined #openstack-infra12:40
*** asettle has joined #openstack-infra12:40
*** vikrant has quit IRC12:41
*** kushal has quit IRC12:42
*** abregman has joined #openstack-infra12:44
*** rlandy has joined #openstack-infra12:45
mugsieanyone around to +W https://review.openstack.org/#/c/359080/ ? It is cause gate failures on most patches\12:46
mugsieit is causing*12:46
AJaegerrcarrillocruz, mordred, yolanda? ^ Any of you around to help mugsie? I've given my +2 already12:47
AJaegermugsie: that comment would have been nice in the commit message ;)12:48
mugsieAJaeger: yeah, it was kinda rushed :( - I should have12:48
mugsieWe are in the mid cycle, trying to get some of our outstanding features merged12:48
*** dtantsur is now known as dtantsur|mtg12:50
zigoAJaeger: I get a "No space left on device" when building a package, probably because using a ramdisk to build. Do you think it's fine to increase the flavor?12:50
zigoIt really was at the end of the build :(12:51
AJaegerzigo, better ask the rest of the team...12:51
mugsieyolanda: thanks!12:51
zigoAJaeger: The other way would be to *not* use a ramdisk, but then it would build slower.12:51
rcarrillocruzsorry, was at lunch12:52
*** baoli has joined #openstack-infra12:54
zigoAJaeger: I'll just disable the tmpfs for now, and then discuss...12:54
*** chlong has quit IRC12:56
*** woodster_ has joined #openstack-infra12:56
mordredzigo: sorry, it's not possible to use a different flavor12:58
*** pvinci has joined #openstack-infra12:58
zigomordred: Is there only a single flavor type available?12:59
mordredzigo: yah12:59
mordredzigo: sorry bout that12:59
zigomordred: It should be fine without using the ramdisk then.12:59
asselin__rcarrillocruz, hey, I figured out most of the issues yesterday with launch_node playbook. Next is to figure out the input file change. You seem to be using a different format than what cloud-launch wants. Why no profiles?12:59
zigomordred: Maybe I could hack something to stop using a ramdisk for only a subset of packages...13:00
rcarrillocruzasselin__: as i explained earlier, that change is to have feature parity to the current launch_node.py13:00
rcarrillocruzyou should have your own resources.yaml to feed it the role13:00
openstackgerritMerged openstack-infra/project-config: Do not run all tempest tests on designate grenade job  https://review.openstack.org/35908013:00
*** coolsvap is now known as coolsvap_13:00
rcarrillocruzrather than use the playbook that creates it on the fly13:00
openstackgerritMonty Taylor proposed openstack-infra/shade: Add support for fetching console logs from servers  https://review.openstack.org/35823213:00
rcarrillocruzthere's no advantage to use that over launch-node.py13:00
*** devananda is now known as devananda|OSE13:01
*** bin_ has joined #openstack-infra13:01
rcarrillocruza profile is a way to reuse common resources13:02
rcarrillocruzthere's no point in using a profile in the launch-node playbook , as you'll just create one server on the fly13:02
*** rhallisey_ is now known as rhallisey13:03
*** tqtran has joined #openstack-infra13:03
*** rcernin has quit IRC13:03
rcarrillocruzthat's ^ the main purpose for profiles13:03
asselin__rcarrillocruz, I guess my question then is: how do you use cloud-launcher without a profile13:03
rcarrillocruzyou can totally use the launcher role without a profile13:04
rcarrillocruzjust have a cloud with the per-cloud specific resources defined13:04
asselin__rcarrillocruz, I didn't see it in the docs or the example....and not good enough w/ ansible to rev engineer what the resource files is supposed to look like.13:05
*** DrifterZA has joined #openstack-infra13:06
rcarrillocruz"On these items you can either re-use the profiles previously defined by name or define per-cloud specific resources."13:06
*** tqtran has quit IRC13:07
mordredrcarrillocruz: I think we might need to copy that blog post into the docs ... I can never remember where it is13:08
*** yuval has joined #openstack-infra13:08
rcarrillocruzimproving docs , as putting something else than the current dummy README,  is on my todo list13:08
mordredit's always on my todo list13:09
*** matt-borland has joined #openstack-infra13:09
AJaegermordred: just add a link to the README ;)13:09
*** caowei has joined #openstack-infra13:10
asselin__rcarrillocruz, ok, thanks I see it now: - name: nonprofilescloud13:10
openstackgerritBartosz Kupidura proposed openstack-infra/puppet-apps_site: [wip] Glare support for app-catalog  https://review.openstack.org/35902913:10
AJaegerproject-config cores, I would appreciate review of https://review.openstack.org/#/c/358734/ to get rid of some extra jobs for docs project, please.13:10
*** andymaier has quit IRC13:12
*** lucas-afk is now known as lucas-hungry13:12
*** chlong has joined #openstack-infra13:13
*** mikelk has quit IRC13:13
openstackgerritBartosz Kupidura proposed openstack-infra/puppet-apps_site: w[wip] Glare support for app-catalog  https://review.openstack.org/35902913:14
*** _ari_ has joined #openstack-infra13:16
*** roxanagh_ has joined #openstack-infra13:16
*** _ari_ has quit IRC13:17
*** javeriak has joined #openstack-infra13:17
*** raunak has joined #openstack-infra13:18
*** hrubi_ has joined #openstack-infra13:18
*** senk has quit IRC13:18
*** hrubi has quit IRC13:18
yuvalHey, would appreciate you review for Smaug (Karbor) fullstack fix: https://review.openstack.org/#/c/359019/13:20
*** roxanagh_ has quit IRC13:21
*** julim has joined #openstack-infra13:21
yuval*your :)13:21
*** ccamacho|lunch is now known as ccamacho13:21
rcarrillocruzmordred: was it you who created /root/certs/gencert.sh script or maybe fungi?13:21
rcarrillocruzi wonder why it just creates the csr and key, but not the cert13:21
*** _ari_ has joined #openstack-infra13:21
rcarrillocruztalking about puppetmaster.openstack.org machine btw13:21
*** andymaier has joined #openstack-infra13:22
*** Jeffrey4l_ has quit IRC13:22
asselin__rcarrillocruz, does this look right? `cuz it doesn't work: http://paste.openstack.org/show/562463/ 'item_cloud' is undefined.13:23
*** pgadiya has quit IRC13:23
*** andymaier has quit IRC13:23
rcarrillocruzasking cos i'm not sure if there's a pattern i should use to create the certs, or can I just create the cert for the infracloud controller with 365 days ?13:23
*** andymaier_ has joined #openstack-infra13:24
*** david-lyle has joined #openstack-infra13:24
*** tonytan4ever has joined #openstack-infra13:24
rcarrillocruzhow are you running it asselin__13:25
*** piet has joined #openstack-infra13:27
openstackgerritBartosz Kupidura proposed openstack-infra/puppet-apps_site: [wip] Glare support for app-catalog  https://review.openstack.org/35902913:27
asselin__rcarrillocruz, http://paste.openstack.org/show/562465/13:27
openstackgerritMerged openstack-infra/puppet-infracloud: Add management of /etc/nova/ssl/private directory  https://review.openstack.org/35866813:29
mordredrcarrillocruz: twasn't me13:30
rcarrillocruzvalidate_certs is not an option of the launcher, but an oscc option13:30
rcarrillocruztry to remove it and give it a run13:30
anteayarcarrillocruz: fungi tends to create certs for infra services13:30
*** Julien-zte has joined #openstack-infra13:30
anteayathat I have seen13:30
rcarrillocruzgood, i'll wait for him then, thanks13:30
anteayacongratulations for being at the cert stage13:31
anteayawell done13:31
rcarrillocruzwell yeah, let's see what other yak i have to shave after i put a sane cert13:31
asselin__rcarrillocruz, same error http://paste.openstack.org/show/562468/13:32
rcarrillocruzansible --version13:32
*** yaume has quit IRC13:32
*** yaume has joined #openstack-infra13:32
*** nwkarste_ has joined #openstack-infra13:33
*** jheroux has joined #openstack-infra13:35
*** sdake has joined #openstack-infra13:36
rcarrillocruzi don't see it13:37
*** sdake_ has joined #openstack-infra13:37
rcarrillocruzi'll spin a dsvm to check13:37
*** jcoufal_ has joined #openstack-infra13:38
*** DrifterZA has quit IRC13:38
*** DrifterZA has joined #openstack-infra13:39
*** raunak has quit IRC13:39
*** dprince has quit IRC13:40
*** jraju has quit IRC13:40
*** raunak has joined #openstack-infra13:40
*** jcoufal has quit IRC13:40
*** sdake has quit IRC13:41
*** mikelk has joined #openstack-infra13:41
*** dprince has joined #openstack-infra13:41
*** raunak has quit IRC13:42
*** sdague has joined #openstack-infra13:43
*** abregman has quit IRC13:44
*** thiagop has joined #openstack-infra13:44
*** sandanar has quit IRC13:45
openstackgerritAndreas Jaeger proposed openstack-infra/openstackid: Move other-requirements.txt to bindep.txt  https://review.openstack.org/35486013:47
*** eandersson_ has quit IRC13:47
*** kushal has joined #openstack-infra13:47
*** eranrom has joined #openstack-infra13:49
*** eranrom has quit IRC13:49
*** piet has quit IRC13:50
openstackgerrityolanda.robla proposed openstack-infra/system-config: Add ssl_key_file_contents to compute nodes  https://review.openstack.org/35920813:50
*** eranrom has joined #openstack-infra13:50
*** eranrom has quit IRC13:51
*** raildo has joined #openstack-infra13:51
*** eranrom has joined #openstack-infra13:52
*** eranrom has quit IRC13:52
*** javeriak_ has joined #openstack-infra13:52
*** eharney has joined #openstack-infra13:53
*** javeriak has quit IRC13:53
*** paulobanon has joined #openstack-infra13:55
*** kzaitsev_mb has quit IRC13:56
*** tonytan4ever has quit IRC13:56
sdaguefyi, osic looks extra fubar13:56
*** eranrom has joined #openstack-infra13:56
sdaguebasically it can't produce multinode envs correctly13:56
sdaguethis is why there is a 16 hr gate13:57
mordredyes. this is known. it's not supposed to be in the multi-node providers13:57
* mordred looks13:57
sdaguemordred: actually, it's even fubar on single node it seems13:57
mordredok. that's a different thing13:57
*** dprince has quit IRC13:58
mordredsdague: is it running multi-node test though?13:58
sdaguegate-tempest-dsvm-neutron-full-ubuntu-xenial - 42 failures in 24 hours osic13:58
timrcI don't want to be annoying, but I really want to understand the problem that occurred yesterday with osc.  It seems like osc was tested with older packages than what it actually installed with.  This happened because a new version of occ was released after osc passed tests which caused a breaking change (that would have otherwise been caught).  This seems like quite the race condition.  Do I have13:58
timrcthis right?  If so, my question is why don't we automatically propose a new requirements.txt for packages like osc which include the global requirements and upper constraints that were actually used to pass tests?  This would eliminate such a race condition... I'm sure I'm missing something though.13:58
*** dprince has joined #openstack-infra13:58
*** hongbin has joined #openstack-infra13:58
sdaguemordred: in this failure cluster, I don't see any13:58
mordredtimrc: we just simply didn't have a gate job. we have added the gate job that was missing, so we should be good now13:59
*** kzaitsev_mb has joined #openstack-infra13:59
*** kaisers_ has joined #openstack-infra13:59
*** piet has joined #openstack-infra13:59
sdagueI'm at openstack days east, so real debug is hard, but given this failure rate, osic should probably be fully disabled14:00
sdagueotherwise no code is going to merge this week14:00
*** xarses has quit IRC14:01
rcarrillocruzasselin__: looks like pypi ansible has include/with_items nested broken14:01
rcarrillocruztry this14:01
rcarrillocruzpip install ansible==
rcarrillocruzand let me know if you no longer get that item_cloud failure14:01
*** eranrom has quit IRC14:02
*** irtermite has joined #openstack-infra14:02
*** eranrom has joined #openstack-infra14:02
irtermite@all at some point today, we will be updating the ssl cert for cloud1.osic.org. If you notice anything odd, hit me here.14:02
*** tqtran has joined #openstack-infra14:02
*** pcaruana has quit IRC14:02
*** oanson has joined #openstack-infra14:02
rcarrillocruzand as a matter of fact, let me throw something at the gate14:02
irtermiteping cloudnull14:02
mordredsdague: thanks - poking at it fo sho14:03
*** eranrom has quit IRC14:03
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: TEST: DONT RECHECK: periodic jobs  https://review.openstack.org/35921514:03
*** eranrom has joined #openstack-infra14:03
*** david-lyle has quit IRC14:04
*** esberglu has joined #openstack-infra14:04
*** kaisers_ has quit IRC14:04
odyssey4memordred sdague It may be useful to provide some sort of filter to allow jobs from specific repositories to go to specific providers. Or perhaps long running jobs (ie !docs, !releasenotes, !linters, etc) to go to specific providers.14:04
timrcmordred: So provided there's complete test coverage, it's guaranteed that any dep a package like osc implicitly installs should be non-breaking?14:04
odyssey4meBlocking a whole provider just because one set of jobs are failing seems counter-productive.14:04
*** yuval has quit IRC14:04
*** tbarron|gone is now known as tbarron14:05
*** coolsvap_ is now known as coolsvap14:05
odyssey4meI can, for instance, confirm that OSIC jobs are working perfectly for OpenStack-Ansible jobs - even if they aren't for devstack jobs.14:05
irtermiteodyssey4me: *thumbsup*14:05
*** bhunter71 has joined #openstack-infra14:05
asselin__rcarrillocruz, yup, that error goes away14:06
sdagueodyssey4me: well, you have to change all the job definitions to chunk them up that way. If you want to build a special class for some of these, go ahead14:07
*** berendt has quit IRC14:07
sdaguebut it becomes a logical mess to manage that given the 5000+ job definitions14:07
zigomordred: Could you please remove the current python-cryptography-vectors from the debian-openstack repo? The version from sid of python-cryptography fails to build, I'd like to use the official backport instead. Also, we need to figure out a way so that *we* can do such operation. Your thoughts would be welcome.14:07
mordredsdague: ^^14:07
*** tonytan4ever has joined #openstack-infra14:08
mordredodyssey4me: Retrying (Retry(total=4, connect=None, read=None, redirect=None)) after connection broken by 'ConnectTimeoutError(<pip._vendor.requests.packages.urllib3.connection.HTTPConnection object at 0x7f004550f950>, 'Connection to mirror.regionone.osic-cloud1.openstack.org timed out. (connect timeout=60.0)')': /pypi/simple/paramiko/14:08
*** rods has quit IRC14:08
odyssey4memordred oh dear, bad mirror14:08
rcarrillocruzas such, i expect https://review.openstack.org/#/c/359216/ to fail14:08
rcarrillocruzi'll see14:08
*** hieulq_ has joined #openstack-infra14:08
rcarrillocruzthere are 2-3 bugs opened for include/with_items and loop_var, must be one of them14:09
asselin__rcarrillocruz, how is this related to shade?14:09
mordredthe mirror seems to be returning for me at the moment from my laptop14:09
odyssey4memordred that's weird though, because we've had several successful jobs on OSIC today14:09
rcarrillocruzit's just some dummy change14:09
* mordred poking further14:09
*** eantyshev has joined #openstack-infra14:09
*** rbrndt has joined #openstack-infra14:09
* odyssey4me heads to logstash.o.o14:09
rcarrillocruzi had to revert it regardless14:09
rcarrillocruzbut that gave me perfect excuse to see the behaviour in the gate14:09
sdagueodyssey4me: the number of nodes in osic just doubled yesterday14:09
asselin__rcarrillocruz, got it.14:09
*** rods has joined #openstack-infra14:10
AJaegerI had this mirror problem earlier today as well: http://eavesdrop.openstack.org/irclogs/%23openstack-infra/latest.log.html#t2016-08-23T11:21:3514:10
sdagueonce those all got consumed I can imagine it overwhelming the mirrors14:10
*** lucas-hungry is now known as lucasagomes14:10
odyssey4mesdague hmm, yeah - so a scale issue which has never been seen before because no one provider has ever given so many nodes in a single region14:10
*** eranrom has quit IRC14:10
AJaeger3 jobs for the same change got the timeouts. And then another time later one - several jobs for same change14:10
asselin__rcarrillocruz, ok I really see now: it should install latest version of ansible to show the bug: ansible>=
asselin__rcarrillocruz, what's strange is it doesn't happen when using a profile14:11
sdagueodyssey4me: anyway, right now the fail rate basically has destroyed code merge14:11
rcarrillocruzasselin__: because the looping mechanism differs14:11
*** eranrom has joined #openstack-infra14:11
rcarrillocruzthe logic is quite different14:12
odyssey4measselin__ rcarrillocruz does that happen to involve with_flattened ?14:12
rcarrillocruzeven though i refactored code and reuse as much as possible14:12
rcarrillocruzodyssey4me: i don't use with_flattened on the role so i can't really tell14:12
*** ddieterly has joined #openstack-infra14:12
asselin__rcarrillocruz, btw, I really like the refactored code compared to when I last used it. (multiple files vs all in one file)14:12
rcarrillocruzthere was a ton of duplicated code14:13
rcarrillocruzmuch cleaner now14:13
odyssey4mercarrillocruz yeah, with_flattened is notorious for bad behaviour - just don't use it14:13
mordredI just jumped on an OSIC node and it is able to contact the mirror14:13
mordredso there don't seem to be systemic routing issues between osic nodes and the mirror14:14
sdaguemordred: just because it worked once, doesn't mean it's not a real issue14:15
mordredsdague: sigh. really? wow, that's super helpful14:16
mordredcome on man14:16
*** zz_dimtruck is now known as dimtruck14:17
*** pcaruana has joined #openstack-infra14:17
*** reed has quit IRC14:17
*** edtubill has joined #openstack-infra14:17
odyssey4memordred sdague the trend of successful jobs for OSIC in logstash is pretty good14:18
*** reed has joined #openstack-infra14:19
* odyssey4me tried to figure out how to share a search14:19
*** mikelk has quit IRC14:19
mordredsdague: this stopped happening two hours ago as best I can tell from that logstash query14:19
*** rajinir has joined #openstack-infra14:19
mordredand it seems to have been an issue for about 30 minutes14:19
sdaguemordred: ok14:20
odyssey4meoh, that's a fun one14:20
*** calebb has quit IRC14:20
mordredpabelanger: in the apache error log on that mirror server, it's listing [Tue Aug 23 12:37:06.614393 2016] [core:notice] [pid 2849:tid 140327933454208] AH00051: child pid 5643 exit signal Segmentation fault (11), possible coredump in /etc/apache214:21
sdagueit actually looks like there are 2 spikes14:21
sdagueone at 5am, and one at 8am14:21
mordredthere are 7 seg faults14:21
mordredapache segfaulting seems like a Bad Thing14:22
mordredbut there's no additional info14:22
pabelangerdid that just start happening?14:22
*** oanson has quit IRC14:23
mordredpabelanger: nope.14:23
mordredpabelanger: there are 8 in error.log.114:23
sdagueok, running out of battery and need to give up my seat. mordred thanks for looking.14:23
mordred[Mon Aug 22 06:25:56.562139 2016] [mpm_event:notice] [pid 2849:tid 140327933454208] AH00493: SIGUSR1 received.  Doing graceful restart14:23
mordredsdague: we'll get it sorted - sorry for the trouble14:24
mordredpabelanger: that log line above ^^14:24
mordredis the graceful restart after which this started happening14:24
sdagueyeh, no worries, glad it looks like it may have self resolved14:24
pabelangerI wonder if other mirrors are doing that14:24
mordredsdague: well, I'd love to find root cause - serving static files from mirrors should be a fairly rocksolid thing14:24
pabelangerwe updated logrotate the other day14:24
mordredpabelanger: worth checking14:24
pabelangermaybe it is misconfigured14:25
*** amitgandhinz has quit IRC14:25
mordredhere is error.log.1 : http://paste.openstack.org/show/562480/14:25
mordredhere is error.log: http://paste.openstack.org/show/562481/14:26
mordredthere are no segfaults in error.log.2.gz14:26
*** dtantsur|mtg is now known as dtantsur14:26
*** amitgandhinz has joined #openstack-infra14:26
*** amitgandhinz has quit IRC14:26
mordredit would be neat if there WAS a core dump14:26
pabelangerya, we need to enable that in apache14:27
*** amitgandhinz has joined #openstack-infra14:27
pabelangeralso, we haven't setup ipv6 DNS records on osic-cloud1 mirror14:27
pabelangerdoing that now14:27
mordredthat'll be nice14:27
*** sdague has quit IRC14:28
*** david-lyle has joined #openstack-infra14:28
AJaegerpabelanger, mordred: Once you fixed the mirror, could either of you review https://review.openstack.org/#/c/358734/ , please? That removes some jobs for docs team.14:28
*** adam_g has quit IRC14:29
pabelangerAh, we'll check to schedule the work for ipv6 on osic-cloud1 mirror14:29
pabelangerit lacks ipv6 right now14:29
AJaegerthanks, mordred !14:30
mordredpabelanger: aroo?14:30
mordredpabelanger: OH14:31
rcarrillocruzasselin__: https://github.com/ansible/ansible/issues/17148 looks like a good candidate14:31
dulekHi, can we get https://review.openstack.org/#/c/355678/ in? This will make running Cinder multinode grenade tests easier on patches in review.14:31
pabelangerI don't see anything in syslog that would restart apache14:31
mordredpabelanger: we made that mirror before there was ipv6 in osic14:31
pabelangermordred: yes14:31
*** gbraad has quit IRC14:31
*** hieulq__ has joined #openstack-infra14:31
mordrednod. this makes sense to me14:32
*** jaosorior is now known as jaosorior_away14:32
*** gbraad has joined #openstack-infra14:32
openstackgerritPeter Stachowski proposed openstack-infra/project-config: [trove] Add more nv scenario tests  https://review.openstack.org/35488114:32
*** hieulq_ has quit IRC14:33
*** adam_g has joined #openstack-infra14:34
*** adam_g has quit IRC14:34
*** adam_g has joined #openstack-infra14:34
*** mikelk has joined #openstack-infra14:34
*** sdake_ has quit IRC14:35
pabelangerAug 23 06:25:01 mirror CRON[27393]: (root) CMD (test -x /usr/sbin/anacron || ( cd / && run-parts --report /etc/cron.daily ))14:35
pabelangerapache2 restarts line up with the cron.daily logrotate job14:35
mordredok. well that's good14:35
pabelangernow for coredump14:35
*** piet has quit IRC14:35
mordrednow I guess the question is - did we upgrade apache or something in between those restarts?14:36
*** davidlenwell has quit IRC14:36
*** vern has quit IRC14:36
pabelangerdoesn't look like it14:36
pabelangernothing in /var/log/apt14:37
pabelangerwell, nothing related to apache214:37
*** davidlenwell has joined #openstack-infra14:37
pabelangerso, we have no swap14:38
pabelangerI wonder if we are OOMing14:38
openstackgerritMerged openstack-infra/groups: Security update for Panelizer module  https://review.openstack.org/35915514:38
mordredwouldn't that show up as oomkiller though?14:38
mordredit certainly doesn't seem like we have extra memory though14:39
pabelangerya, don't see oomkiller in logs14:39
*** esikachev has quit IRC14:40
openstackgerritMerged openstack-infra/project-config: Cleanup DocBook XML publishing  https://review.openstack.org/35873414:40
*** xarses has joined #openstack-infra14:40
*** sdake has joined #openstack-infra14:41
mordredpabelanger: I don't see any spikes, leaks or anything else in cacti graphs :(14:41
mordredpabelanger: we haven't updated any packages there since aug 1914:42
*** sleviim has quit IRC14:44
mordredpabelanger: http://logstash.openstack.org/#/dashboard/file/logstash.json?query=message:%5C%22%2Ftmp%2Fansible%2Fbin%2Fansible:%20No%20such%20file%20or%20directory%5C%22%20AND%20tags:%5C%22console%5C%22%20AND%20voting:1&from=864000s14:46
*** calebb has joined #openstack-infra14:48
pabelangerya, lines up when mirror was down14:49
*** tqtran has quit IRC14:49
pabelangerwe need to add: CoreDumpDirectory /var/cache/apache2/14:50
pabelangerinto apache2.conf14:50
*** Goneri has quit IRC14:52
pabelangerjust had another 1 too14:53
pabelanger[Tue Aug 23 14:39:47.398544 2016] [core:notice] [pid 2849:tid 140327933454208] AH00051: child pid 6620 exit signal Segmentation fault (11), possible coredump in /etc/apache214:53
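The segfault notices pasted above share a fixed shape, so tallying them per hour (for example, to compare against the 06:25 cron.daily restart) can be sketched in Python. This is a minimal sketch, assuming the Apache 2.4 error-log layout seen in the quoted lines; the names `find_segfaults`, `segfaults_per_hour` and `SAMPLE` are illustrative only and are not part of any infra tooling.

```python
import re
from collections import Counter

# Assumed layout, taken from the lines quoted in the channel:
# [Tue Aug 23 12:37:06.614393 2016] [core:notice] [pid 2849:tid ...] AH00051: child pid 5643 exit signal Segmentation fault (11), ...
SEGFAULT_RE = re.compile(
    r"^\[(?P<stamp>[^\]]+)\] \[core:notice\] \[pid \d+(?::tid \d+)?\] "
    r"AH00051: child pid (?P<child>\d+) exit signal Segmentation fault \(11\)"
)


def find_segfaults(lines):
    """Return a list of (timestamp, child_pid) for each AH00051 segfault notice."""
    hits = []
    for line in lines:
        match = SEGFAULT_RE.match(line.strip())
        if match:
            hits.append((match.group("stamp"), int(match.group("child"))))
    return hits


def segfaults_per_hour(hits):
    """Bucket segfault notices by 'Mon DD HH:00' without locale-dependent parsing."""
    counts = Counter()
    for stamp, _child in hits:
        # stamp looks like "Tue Aug 23 12:37:06.614393 2016"
        _weekday, month, day, clock, _year = stamp.split()
        counts[f"{month} {day} {clock[:2]}:00"] += 1
    return dict(counts)


# Hypothetical input: three lines copied from the log excerpts in this channel.
SAMPLE = [
    "[Tue Aug 23 12:37:06.614393 2016] [core:notice] [pid 2849:tid 140327933454208] "
    "AH00051: child pid 5643 exit signal Segmentation fault (11), possible coredump in /etc/apache2",
    "[Mon Aug 22 06:25:56.562139 2016] [mpm_event:notice] [pid 2849:tid 140327933454208] "
    "AH00493: SIGUSR1 received.  Doing graceful restart",
    "[Tue Aug 23 14:39:47.398544 2016] [core:notice] [pid 2849:tid 140327933454208] "
    "AH00051: child pid 6620 exit signal Segmentation fault (11), possible coredump in /etc/apache2",
]

if __name__ == "__main__":
    for hour, count in sorted(segfaults_per_hour(find_segfaults(SAMPLE)).items()):
        print(hour, count)
```

Against a real log, the same functions could be fed `open("/var/log/apache2/error.log")` (path assumed); the graceful-restart line is skipped because it is not an AH00051 notice.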
*** javeriak_ has quit IRC14:53
*** david-lyle has quit IRC14:54
*** raunak has joined #openstack-infra14:55
openstackgerritRicardo Carrillo Cruz proposed openstack-infra/system-config: Replace ssl cert for infracloud vanilla controller  https://review.openstack.org/35925414:56
openstackgerritMonty Taylor proposed openstack-infra/shade: Ensure per-resource caches work without global cache  https://review.openstack.org/35877614:57
openstackgerritMerged openstack-infra/shade: Allow object storage endpoint to return 404 for missing /info endpoint  https://review.openstack.org/35893714:57
*** yamamoto has quit IRC14:57
pabelangermordred: I'm going to put osic-cloud1 mirror into emergency and manually enable coredumps. I'll get a patch up for puppet too, but we'll need to land it off hours since it requires apache2 restart14:57
*** pvinci has quit IRC14:58
pabelangerActually, we need to decide if we want to restart mirror osic-cloud1 to pick up the config change now14:58
pabelangeror just wait until we land the puppet patch14:58
*** weshay is now known as weshay_afk14:58
*** hockeynut has joined #openstack-infra14:58
*** hieulq__ has quit IRC15:01
*** yamamoto has joined #openstack-infra15:01
*** vinaypotluri has joined #openstack-infra15:01
*** hieulq_ has joined #openstack-infra15:01
*** vern has joined #openstack-infra15:03
nwkarste_it looks like the openstackci puppet module hasn't been updated with the new split logstash::indexer parameters https://github.com/openstack-infra/puppet-openstackci/blob/master/manifests/logstash_worker.pp#L49 https://github.com/openstack-infra/puppet-logstash/blob/master/manifests/indexer.pp#L3815:06
*** armax has joined #openstack-infra15:06
*** yamamoto has quit IRC15:06
anteayanwkarste_: have you considered offering a patch?15:07
*** edtubill has quit IRC15:07
*** Goneri has joined #openstack-infra15:07
nwkarste_anteaya: sure i'll do it15:07
dhellmannhas someone already reported the tarball job failure for smaugclient? http://logs.openstack.org/74/74a8a033aafbc0cdc6f984b2ffb4cd327498fbd6/release/python-smaugclient-tarball/9e47bfe/console.html15:07
*** weshay_afk is now known as weshay15:07
anteayanwkarste_: wonderful15:08
fungircarrillocruz: skimming scrollback, on a conference call right now, but didn't we work out a way to use self-signed certs for the last infra-cloud deployment?15:08
rm_workHey, so ... *some* of the Zuul telnet links for running jobs are using IPv6 links ... which is great! Except, my corp network doesn't support IPv6 internally... T_T15:08
rm_workIs that expected?15:08
clarkbrm_work: yes15:08
fungirm_work: yes, some of our job nodes only have ipv6 addresses15:08
clarkbmordred: pabelanger what puppet change for the mirror?15:08
*** salv-orlando has quit IRC15:08
timrcLatest shade does not install successfully in a clean venv.  It breaks installing positional which breaks if pytz is not already installed.  If you install pytz first and then install shade, things seem good.15:08
*** jtomasek has quit IRC15:08
*** salv-orlando has joined #openstack-infra15:08
fungirm_work: if the node has a global ipv4 address we list that for the console, and any that only have global v6 we fall back on that for the url15:09
pabelangerclarkb: I'm writing a patch to enable coredumps for apache215:09
mordredtimrc: oh fantastic15:09
pabelangerbut will require apache2 to restart15:09
rm_workok, interesting15:09
anteayadhellmann: I have not seen a tarball job failure reported yet personally, no15:09
rm_workso I'm basically out of luck until it's done and posted15:09
clarkbpabelanger: gotcha15:09
fungipabelanger: trying to track down those intermittent segfaults we get with the event worker on trusty?15:09
mordredtimrc: I did not experience the problem you describe15:10
rm_workor unless I can get an ipv6 link working :P15:10
clarkbrm_work: or set up a tunnel of some sort15:10
rm_workworking on that15:10
rcarrillocruzfungi: yeah, i generated a self-signed cert on the puppetmaster machine by using the gencert.sh , then openssl command to create cert from csr+key15:10
clarkbtimrc: I am not able to reproduce that behavior15:10
clarkbtimrc: make sure your virtualenv is up to date15:10
rcarrillocruzis it ok if i leave the csr and key there? /root/certs15:10
fungipabelanger: turning on coredumps was going to be my next step for looking into that but i never got to it15:10
dhellmannanteaya : ok, thanks. I've pointed it out to saggi in #openstack-dev and since smaug is an independent project I'll let the team work on debugging it15:10
timrcLet me do a pastebin.15:10
openstackgerritMichal Dulko proposed openstack-infra/project-config: Move cinder multinode grenade job to check  https://review.openstack.org/35927515:10
fungircarrillocruz: yeah, that's fine by me15:10
rcarrillocruzwas more looking on the usual way of storing that stuff really15:11
anteayadhellmann: very good15:11
fungircarrillocruz: it's mainly been our staging area for generating csrs and then dumping the resulting keys/certs and chain certs into hiera15:11
rcarrillocruzi generated a new one with 365 days15:11
rcarrillocruzjust pushed to gate the new cert15:11
openstackgerritMerged openstack-infra/nodepool: Don't delete building DIB images  https://review.openstack.org/35884315:12
clarkbtimrc: since virtualenv bundles pip and setuptools15:13
mordredclarkb: I am doubtful we're going to be able to release shade with the needed patches today - purely due to gate depth. I could be wrong, of course, but I'm currently pessimistic that it'll happen before tomorrow15:13
mordredclarkb: in positive news though, the shade-nodepool-dsvm job totally caught a bug15:14
timrcclarkb: mordred: Here's the paste: http://paste.openstack.org/show/77jv1TCSFnMIHJyBXgtW/15:14
mordredtimrc: thanks15:14
clarkbtimrc: ya try updating virtualenv or at least upgrade setuptools and pip in the virtualenv before installing shade15:15
*** Julien-zte has quit IRC15:15
fungiokay, mediawiki finally released their security fixes after i disappeared last night, so i'll be working on that between now and the meeting https://lists.wikimedia.org/pipermail/mediawiki-announce/2016-August/000195.html15:15
mordredtimrc: that's a really old virtualenv. yah - what clarkb said15:15
anteayafungi: yay15:16
timrcclarkb: Let me try... this VM should be getting booted with a daily built Trusty image though so hrm :/15:16
rm_workclarkb / fungi: is telnet://2001:4800:1ae1:18:f816:3eff:feb9:537:19885 connectable for you? I'm trying from a machine that DOES have verified working ipv6 connectivity and not getting a connection15:16
mordredtimrc: fwiw, I never use distro packaged virtualenv - but if you do need to, definitely update pip/setuptools in the venv before you do anything in it to make it useful15:16
mordredfor that matter, I never use distro packaged pip either - but that's not your issue here15:16
openstackgerritPaul Belanger proposed openstack-infra/system-config: Enabled coredumps for apache2 on AFS mirrors  https://review.openstack.org/35927815:16
*** annegentle has quit IRC15:16
pabelangerfungi: clarkb: first stab^15:16
*** klindgren has quit IRC15:17
fungitimrc: one way around it if you're using the distro package of virtualenv is to virtualenv an intermediary venv, pip install latest virtualenv inside that, and then run that virtualenv to create your desired venvs15:17
fungithat's my usual bootstrapping trick, since i detest pip installing anything system-wide15:17
*** ePrVRSBBhG has joined #openstack-infra15:17
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool: Remove unnecessary NodePoolBuilder thread  https://review.openstack.org/35667615:17
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool: Add new ZK method for sending cluster heartbeat  https://review.openstack.org/35886815:17
openstackgerritDavid Shrewsbury proposed openstack-infra/nodepool: Add new ZK method for registering a watch.  https://review.openstack.org/35883715:17
fungii just keep a tree of venvs in my homedir with the tools i use, and deep-link from ~/bin/whatever to ~/pyenvs/whatever/bin/whatever15:18
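The bootstrapping trick fungi outlines (use the distro virtualenv only to build an intermediary venv, install the latest virtualenv inside it, then create the real venvs with that) might look something like the following sketch. It only builds the command lines rather than running them; the `bootstrap` directory name and the helper itself are assumptions for illustration, while the `~/pyenvs` and `~/bin` layout follows what fungi describes.

```python
import os
import posixpath


def bootstrap_commands(tool, home="~", distro_virtualenv="virtualenv"):
    """Return the commands for a per-tool venv built via an intermediary venv.

    Hypothetical helper: nothing is executed here, the caller decides
    whether to run the commands (for example with subprocess.check_call).
    """
    home = os.path.expanduser(home)
    bootstrap = posixpath.join(home, "pyenvs", "bootstrap")  # assumed name
    target = posixpath.join(home, "pyenvs", tool)
    return [
        # 1. The (possibly old) distro virtualenv creates the intermediary venv.
        [distro_virtualenv, bootstrap],
        # 2. Install the latest virtualenv inside it, not system-wide.
        [posixpath.join(bootstrap, "bin", "pip"), "install", "--upgrade", "virtualenv"],
        # 3. Use that up-to-date virtualenv to create the venv actually wanted.
        [posixpath.join(bootstrap, "bin", "virtualenv"), target],
        # 4. Install the tool and link it into ~/bin.
        [posixpath.join(target, "bin", "pip"), "install", tool],
        ["ln", "-s", posixpath.join(target, "bin", tool), posixpath.join(home, "bin", tool)],
    ]
```

Keeping the steps as data makes it easy to print them for review before anything touches the home directory.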
anteayafungi: I've advised smaug folks to add an infra meeting agenda item about renames, since they want one. I have told them it won't be prior to feature freeze and them attending is the best way to have one scheduled15:19
anteayain case they show up but don't have an agenda item15:19
timrcclarkb, fungi, mordred: Cool.  Thanks for the help.15:19
fungianteaya: sounds good, we already have another project with a rename requested too, so will likely address them both at the same time15:19
clarkbrm_work: what is your telnet/nc command?15:19
anteayafungi: thought as much, thank you15:19
clarkbrm_work: note the trailing 19885 is the port not part of the addr15:19
rm_worktelnet 2001:4800:1ae1:18:f816:3eff:feb9:537 1988515:19
*** andymaier_ has quit IRC15:19
pabelangerclarkb: I haven't seen an ubuntu-xenial launch failure since you uploaded the images last night15:19
anteayacloudnull: morning, so some backscroll for you15:19
pleia2good morning15:19
*** ePrVRSBBhG has quit IRC15:20
anteayacloudnull: something about being able to find the mirror on osic?15:20
anteayamorning pleia215:20
clarkbpabelanger: huh also only ovh gra1 and osic got new images all the others failed :/15:20
fungirm_work: not all telnet clients have ipv6 support by default. also it's not really a telnet server just a streaming tcp socket, you might find it saner to install netcat-openbsd and then use the nc command instead of telnet15:20
rm_workhmm k15:20
mordredcloudnull: http://logstash.openstack.org/#/dashboard/file/logstash.json?query=message:%5C%22%2Ftmp%2Fansible%2Fbin%2Fansible:%20No%20such%20file%20or%20directory%5C%22%20AND%20tags:%5C%22console%5C%22%20AND%20voting:1&from=864000s15:20
pabelangerclarkb: boo15:20
fungi(note that netcat-traditional also does not support raw v6 addresses, but netcat-openbsd does)15:20
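Since the console is, as fungi says, just a streaming TCP socket and the trailing field of the link is the port, a small client can be sketched without telnet or netcat at all. This is a minimal illustration under those assumptions; both function names are hypothetical, and the bracket handling is only there in case a link uses the `[addr]:port` form.

```python
import socket


def parse_console_url(url):
    """Split telnet://ADDR:PORT into (ADDR, PORT).

    The last colon-separated field is the port, so an unbracketed IPv6
    address such as the one quoted in the log is handled correctly.
    """
    prefix = "telnet://"
    if not url.startswith(prefix):
        raise ValueError("not a telnet:// URL: %r" % url)
    host, _, port = url[len(prefix):].rpartition(":")
    return host.strip("[]"), int(port)


def stream_console(url, timeout=10.0):
    """Yield raw chunks from the console socket until the peer closes it."""
    host, port = parse_console_url(url)
    # create_connection works for both IPv4 and IPv6 literals.
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            chunk = sock.recv(4096)
            if not chunk:
                break
            yield chunk
```

If the node is unreachable, as turned out to be the case here, `stream_console` would raise a socket error or time out rather than yield anything.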
clarkbpabelanger: I will requeue the others shortly in hopes of getting d-g updated sometime soon15:20
mordredcloudnull: we are also looking at things on the node, since there were some segfaults15:21
SamYaplemornings cloudnull15:21
*** ifarkas is now known as ifarkas_afk15:21
rm_workfungi / clarkb: I tested with "telnet -6 google.com 80" and it connects successfully via ipv6, that's why i was asking if that server worked for you guys or not15:21
SamYaplenetcat-openbsd is the only syntax i know15:21
mordredbut the segfaults were just a few and there was 30 minutes of lack of connectivity - so we're not really sure what the heck was going on15:21
*** dizquierdo has quit IRC15:22
*** DmZDsfZoQv has joined #openstack-infra15:22
fungirm_work: i'm getting no response out of 2001:4800:1ae1:18:f816:3eff:feb9:537 (not even with ping6) so the node has probably already been deleted15:22
*** baoli has quit IRC15:22
openstackgerritVladyslav Drok proposed openstack-infra/project-config: Set whole disk image options directly in devstack  https://review.openstack.org/35928515:22
fungirm_work: is it for a currently running job?15:22
rm_workfungi: yes the job is still running according to zuul15:22
*** baoli has joined #openstack-infra15:23
*** david-lyle has joined #openstack-infra15:23
fungiyeah, nodepool hasn't deleted it yet15:23
*** piet has joined #openstack-infra15:23
zigopabelanger: Hey there! Could you please remove python-cryptography-vectors from the repo? I would like to use the version from official jessie-backports instead.15:23
jeblairfungi, rm_work: no answer on ssh either15:23
zigopabelanger: The version from Sid fails to build ...15:23
zigopabelanger: That's a major blocker for other stuff to build.15:24
rm_workjeblair: so, this node just doesn't like me T_T15:24
mordredrm_work: it has emotional issues15:24
fungi| b08d8dc2-373c-4883-aa6b-0d69db680e25 | ubuntu-trusty-osic-cloud1-3788505 | ACTIVE | GATEWAY_NET_V6=2001:4800:1ae1:18:f816:3eff:feb9:537,   | template-ubuntu-trusty-1471909100 |15:24
rm_workand by me, i mean, the universe hates me today15:24
Shrewspabelanger: can you +A these nodepool reviews that add example files? https://review.openstack.org/357329  and  https://review.openstack.org/35733015:24
mordredheh. I already +2'd them15:24
Shrewsmordred: i just want them in to get the default branch change in that you already +A'd  :)15:25
mordredShrews: yah15:25
*** abregman has joined #openstack-infra15:25
*** andreas_s has quit IRC15:25
openstackgerritVladyslav Drok proposed openstack-infra/devstack-gate: Do not set ephemeral size based on driver  https://review.openstack.org/32606115:25
Shrewsmordred: you could push them through too if you feel so inclined15:26
cloudnullmordred: so the mirrors are busted or do we suspect there's a routing issue at the edge?15:26
openstackgerritPaul Belanger proposed openstack-infra/project-config: Run host lookup first for configure_mirror.sh  https://review.openstack.org/35928915:26
openstackgerritPaul Belanger proposed openstack-infra/project-config: Include dib-builddate.txt for configure_mirror.sh  https://review.openstack.org/35929015:26
fungirm_work: any chance https://review.openstack.org/286381 could nuke the network on the job node?15:26
jeblairrm_work, fungi: so i wonder if either the node has hung, or if some of the network configuration changes that job does (gate-neutron-lbaasv2-dsvm-api-namespace-nv) have affected our ability to connect from the outside.15:26
jeblairfungi: right that :)15:26
mordredcloudnull: right now they're working - so I have no legit clue why they were not working for that period15:26
pabelangerShrews: looks like mordred has you covered15:27
Shrewsaye. danke15:27
mordredcloudnull: pabelanger is adding coredump config to apache so we can look into the segfault15:27
*** davidlenwell has quit IRC15:27
rm_workfungi: I don't think so ...15:27
cloudnullmaybe an issue at the DFW edge?15:27
pabelangerzigo: remove from which repo?15:27
mordredcloudnull: but yeah - it's possible there was a 30 minute issue15:27
cloudnulloh, was the mirror server segfaulting?15:27
mordredalso - we dont have ipv6 on the mirror server15:27
*** njohnston has left #openstack-infra15:27
mordredso all of the nodes are bouncing through the neutron router too15:27
mordredso one could imagine an issue with neutron for a bit15:27
cloudnullI can look into that15:28
mordredcloudnull: yah - yesterday we started having occasional apache segfaults with no other info15:28
mordredcloudnull: http://paste.openstack.org/show/562481/ and http://paste.openstack.org/show/562480/15:28
jeblairwe've seen apache segfault on mirrors in other clouds, but not all of them. (like, i think it happens more often in ord)15:28
dougwigjeblair, fungi, rm_work - nothing in that job should've affected inbound access, i think.15:29
cloudnullare jobs in the OSIC working now? -cc sdague ?15:29
mordredcloudnull: yes15:29
zigopabelanger: The jessie-newton-backports one.15:29
mordredcloudnull: the failures stopped 2 hours ago15:29
mordredjeblair: it was not in error.2 ... only in 1 and current - but we did not see any changes in the server that would correlate with the introduction of the segfaults15:29
*** yamahata has joined #openstack-infra15:30
mordredalthough it's possible that the lack of them in error.2 is circumstantial15:30
rm_workjeblair / clarkb / fungi: Looks like the job just re-queued >_>15:30
rm_workdid you guys do that?15:30
*** dprince has quit IRC15:30
pabelangerWe have a nice wave going on in osic-cloud1 too: http://grafana.openstack.org/dashboard/db/nodepool-osic?from=1471962597278&to=147196619727815:30
fungirm_work: if zuul thinks the node has fallen off the network, it blames the provider and restarts the job on a new node15:30
rm_workhmm lol k15:31
pabelangerneed to see what is going on there, if job failures or just running short lived jobs15:31
fungirm_work: if it continues to loop like this, then... probably the change itself15:31
openstackgerritMerged openstack-infra/nodepool: Add an example logging.conf for development  https://review.openstack.org/35732915:31
cloudnullsymmetrical :)15:31
cloudnull^ pabelanger15:31
openstackgerritMerged openstack-infra/nodepool: Add a fake-secure.conf  https://review.openstack.org/35733015:31
openstackgerritMerged openstack-infra/nodepool: Set default branch to feature/zuulv3  https://review.openstack.org/35732615:31
rm_workanother one just requeued...15:31
*** yamamoto has joined #openstack-infra15:31
*** abregman is now known as abregman|mtg15:31
rm_worki don't think it's the change. but yeah, we'll wait and see what happens15:31
openstackgerritMerged openstack-infra/nodepool: Add zookeeper-servers to fake config  https://review.openstack.org/35732715:32
pabelangerrm_work: which review are you looking at?15:32
*** dprince has joined #openstack-infra15:32
openstackgerritMerged openstack-infra/system-config: Add ssl_key_file_contents to compute nodes  https://review.openstack.org/35920815:32
cloudnullmordred: maybe something w/ the event mpm settings allowing memory consumption to get too high?15:32
*** yamamoto has quit IRC15:32
*** yamamoto has joined #openstack-infra15:33
rm_workpabelanger: ^^ it passed recently and the only changes since then shouldn't be able to break it, but ... <_<15:33
*** yamamoto has quit IRC15:33
rm_workwe'll just wait and see what happens, could just be something intermittent15:33
mordredcloudnull: we were thinking something related to oom ... but we didn't see any mentions of oomkiller running15:33
dougwigrm_work: let's see what happens.  we did just enable some templates that mess with namespaces.15:33
*** david-lyle has quit IRC15:33
fungirm_work: if you find another one that's frozen, i can try to grab the nova console log before it gets deleted15:33
rm_workfungi: k... going to look and figure out if there's any way it COULD be this change15:34
fungiwish i'd thought to do that on the last one while it was still showing active in nova15:34
openstackgerrityolanda.robla proposed openstack-infra/puppet-infracloud: Set the ssl_key_file_contents to mandatory  https://review.openstack.org/35929415:34
clarkbthis morning's ubuntu-precise image build took less than an hour15:35
rm_workfungi: telnet://
rm_workfungi: that one looks to be nonresponsive15:35
*** piet has quit IRC15:35
cloudnullis log rotate doing a graceful restart? maybe logs are filling and its rotating more often than it should or calling multiple restarts? we just added the log rotate bits right?15:35
*** senk has joined #openstack-infra15:37
fungirm_work: weird, that one's in internap not osic15:37
*** esikachev has joined #openstack-infra15:37
mordredcloudnull: it is doing a graceful - but we're not seeing failures everytime it does15:37
*** sdague has joined #openstack-infra15:38
pabelangerzigo: okay, deleted15:38
cloudnullmordred: hum, interesting... I'd be curious to see what the output is from the logs.15:39
cloudnullmordred: do you have the mirror instance(s) UUID(s) on hand?15:40
mordredcloudnull: one sec15:40
cloudnullI can go look up the compute nodes and see if there's something else a-foot here.15:40
rm_workfungi: maybe i'm not able to connect for a different reason :P15:40
mordredcloudnull: 54bce385-4b3e-4a14-aa2a-f87f5ddd6bc015:40
*** davidlenwell has joined #openstack-infra15:40
cloudnullo/ SamYaple -- missed your earlier ping :)15:40
mordredcloudnull:           "hostId": "2f26ddc2dafefc5d995c6daeda125b7159bd210399d1763c4b3a2a82",15:40
mordredcloudnull: in case that's useful15:41
fungirm_work: well, i'm taking an inordinate amount of time to track down credentials for checking that one because the all-clouds.yaml on our puppetmaster is incomplete for some reason15:41
cloudnullmordred: tyvm15:41
rm_workfungi: yeah nm looks like THAT one is a firewall issue, because it's not a RAX node, we can only get to RAX nodes :P15:41
*** vincentll has quit IRC15:41
rm_workfungi: it's ok, we'll figure it out, hopefully the next runs of those other jobs will be fine15:41
clarkbfungi: missing the internap jenkins account?15:41
mordredfungi: oh - that seems unpleasing15:41
cloudnullmordred:  if we turn-on LBaaS would more than 1 mirror server help things ?15:41
fungiclarkb: missing username and password for it, yes. probably incorrect hiera keys15:41
*** esikachev has quit IRC15:42
mordredcloudnull: I don't know enough yet - I'd like to grok the problem first if we can ... at the moment this is still total mystery15:42
cloudnullor if we did HAP w/ multiple mirror server backends ?15:42
clarkbcloudnull: and hope they don't all die at the same time15:42
fungianyway, i need to get back to this wiki security update for now15:43
cloudnullclarkb: ++15:43
fungirm_work: yeah, i'm able to get to the console on anyway15:43
fungiso sounds like egress filtering in your office15:43
rm_worki am able to from another location15:44
rm_workinterestingly the RAX nodes are whitelisted15:44
rm_workor, not surprisingly15:44
rm_worki'll verify better next time, if we see another similar issue on our jobs15:45
*** piet has joined #openstack-infra15:45
rm_workbut, i can say given which jobs passed already and which re-queued, it shouldn't be anything to do with the change15:45
fungirm_work: we had to pick a fairly oddball port number in an attempt to avoid colliding with any services that might try to listen on the job nodes (including the various service ports devstack made up)15:45
*** piet has quit IRC15:45
*** zhurong has quit IRC15:46
*** tesseract- has quit IRC15:46
*** piet has joined #openstack-infra15:47
*** kaisers_ has joined #openstack-infra15:48
*** esikachev has joined #openstack-infra15:49
*** yaume has quit IRC15:50
*** ddieterly is now known as ddieterly[away]15:50
openstackgerritChangcheng Intel proposed openstack-infra/jenkins-job-builder: update base_email_ext to adapt Email-ext plugin  https://review.openstack.org/35513915:50
mordredfungi: have I mentioned how silly I think egress filtering is?15:50
openstackgerritPaul Belanger proposed openstack-infra/project-config: Use aliasByNode for Node Launches panel  https://review.openstack.org/35929915:50
fungimordred: i have a choir you're free to preach to15:51
jeblairpabelanger, mordred, cloudnull: i'm still catching up -- but do i understand correctly that pabelanger is adding the ipv6 address for the osic mirror to dns now?15:51
openstackgerritCyril Roelandt proposed openstack-dev/hacking: Add a check to make sure the right assert* method is used  https://review.openstack.org/35418515:52
jeblairpabelanger, mordred, cloudnull: and because it wasn't in dns, that's why all the logs indicate the requests come from a single ipv4 addr -- the nat server doing v6->v4 translation?15:52
pabelangerjeblair: Yes, but not at the moment. I think we need to schedule it to avoid some downtime15:52
fungiwhy would that cause downtime?15:52
openstackgerritMerged openstack-infra/system-config: Replace ssl cert for infracloud vanilla controller  https://review.openstack.org/35925415:52
mordredthere is no ipv6 address15:52
*** mmedvede has quit IRC15:53
fungioh, the mirror has no ipv6 address configured?15:53
jeblairpabelanger, mordred, cloudnull: and so if we're looking for a component problem, not only should we consider neutron, but also the v4->v6 nat system?15:53
*** kaisers_ has quit IRC15:53
pabelangerfungi: right, we need to first do that and update the network15:53
mordredjeblair: yah15:53
irtermitepabelanger: mordred: fungi: cloudnull: is this mirror an instance hosted on osic?15:53
mordredand I believe cloudnull is looking at host logs15:54
mordredirtermite: yes15:54
cloudnullI am15:54
fungiindeed, eth0 has only a linklocal v6 addy15:54
irtermiteand it has no ipv6 because that was how it was originally deployed mordred?15:54
*** sdague has quit IRC15:54
mordredwe need to either add a new nic to the existing system, or just boot a new mirror15:54
mordredirtermite: yes. that's right15:54
*** ddieterly[away] is now known as ddieterly15:54
pabelangerI mean, we could launch a replacement server then just switch the DNS15:54
mordredwhen we booted this mirror, the GATEWAY_NET_v6 network did not exist15:54
*** timello has quit IRC15:54
jeblair[i went to look at apache logs to see what they looked like around the time of the errors, and ... well, it's difficult to discern patterns with the nat address :( ]15:55
irtermiteunderstood, that's what I figured15:55
pabelangerlikely faster than waiting for a window to add ipv615:55
irtermitewould it not help to add another interface to it and put it on gateway_net_v6?15:55
jeblairpabelanger: do we have a free ipv4?15:55
fungiwell, actually it has a linklocal address and a ula15:55
fungibut regardless, no global v6 address15:55
pabelangerjeblair: I haven't checked. I can do that now15:55
*** ggnel_t has quit IRC15:55
clarkbjeblair: we should have a quota of 5 fips in that project so as long as the cloud has one then we should be fine15:55
*** _nadya_ has quit IRC15:55
clarkbwe can also move the fip to the new server15:56
mordredirtermite: we could theoretically do that - but it's likely more work than just spinning up a new one - this is a fairly stateless server15:56
clarkb(which may be disruptive)15:56
*** mmedvede_ is now known as mmedvede15:56
irtermiteyea mordred, i would agree. can't hurt to have another mirror15:56
jeblairclarkb: yeah, i imagine it might be; i like the replace with new server+ip option if we can15:56
*** pcaruana has quit IRC15:56
mordredthe floating ip we currently have is on GATEWAY_NET ... does the neutron there have a router from GATEWAY_NET to GATEWAY_NET_V6 ?15:56
*** matrohon has quit IRC15:56
jeblair(even if it's just for 30 seconds or so, it would tank a bunch of jobs)15:57
irtermitemordred: mirror_v4, mirror_v6    + cloudnull    ;)15:57
*** Sukhdev has joined #openstack-infra15:57
irtermitemordred: no I do not believe we have a router between15:57
pabelangerjeblair: our current quota for FIPs is 1 of 3 in openstackci15:57
mordredk. then the new server may have to be a little extra special15:57
mordredor we may need to figure out a router15:58
clarkbmordred: why would we have to route between them?15:58
clarkbmordred: can't we just have one interface on GATEWAY_NET and one on GATEWAY_NET_V6?15:58
mordredbecause we'll need an IPv4 FIP attached to the private ipv4 address we get from GATEWAY_NET_V615:58
mordredclarkb: we can also do that15:58
irtermiteclarkb: that's what i suggested15:58
irtermiteorrrrr, just spin up a new mirror on both stacks and THEN destroy the old one15:59
pabelangeryes, we've done that a few times now15:59
clarkbya I think spin up a new one with two interfaces, check both v4 and v6 work happily, update dns, delete the old one15:59
*** matthewbodkin has quit IRC15:59
irtermitewell, you know my vote clarkb and mordred16:00
irtermitebut, I'm just your account manager   ;)16:00
irtermitewhat do I know?   ;)16:00
*** jaosorior_away is now known as jaosorior16:00
jeblairirtermite: i think you and clarkb and pabelanger are all agreeing, right?  or am i missing something?16:00
mordredyah. I think we're all on the same page16:01
irtermitejeblair: yup... it would appear so16:01
cloudnullmordred: so there are several instances on the node where the mirror is, however the node was not our of mem or slamming the CPU. Dual socket 48cores, 256MEM, constant tx/rx but nothing that the nic cant handle. so I can't imagine it was a noisey neighbor issue for that host or the host causing KVM OOM issues.16:01
jeblairok. cool. carry on.16:01
*** timello has joined #openstack-infra16:01
fungithe remaining question is whether we want to scale our osic max-servers back down temporarily while we do that16:01
cloudnull's/not our/not out/'16:01
mordredfungi: it's not currently causing problems16:01
*** david-lyle has joined #openstack-infra16:01
irtermiteNO SCALE DOWN FOR YOU!16:01
mordredthe problems existed for 30 minutes about 3 hours ago16:01
cloudnullthis very well could've been an issue within the RAX DC --cc irtermite16:02
*** cody-somerville has joined #openstack-infra16:02
irtermitehaven't heard anything though16:03
*** gyee has joined #openstack-infra16:03
irtermitenothing globally impacting anyway, cloudnull  https://status.rackspace.com/16:03
cloudnullmaybe we can reach out to dcops and see if there was an incident or issue ~3 hours ago?16:03
zigopabelanger: http://mirror.dfw.rax.openstack.org/debian-openstack/pool/main/p/python-cryptography-vectors/ <--- There's still a binary package remaining there, probably because there was some stuff in the POST.16:04
irtermitecloudnull: ^^16:04
zigopabelanger: I waited until they are all done, it should be fine if you delete the package now.16:04
zigopabelanger: How do you delete the package btw, is there a kind of API for reprepro?16:04
cloudnullmordred: ^^16:04
zigopabelanger: I'm asking so we can design some kind of API together ...16:04
cloudnullmaybe we were in the subset of customers that were impacted?16:05
*** ganesan has joined #openstack-infra16:05
pabelangerzigo: yes, there are some commands in reprepro to remove a package16:05
irtermiteat your desk cloudnull? have time to sit with Johnny and me for SSL cert, or want to resolve this issue first?16:05
cloudnull:) i am16:05
fungicloudnull: irtermite: wow. i had skimmed the status page and never realized that "cloud monitoring" was a valid category for network outages16:05
pabelangerzigo: Ya, not right now, but we can also loop mordred in also. We should be able to process it via a job16:05
zigopabelanger: Let's say I would add a file "reprepro-delete" to the uploads folder, and your script would process it?16:06
fungicloudnull: irtermite: i didn't even bother clicking into that incident because i figured it was just a problem impacting the monitoring service16:06
cloudnullsorry ? :\16:06
clarkbdoing more rough maths we are averaging just over an hour per image build now. Which is about half as long as it took before. So that is an improvement. I am also working on getting these xenial images uploaded so that we can have ntpdate for d-g16:06
pabelangerzigo: removed now16:07
zigoThanks so much.16:07
pabelangerbasically: reprepro --confdir /etc/reprepro/debian-openstack remove jessie-newton-backports python3-cryptography-vectors16:07
fungicloudnull: just trying to correct my previous assumptions... are "cloud monitoring" outages a catch-all for outages impacting other services too?16:07
pabelangerreprepro --confdir /etc/reprepro/debian-openstack --nokeepunreferencedfiles deleteunreferenced16:07
jeblairfungi, clarkb, irtermite: if i'm tz-mathing correctly, that incident was about 10 hours ago, but our errors were about 4 hours ago...?16:07
pabelangervos release mirror.deb-openstack16:07
*** Sukhdev has quit IRC16:08
*** piet has quit IRC16:08
pabelangerzigo: Ya, we'd need to pass a list of files to delete to reprepro-delete job16:08
clarkbpabelanger: zigo what would trigger the need for a delete?16:08
pabelangerzigo: then add some logic into zuul to only run said job when the file changes16:08
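The job being discussed could be sketched along these lines: read a list of packages to delete and turn it into the same three commands pabelanger pasted above. The file format (one `distribution package` pair per line, `#` for comments) and both function names are assumptions made for illustration; only the reprepro and vos invocations come from the log.

```python
# Values copied from the commands pasted in the log.
CONFDIR = "/etc/reprepro/debian-openstack"
AFS_VOLUME = "mirror.deb-openstack"


def parse_delete_list(text):
    """Parse a hypothetical reprepro-delete file into (distribution, package) pairs."""
    entries = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        parts = line.split()
        if len(parts) != 2:
            raise ValueError("expected 'distribution package', got %r" % raw)
        entries.append((parts[0], parts[1]))
    return entries


def removal_commands(entries):
    """Build the command sequence: remove each package, prune files, release the volume."""
    cmds = [
        ["reprepro", "--confdir", CONFDIR, "remove", dist, pkg]
        for dist, pkg in entries
    ]
    if cmds:
        # Only clean up and publish when something was actually removed.
        cmds.append(["reprepro", "--confdir", CONFDIR,
                     "--nokeepunreferencedfiles", "deleteunreferenced"])
        cmds.append(["vos", "release", AFS_VOLUME])
    return cmds
```

Rejecting malformed lines up front is a deliberate choice in this sketch, since a typo in the list would otherwise delete the wrong package from a published repository.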
clarkbpabelanger: zigo understanding that might help determine how we want to do it16:08
clarkbjeblair: yes 10-11 hours ago or so for the rax incident16:09
*** pcaruana has joined #openstack-infra16:09
pabelangerclarkb: one scenario might be incorrectly publishing a package to the wrong repo16:09
clarkbpabelanger: that should almost never happen if we automate things right?16:10
pabelangerwe have 2 today, jessie-newton and jessie-newton-backports16:10
*** piet has joined #openstack-infra16:10
*** tphummel has joined #openstack-infra16:10
clarkbpabelanger: eg if we get the builds working properly we shouldn't have to worry about that much16:10
pabelangerclarkb: right, shouldn't happen but we also don't impose validation16:10
pabelangerclarkb: it could only happen if somebody patched the wrong branch for example16:10
clarkbpabelanger: could they just revert to fix?16:10
pabelangerthat wouldn't remove the package from reprepro today16:11
clarkb(assuming revert makes new package and reuploads I think that should work)16:11
pabelangersince we already build and published it16:11
pabelangerthat would work16:11
pabelangerbut zigo would need to increment his version number to be greater than the broken package16:11
pabelangerwhich is possible16:11
clarkbso revert + version bump16:12
pabelangerya, that would fix this example16:12
pabelangerthe issue is, you need to delete a released package for some reason16:12
pabelangerand remove it from reprepro with no replacement16:12
openstackgerritNate Johnston proposed openstack-infra/project-config: Make neutron-fwaas functional job not experimental  https://review.openstack.org/35932016:13
irtermitejeblair: hrm, good point (timestamp)16:13
*** hashar has quit IRC16:14
mordredShrews: we have per-resource expiration in clouds.yaml for shade-nodepool test: http://logs.openstack.org/97/315697/7/check/gate-dsvm-nodepool-src-shade/0df6717/logs/etc/openstack/clouds.yaml.txt.gz16:15
mordredShrews: any chance you know what puts those settings there?16:15
clarkbpabelanger: ya I think that situation is the one we actually need to solve for16:15
*** DrifterZA has quit IRC16:15
mordredShrews: I cannot find _anywhere_ that writes out expiration times16:15
mordredShrews: (I want to add a setting is why I'm asking)16:16
*** jpich has quit IRC16:16
Shrewsmordred: i don't understand the question. those are manually added to clouds.yaml16:18
mordredShrews: yah - where?16:18
pabelangerclarkb: going to start work on the replacement mirror in osic-cloud116:18
Shrewsmordred: umm, by the user that owns it?16:18
mordredShrews: like, they end up in clouds.yaml for the test job16:18
*** jaosorior has quit IRC16:18
clarkbpabelanger: ok don't forget to pass both nics to the boot command which launch node may not do?16:18
Shrewsmordred: oh. isn't it a fixture?16:19
clarkbactually the new ansible stuff probably just has a list you can set?16:19
mordredShrews: _something_ is adding it to /etc/openstack/clouds.yaml16:19
Shrewsmordred: shade/tests/unit/fixtures/clouds/clouds_cache.yaml16:20
pabelangerclarkb: ya, so keep the current network (openstackci-subnet1) and add GATEWAY_NET_v6 right?16:20
Shrewsmordred: oh, in /etc... maybe in devstack itself16:21
clarkbpabelanger: or switch openstackci-subnet1 to GATEWAY_NET and dont fip16:21
pabelangerclarkb: ya, lets do that16:21
mordredShrews: nope. the cache settings in the clouds.yaml file are different ... yah - I looked in devstack but couldn't find it - I'll look again though16:21
mordredShrews: this is extra weird :)16:21
pabelangerclarkb: any preference of NIC order?16:21
clarkbpabelanger: I dont think it matters. If I had to choose v6 first since thats the rest of the cloud for us16:22
jeblairfungi: updated https://wiki.openstack.org/wiki/Meetings/InfraTeamMeeting to add an item on the doc-in-afs spec -- basically an RFC and RFVolunteers before we vote on it next week.16:22
jeblairAJaeger: are you back?16:22
Shrewsmordred: yeah, i dunno man16:23
AJaegerjeblair: yes, I am16:23
fungijeblair: thanks!16:23
jeblairAJaeger: welcome!  i hope you were able to stay away sufficiently when you were away.  ;)16:23
jeblairAJaeger: will you have time to join the infra meeting today?16:24
mordredShrews: nodepool's devstack plugin16:25
mordredShrews: wow. that was fun16:25
Shrewsmordred: neat16:25
jeblairAJaeger: i want to discuss https://review.openstack.org/276482 and make sure you have an opportunity to participate16:25
AJaegerjeblair: I'll join the meeting - and I was sufficiently away ;)16:26
AJaegerthanks, jeblair16:26
openstackgerritMonty Taylor proposed openstack-infra/nodepool: Add floating-ip batching settings to clouds.yaml  https://review.openstack.org/35932716:27
mordredShrews: ^^ that's why I was looking for it16:27
*** shashank_hegde has joined #openstack-infra16:31
rcarrillocruzin other news:16:32
rcarrillocruzcontroller00.vanilla.ic.openstack.org : ok=15   changed=5    unreachable=0    failed=016:32
rcarrillocruzpuppet on controller infracloud converges16:32
*** florianf has quit IRC16:32
mordredrcarrillocruz: woot!16:33
*** yamamoto has joined #openstack-infra16:33
*** eharney has quit IRC16:35
*** sputnik13 has joined #openstack-infra16:35
rcarrillocruzafk for a bit16:36
*** zul has quit IRC16:36
rcarrillocruzsee ya in the meeting16:36
*** jraju has joined #openstack-infra16:38
*** fernnest has joined #openstack-infra16:38
*** jraju has quit IRC16:39
AJaegerteam, do we have zuul-cloner installed as /usr/zuul-env/bin/zuul-cloner on the proposal node? I see very strange failures in post jobs that run on proposal node but work fine elsewhere. Could it be that the version on the proposal node is very old?16:41
*** yamamoto has quit IRC16:41
*** AnarchyAo has joined #openstack-infra16:42
*** AnarchyAo has quit IRC16:42
*** cody-somerville has quit IRC16:42
*** AnarchyAo has joined #openstack-infra16:42
*** _nadya_ has joined #openstack-infra16:43
*** _nadya_ has quit IRC16:43
fungiAJaeger: http://paste.openstack.org/show/562508/16:44
*** bethwhite_ has quit IRC16:44
fungiAJaeger: so, yes, looks old-ish16:45
AJaegerthanks, fungi. How can we update this to current zuul-cloner? The one that is able to be used in post and periodic jobs...16:45
fungiAJaeger: latest from git should be zuul==2.5.1.dev4  # git sha 569b7a316:46
AJaegerthat's new enough ;916:46
fungiAJaeger: probably we need to double-check the puppet we have for creating that env on persistent job nodes and make sure it's correctly set to upgrade16:47
*** asselin has joined #openstack-infra16:47
*** zul has joined #openstack-infra16:48
*** yamahata has quit IRC16:48
*** asettle has quit IRC16:48
clarkbok osic, internap, and bluebox should all have up to date xenial images with ntpdate installed16:48
clarkbworking on ovh and rax16:49
*** asettle has joined #openstack-infra16:49
*** asettle has quit IRC16:49
*** asettle has joined #openstack-infra16:49
clarkbalso image builds continue to be quicker so we may want to consider semi periodic cache cleanups (but also dig into why having a cache makes builds slower and not faster)16:49
*** awayne has quit IRC16:49
clarkbgreghaynes: FYI ^ dib caching behavior makes things slower16:49
AJaegerfungi, https://git.openstack.org/cgit/openstack-infra/system-config/tree/modules/openstack_project/manifests/slave_common.pp#n155 - ensures only installation.16:49
greghaynesclarkb: wah16:50
clarkbgreghaynes: I moved the old cache dir aside and builds are now half as long (from 2 hours to about 50 minutes)16:50
*** asettle has quit IRC16:50
pabelangermordred: any ideas why openstack network list in osic-cloud1 returns Connection failure that may be retried. ?16:51
clarkbpabelanger: see if neutron client does the same thing?16:51
AJaegerfungi, I don't see how we can ensure that those are uptodate - my puppet knowledge is nearly zero.16:51
pabelangerclarkb: sure, testing16:52
fungiAJaeger: we includes more than just you, luckily16:52
*** e0ne has quit IRC16:52
pabelangerclarkb: that works as expected16:52
clarkbneat. Does openstackclient have a trace flag? if so you can compare the API calls between the two clients and see if they differ16:53
ganesanI am getting an auth exception when nodepool tries to ssh to the nodes (I have been hitting this problem for a long time and could not fix it). I verified the ssh keys manually and they work16:53
AJaegerfungi, this problem hits us now with projects using constraints everywhere with failing translation jobs (running on the proposal node)16:53
pabelangerclarkb: sure, let me get some food before I shave this yak16:53
AJaegerSo, any help here is welcome16:53
ganesanIs it possible to check the ssh keys injected into an image16:53
fungi#status log The https://wiki.openstack.org/ site (temporarily hosted from wiki-upgrade-test.o.o) has been updated from Mediawiki 1.27.0 to 1.27.1 per https://lists.wikimedia.org/pipermail/mediawiki-announce/2016-August/000195.html16:53
openstackstatusfungi: finished logging16:53
ganesanlike I could mount the image and check the given ssh keys are injected into the image created by nodepool builder16:54
fungiganesan: you should be able to mount the image on a loop block device, yes16:55
fungimount -o loop /path/to/image/file16:55
*** jed56 has quit IRC16:55
fungii think it's that easy anyway, though been a while since i've needed to16:55
clarkbfor a qcow2 you have to nbd it I think but for raw that should work. There are all sorts of directions on doing it on the internets if you google for mounting $image type16:56
ganesanfungi: thanks16:56
fungier, that's `mount -o loop /path/to/image/file my_mountpoint` i guess16:56
fungiwhere my_mountpoint is some local directory you're going to use as the mountpoint16:57
fungiand yeah, for raw. qcow2 needs some extra decoding16:57
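Since the loop mount only works for raw images, it helps to check the format before trying. A minimal sketch, assuming only that qcow images begin with the magic bytes `QFI\xfb` followed by a big-endian version number; the function name is hypothetical, and "raw" here simply means no qcow header was found:

```python
import struct

QCOW_MAGIC = b"QFI\xfb"  # first four bytes of a qcow/qcow2 header


def detect_image_format(path):
    """Return 'qcow2', 'qcow' or 'raw' based on the file header.

    'raw' is a fallback meaning "no qcow magic found"; other formats
    (vhd, vmdk, ...) are not recognised by this sketch.
    """
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) == 8 and header[:4] == QCOW_MAGIC:
        # Bytes 4-7 hold the format version as a big-endian uint32.
        version = struct.unpack(">I", header[4:8])[0]
        return "qcow2" if version >= 2 else "qcow"
    return "raw"
```

A "raw" result suggests `mount -o loop` is worth trying; a "qcow2" result means the image needs the nbd route first.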
*** mtanino has joined #openstack-infra16:57
greghaynesclarkb: huh... are those logs being put on nodepool.o.o?16:58
greghaynesclarkb: something I can look at a before/after16:58
clarkbgreghaynes: yup, http://nodepool.openstack.org, compare today's image logs to those from a few days ago16:58
clarkbrax image uploads just succeeded16:59
clarkbso just waiting on ovh now before we can merge the ntpdate in d-g change16:59
*** derekh has quit IRC16:59
*** dtantsur is now known as dtantsur|afk17:01
clarkbgreghaynes: we can actually poke at those logs together later today17:02
mordredpabelanger: oh, lovely17:02
clarkbI will try and prefetch them onto laptop so that we don't have to derp with tether for them17:02
greghaynesclarkb: ah, yea17:02
fungiAJaeger: so... i think we should probably just set https://git.openstack.org/cgit/openstack-infra/system-config/tree/modules/openstack_project/files/zuul-env-reqs.txt to whatever version we want to install (2.5.0?) and let it get the release from pypi. however, before we do, i notice that there is no subscribe/notify between File['/etc/zuul-env-reqs.txt'] and Python::Virtualenv['/usr/zuul-env'],17:03
fungiwhich will be needed to get it to update17:03
*** jerryz has joined #openstack-infra17:04
*** javeriak has joined #openstack-infra17:04
*** Apoorva has joined #openstack-infra17:04
clarkbmordred: is current plan to get shade/occ things merged up today then restart nodepool tomorrow?17:04
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: POC: WIP: oooq undercloud install  https://review.openstack.org/35891917:04
mordredclarkb: yah. I mean, gate willing17:04
*** Apoorva has quit IRC17:04
clarkbmordred: ok, looks like my nodepool change merged so we don't have to wait for that one17:05
AJaeger2.5.0 as version should be ok17:05
*** Apoorva has joined #openstack-infra17:05
*** Guest25180 is now known as med_17:06
*** med_ has joined #openstack-infra17:06
*** med_ is now known as medberry17:06
*** medberry is now known as med_17:06
fungiwho here has familiarity with the python::virtualenv puppet class? latest release looks like it only has a way to create a virtualenv, but no mechanism to upgrade/replace one? https://github.com/stankevich/puppet-python/blob/1.14.2/manifests/virtualenv.pp17:06
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: WIP: DONT MERGE Testin OOOQ job  https://review.openstack.org/35914617:07
fungiis there some special puppet dance you're expected to follow to remove and replace a resource?17:07
*** AnarchyAo has joined #openstack-infra17:07
*** AnarchyAo has quit IRC17:08
*** AnarchyAo has joined #openstack-infra17:08
*** AnarchyAo has quit IRC17:08
mordredpabelanger: neutron support in python-openstackclient is very new17:08
*** lucasagomes is now known as lucas-afk17:08
*** AnarchyAo has joined #openstack-infra17:08
mordredpabelanger: and shaky in places17:08
*** AnarchyAo has quit IRC17:08
*** AnarchyAo has joined #openstack-infra17:08
*** AnarchyAo has quit IRC17:08
mordredpabelanger: using neutron client at this point is still probably more betterer17:08
fungior maybe it magically knows to upgrade packages in a virtualenv when the requirements file changes17:08
*** AnarchyAo has joined #openstack-infra17:08
*** asselin has quit IRC17:09
ganesanfungi: there is a script to mount qcow2 file - http://git.openstack.org/cgit/openstack-infra/project-config/tree/nodepool/elements/README.rst#n6117:09
*** sambetts is now known as sambetts|afk17:09
*** ddieterly is now known as ddieterly[away]17:10
*** abregman|mtg is now known as abregman17:10
*** mtanin___ has joined #openstack-infra17:10
ganesanand I see the id_rsa.pub keys are injected into the image under /home/jenkins/.ssh/authorized_keys17:10
*** hashar has joined #openstack-infra17:10
ganesanbut still I am getting Auth exception17:11
clarkbfungi: I think the dance may be to run pip within the virtualenv17:11
*** AnarchyAo has joined #openstack-infra17:11
*** hashar is now known as hasharAway17:11
fungiahh, yep, looks like if a requirements parameter is specified, there's an exec on it with a refreshonly which is installing packages, but i don't see a subscribe or notify to it from the requirements file17:11
openstackgerritMarc Aubry proposed openstack-infra/project-config: Add python34-jobs on Almanach  https://review.openstack.org/35934517:12
fungii wonder if we could notify Exec["       exec { "17:12
*** mtanino has quit IRC17:12
fungier, notify Exec["python_requirements_initial_install_${requirements}_${venv_dir}"]17:12
*** abregman has quit IRC17:12
fungibut yeah, i suppose we could also just have our own exec subscribed to the file and make it require the venv resource17:13
fungii'll give that a shot17:13
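The "exec subscribed to the file" idea amounts to: re-run the install only when the requirements file content changes. A rough Python sketch of that trigger logic, not the Puppet change itself; the stamp file and all function names are hypothetical, and Puppet tracks file changes through its own mechanisms:

```python
import hashlib
import os


def file_digest(path):
    """SHA-256 hex digest of the file's contents."""
    with open(path, "rb") as f:
        return hashlib.sha256(f.read()).hexdigest()


def needs_refresh(requirements_path, stamp_path):
    """True when the requirements file differs from the last recorded digest."""
    if not os.path.exists(stamp_path):
        return True  # never installed from this file before
    with open(stamp_path) as f:
        recorded = f.read().strip()
    return recorded != file_digest(requirements_path)


def record_refresh(requirements_path, stamp_path):
    """Remember the digest of the requirements file we just installed from."""
    with open(stamp_path, "w") as f:
        f.write(file_digest(requirements_path))
```

A caller would run something like `/usr/zuul-env/bin/pip install -U -r /etc/zuul-env-reqs.txt` when `needs_refresh` returns True, then call `record_refresh`.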
clarkbganesan: usually the best way to debug that is to manually boot an instance off that image then attempt sshing to it as the nodepool user17:13
*** ilyashakhat has joined #openstack-infra17:14
*** asselin has joined #openstack-infra17:15
*** brad_behle has joined #openstack-infra17:17
*** mikelk has quit IRC17:19
openstackgerritDoug Hellmann proposed openstack-infra/release-tools: fix announce.sh for projects with setup_requires  https://review.openstack.org/35935117:19
*** hieulq_ has quit IRC17:20
clarkbgreghaynes: waiting for todays ubuntu trusty build to finish then I will have logs from the 21st and today to compare17:20
clarkbthey are all at http://nodepool.openstack.org too17:21
*** ilyashakhat has quit IRC17:21
brad_behleHello, I'm trying to find what exact ubuntu-trusty image is used for the gate jobs? openstack-infra/devstack-gate/blob/master/README.rst has a link to system-config modules/openstack_project/templates/nodepool/nodepool.yaml.erb, but that file doesn't exist.17:22
*** piet has quit IRC17:22
*** piet has joined #openstack-infra17:22
*** piet has quit IRC17:23
openstackgerritJeremy Stanley proposed openstack-infra/system-config: Update zuul-env on job nodes  https://review.openstack.org/35935217:25
fungiAJaeger: ^17:25
*** _nadya_ has joined #openstack-infra17:25
clarkbbrad_behle: the file moved to openstack-infra/project-config/nodepool/nodepool.yaml17:25
*** oanson has joined #openstack-infra17:25
AJaegerthanks a lot, fungi!17:25
clarkbbrad_behle: the dib images are all defined at the end of that file. That repo also includes some of the dib elements we use to build the images17:26
AJaegerbrad_behle: could you update the README.rst, please?17:26
*** zul has quit IRC17:26
openstackgerritJeremy Stanley proposed openstack-infra/devstack-gate: Update link to nodepool.yaml in README.rst  https://review.openstack.org/35935517:28
fungiAJaeger: brad_behle: ^ there17:28
fungii was already updating it17:28
*** yamahata has joined #openstack-infra17:28
AJaegerthanks, fungi17:28
*** sarob has joined #openstack-infra17:29
openstackgerritMonty Taylor proposed openstack-infra/shade: Support dual-stack neutron networks  https://review.openstack.org/35751717:29
*** senk has quit IRC17:30
*** Swami has joined #openstack-infra17:30
*** senk has joined #openstack-infra17:30
*** samueldmq has joined #openstack-infra17:31
*** nwkarste_ has quit IRC17:31
openstackgerritTim Burke proposed openstack-dev/hacking: Add optional H204 to check that assert(Not)In is used  https://review.openstack.org/35935817:31
openstackgerritTim Burke proposed openstack-dev/hacking: Add optional H205 to check that assertTrue/False is used  https://review.openstack.org/35935917:31
openstackgerritTim Burke proposed openstack-dev/hacking: Add optional H206 to check that assertIs(Not)Instance is used  https://review.openstack.org/35936017:31
*** nwkarsten has joined #openstack-infra17:32
*** zz_ja is now known as zz_zz_ja17:32
brad_behleclarkb, AJaeger, Thanks!17:33
*** hockeynut has quit IRC17:33
*** sshnaidm is now known as sshnaidm|afk17:34
*** tqtran has joined #openstack-infra17:35
*** oanson has quit IRC17:36
*** nwkarsten has quit IRC17:36
pabelangermordred: Ya, it appears that way. http://paste.openstack.org/show/562521/17:36
*** kaisers_ has joined #openstack-infra17:37
clarkbpabelanger: ouch17:37
pabelangerya, I cannot use launch-node right now for osic-cloud117:38
pabelangertrying to see what changed17:38
*** e0ne has joined #openstack-infra17:39
*** ilyashakhat has joined #openstack-infra17:39
*** tqtran has quit IRC17:39
*** shashank_hegde has quit IRC17:40
brad_behleclarkb: I don't have accounts with any of those cloud providers, I was hoping I could find a link to download the exact version of ubuntu-trusty and deploy a few VMs of it myself.  the networking-ovn project needs one of its gate jobs to have specific kernel features that I don't think are in the ubuntu images that are out there17:40
*** ilyashakhat has quit IRC17:40
*** eharney has joined #openstack-infra17:40
*** ilyashakhat has joined #openstack-infra17:40
brad_behleI wanted to set up a few vms and see if those kernel patches are really needed, so we can figure out how to get them into the gate job we are going to create.17:40
clarkbbrad_behle: we don't currently publish them (we should but they are huge and unwieldy), but you can build them yourself using dib. The tools/build-image.sh script in project-config should make it easy17:40
clarkbbrad_behle: we use the normal ubuntu cloud vm kernels17:41
*** cody-somerville has joined #openstack-infra17:41
clarkbso we aren't doing anything special to remove modules or add them17:41
*** _nadya_ has quit IRC17:41
*** nwkarsten has joined #openstack-infra17:41
clarkbbrad_behle: it shouldn't be hard to determine if the ubuntu trusty/xenial kernels have what you need17:42
*** kaisers_ has quit IRC17:42
pabelangerclarkb: mordred: failure log: http://paste.openstack.org/show/562522/17:42
*** csomerville has joined #openstack-infra17:42
*** nwkarsten has quit IRC17:42
*** tosky has quit IRC17:42
*** shashank_hegde has joined #openstack-infra17:42
*** nwkarsten has joined #openstack-infra17:43
brad_behleclarkb: Okay, I'll take a look at build-image.sh and dib and try to build them myself.  Thanks.17:45
*** jcoufal_ has quit IRC17:45
clarkbbrad_behle: do you know what version of the kernel/modules you need?17:45
*** cody-somerville has quit IRC17:45
clarkbubuntu xenial is a 4.4 kernel iirc17:45
clarkbianw is also working on fedora 24 images which I don't think are functional yet but should have a 4.5 kernel looks like17:46
clarkboh maybe 4.617:46
clarkbso newer17:46
*** senk has quit IRC17:46
brad_behleclarkb: I just started on this an hour ago, so I don't know exactly, but I think they are patches that aren't in a released kernel yet, right now the developers are applying kernel patches to the test systems.  I did see a reference to 4.6 somewhere.17:46
*** ganesan has quit IRC17:47
brad_behleclarkb: on the test vagrant environment, uname -a shows: Linux compute2.ursula 4.6.0+ #1 SMP Wed Jun 8 15:23:19 CDT 2016 x86_64 ...17:47
brad_behleursula is the name of the ansible project used to deploy.17:47
clarkbpabelanger: that almost looks like it can't http to that url?17:48
*** nwkarsten has quit IRC17:48
pabelangerclarkb: ya, was just about to ask cloudnull to take a look17:48
clarkbbrad_behle: you probably need to sort out what features/versions you need first?17:48
pabelangercloudnull: mind helping with an issue we are seeing in osic-cloud1? http://paste.openstack.org/show/562522/17:48
brad_behleMy first step was just to find out what the gate jobs are using, install it and run the existing tests with the new code, watch some tests fail, then figure out what kernel patches/features are needed17:48
*** vhosakot has joined #openstack-infra17:48
openstackgerritDoug Wiegley proposed openstack-infra/project-config: Remove stale octavia job that never gets run, and is broken  https://review.openstack.org/35936717:48
clarkbpabelanger: that port is listening and does accept connections for me17:49
clarkbbrad_behle: that seems backward to me but ok...17:49
brad_behleclarkb: I'll actually be doing both in parallel, trying to set up an environment and also tracking down the people building the test kernels we have that are working and see what patches they are pulling in17:50
*** nwkarsten has joined #openstack-infra17:51
*** rbrndt has quit IRC17:51
fungibrad_behle: yeah, publishing the images we're using is something we considered, but they're in the neighborhood of 5gb compressed each and have our access credentials baked into them, so would take some local surgery to make it possible for someone else to log in17:52
clarkbfungi: its actually 8GB compressed now and 20ish uncompressed17:52
fungioh, yeeowch17:52
fungiguess i haven't looked at the image sizes in a while17:52
brad_behleclarkb, fungi: I don't suppose you have seen this requirement before, where a gate job has needed kernel patches or something else not in a standard cloud distro at the moment?17:52
clarkbthose numbers fell slightly when I cleared out the cache; we are in the middle of rebuilding everything with the new cache17:52
clarkbbrad_behle: not really17:53
fungibrad_behle: for upstream jobs we usually recommend running on a distro that has a new enough kernel (e.g. fedora 24)17:53
clarkbrussellb at one time cared for a special image with a custom kernel compile iirc but that was it17:53
clarkbI think that lasted about a cycle?17:53
fungibut if you need a completely nonstandard kernel, not just a newer kernel, you'd currently require a separate node type with its own image17:54
pabelangerclarkb: only 1 launch failure in osic-cloud1 in the last 6 hours!17:54
clarkbpabelanger: nice17:54
fungiwhich we're hesitant to add without extremely good use cases due to the impact on our already strained image updating solution17:54
*** e0ne has quit IRC17:55
*** zul has joined #openstack-infra17:55
clarkbbrad_behle: a better understanding of the requirements would definitely help17:55
brad_behlefungi: Okay, good to know.17:55
clarkbwhich is why I would sort those out first17:55
fungithough it might be interesting to come up with a way of saving job state on nodes and supporting controlled reboots during job runtime. it wouldn't be a trivial change to our framework though17:56
brad_behleclarkb: Yeah, agreed.  Once I understand the requirements better, I'll definitely let you know :-)17:56
*** eharney has quit IRC17:56
clarkbfungi: I think zuulv3 will support that since ansible can have a reboot and wait portion of the playbook17:56
timrcWe should add this image and node type for April 1 https://www.gnu.org/software/hurd/hurd/running/cloud.html17:57
clarkbtimrc: and run pep8 on it right?17:58
*** _nadya_ has joined #openstack-infra17:58
clarkb(I assume hurd has working python but that might be a bad assumption)17:58
fungiclarkb: oh, that'll open up an entire new class of job options if we support rebooting at runtime. awesome17:58
timrcno virtio drivers though, that's rough17:58
*** dimtruck is now known as zz_dimtruck17:59
*** baoli has quit IRC17:59
fungiclarkb: for example rebooting a subnode in a multi-node job onto a separate decompressed image to work around lack of nested vir acceleration18:00
fungier, virt18:00
*** e0ne has joined #openstack-infra18:00
*** _sarob has joined #openstack-infra18:01
AJaegerinfra cores, could you review my changes to move other-requirements to bindep, please? https://review.openstack.org/#/q/topic:bindep-mv+projects:openstack-infra+status:open is the list of open changes18:01
*** zz_dimtruck is now known as dimtruck18:01
*** ansmith has quit IRC18:01
*** jamielennox|away is now known as jamielennox18:01
openstackgerritAndreas Jaeger proposed openstack-infra/project-config: Merge zuul-git-prep-upper-constraints/zuul-release-git-prep-upper-constraints  https://review.openstack.org/35257518:02
clarkband now just ovh bhs1 lacks the new xenial image18:03
*** baoli has joined #openstack-infra18:03
openstackgerritAndreas Jaeger proposed openstack-infra/project-config: Merge zuul-git-prep-upper-constraints/zuul-release-git-prep-upper-constraints  https://review.openstack.org/35257518:04
*** sarob has quit IRC18:04
pabelangerclarkb: jeblair: I'd like to see if we can clean-up the following error message: http://paste.openstack.org/show/562528/18:04
*** DrifterZA has joined #openstack-infra18:04
pabelangerclarkb: jeblair: it doesn't do much today in nodepool except increase the size of our log file18:04
AJaegerfungi, for infra-specs I have a sphinx warning fix - could you review https://review.openstack.org/352218 , please?18:04
clarkbpabelanger: the ssh connect time out should be user configurable18:05
clarkbpabelanger: maybe we just need to increase the timeout?18:05
pabelangerclarkb: Ya, we could do that. I just need to figure out which cloud it is18:05
AJaegermtreinish, mordred : could you review my constraints change for cookiecutter, please? https://review.openstack.org/#/c/352758/18:06
*** jcoufal has joined #openstack-infra18:06
cloudnullpabelanger clarkb looking now18:07
*** ilyashakhat has quit IRC18:07
cloudnullsorry split brained today18:07
*** eharney has joined #openstack-infra18:08
anteayajust today?18:08
anteayaI consider that normal operating mode18:08
openstackgerritJames Slagle proposed openstack-infra/tripleo-ci: DO NOT MERGE - Periodic test.  https://review.openstack.org/34694918:08
cloudnullah neutron cant deal with X-Forward-For in liberty.18:09
cloudnullwhen we upgrade to mitaka thats supposedly fixed18:09
cloudnull^ clarkb pabelanger18:09
openstackgerritPaul Belanger proposed openstack-infra/nodepool: Include ip address for ssh_connect exception  https://review.openstack.org/35936918:10
cloudnull^^ RE: network list failing18:10
pabelangerclarkb: jeblair: that should give us some additional information on failure ^18:10
*** _nadya_ has quit IRC18:10
pabelangercloudnull: okay, cool. That explains the failure18:10
*** csomerville has quit IRC18:10
cloudnullyea we tried to add rewrite rules on the F5 to deal with that however it failed miserably and caused other issues18:11
cloudnullso we left it as-is and know its a limitation18:11
cloudnullnova net-list still works to list networks though18:11
cloudnullwhich may be of use?18:12
*** hockeynut has joined #openstack-infra18:13
anteayafungi: the meetbot in -meeting didn't show the results of a vote that just occurred18:13
anteayais there a solution?18:13
anteayafungi: keystone has moved on, but it is odd behaviour for the meetbot18:14
*** rbrndt has joined #openstack-infra18:14
*** bin_ has quit IRC18:14
fungianteaya: not sure. are the vote options there case sensitive? i know it's also picky about having more than one space separating commands and data18:15
pabelangercloudnull: okay, let me try that18:15
*** amitgandhinz has quit IRC18:15
pleia2anteaya: I wonder if adding so many invalid choices at the end confused things?18:15
anteayait is possible18:15
pleia2(it really is just yes/no, not all the rest)18:15
anteayaI just witnessed it and thought it was odd18:16
*** caowei has quit IRC18:16
*** amitgandhinz has joined #openstack-infra18:16
anteayathey don't seem concerned18:16
anteayaso we don't have to do anything I guess, just datapoint18:16
*** caowei has joined #openstack-infra18:16
fungione person did #vote  Yes (two spaces) and the rest did #vote yes or #vote no (lower-case)18:16
*** csomerville has joined #openstack-infra18:17
*** jcoufal has quit IRC18:17
anteayawould the two spaces make meetbot forget the results?18:17
fungiwhile the bot said the valid choices were Yes and No (upper-cased first letter)18:17
anteayawhich no one typed18:18
fungiso it's possible it thought none of those were valid votes18:18
anteayaso yeah if case sensitive it got no votes18:18
*** degorenko is now known as _degorenko|afk18:18
fungii don't know if #vote followed by more than one space is a problem, but we recently saw someone trying to #startmeeting  something (two spaces) and it consistently ignoring them18:19
openstackgerritHenry Gessau proposed openstack-infra/project-config: Fix neutron failure rates dashboard integrated jobs list  https://review.openstack.org/35846218:19
*** javeriak_ has joined #openstack-infra18:19
fungiso at least some meetbot commands are sensitive to having an extra separating space18:19
anteayait would seem so18:19
anteayanow I know18:19
*** csomerville has quit IRC18:20
anteayaI'll try to share our theory during open discussion18:20
anteayafungi: pleia2 thank you18:20
*** javeriak has quit IRC18:21
*** electrofelix has quit IRC18:21
*** Sukhdev has joined #openstack-infra18:22
clarkbthat part of the parser (the command part) is all in meetbot proper18:23
*** javeriak has joined #openstack-infra18:23
clarkbthe case sensitivity is in the bits I added, we could make it case insensitive if we wanted18:23
openstackgerritJeremy Stanley proposed openstack-infra/puppet-mediawiki: Switch from old recaptcha to recaptcha-nocaptcha  https://review.openstack.org/35820218:23
openstackgerritJeremy Stanley proposed openstack-infra/puppet-mediawiki: Clean up old recaptcha parameters  https://review.openstack.org/35821018:23
openstackgerritJeremy Stanley proposed openstack-infra/puppet-mediawiki: Parameterize database connection settings  https://review.openstack.org/35819518:23
openstackgerritJeremy Stanley proposed openstack-infra/puppet-mediawiki: Update scope.lookupvar() calls to shorter @ lookup  https://review.openstack.org/35819418:23
anteayaclarkb: what was the reason for adding case sensitivity?18:23
*** raunak has quit IRC18:23
fungiyolanda: ^ updated from scope[] to @ per your suggestion18:23
openstackgerritMerged openstack-infra/infra-specs: Fix warnings  https://review.openstack.org/35221818:24
*** pvaneck has joined #openstack-infra18:25
*** javeriak_ has quit IRC18:25
*** pfallenop has quit IRC18:25
*** jcoufal has joined #openstack-infra18:26
clarkbanteaya: things are case sensitive by default typically; we didn't add it so much as we didn't make an effort to make it insensitive18:27
*** AnarchyAo has left #openstack-infra18:28
*** csomerville has joined #openstack-infra18:28
anteayaoh sorry, I interpreted "the case sensitivity is in the bits I added" as you added case sensitivity18:28
anteayabut you mean it was included in other things you added18:28
anteaya+1 make meetbot case insensitive18:29
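Both failure modes discussed above (lower-case `yes` against an option listed as `Yes`, and two spaces after `#vote`) can be tolerated by normalising before matching. A minimal sketch of that idea; this is not meetbot's actual parser, and `parse_vote` is a hypothetical name:

```python
def parse_vote(line, options):
    """Return the canonical option a '#vote' line selects, or None.

    Matching ignores case and collapses any run of whitespace between
    the command and the choice.
    """
    parts = line.split()  # split() with no argument collapses repeated spaces
    if len(parts) != 2 or parts[0].lower() != "#vote":
        return None
    by_lower = {opt.lower(): opt for opt in options}
    return by_lower.get(parts[1].lower())
```

With options `["Yes", "No"]`, both `#vote yes` and `#vote  Yes` resolve to `Yes`, while an unlisted choice returns `None` so it can be rejected explicitly rather than silently dropped.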
openstackgerritMerged openstack-dev/cookiecutter: Adjust tox.ini for constraints  https://review.openstack.org/35275818:29
*** pfallenop has joined #openstack-infra18:30
mordredpabelanger: I see your note above about launch_node and osic18:30
mordredpabelanger: did you get anywhere with that or should I look?18:31
pabelangermordred: I'm trying to hack around it18:32
pabelangerby using nics18:32
pabelangerand not network18:32
pabelangerthe reason this works in nodepool, is because we do that, nics18:32
pabelangerotherwise, we'd have the same problem18:32
clarkbpabelanger: mordred it might work if you tell shade to not neutron in clouds.yaml18:32
clarkbsimilar to what we did in tripleo cloud18:33
pabelangerwe can try that18:33
pabelangerbut I have a server launching now with my hack18:33
pabelangerbut, it would be good to add --nics support to launch-node18:34
clarkbhappy to use nics too if that works18:34
pabelangerI think we need to do that either way, since I cannot see how network support more than 118:34
mordredclarkb: hang on18:34
AJaegerfungi wrote a change to update zuul-cloner envs for system-config, please review https://review.openstack.org/359352 - this fixes translation sync with constraints.18:34
mordredthis is working fine for me when I'm doing shade in the repl18:35
mordredso gimme a sec to see where it's going south in launch_node18:35
*** amitgandhinz has quit IRC18:35
*** ilyashakhat has joined #openstack-infra18:36
*** amitgandhinz has joined #openstack-infra18:36
mordred Network ['GATEWAY_NET_V6'] is not a valid network in openstackci-osic-cloud1:RegionOne18:36
pabelangerya, that is the error18:36
mordredthat says to me that something is passing GATEWAY_NET_V6 into shade as a list18:36
mordred                    'Network {network} is not a valid network in'18:36
mordred                    ' {cloud}:{region}'.format(18:36
mordred                        network=network,18:37
pabelangerOh, hmm18:37
mordredI don't see anything that would cause such a thing to be true though18:38
mordredpabelanger: are you passing in more command line than I see in that paste?18:38
pabelangermordred: that is likely my fault, I was using a list to see if I could do more than 1 network18:39
mordredah - you cannot18:39
pabelangerya, that's why I switched to nics18:40
mordrednetwork is a convenience setting for if you have a single network you're specifying18:40
mordredcool. that will work much betterer18:40
pabelangerwhen I revert --network works correctly18:40
*** kzaitsev_mb has quit IRC18:41
clarkbwe probably want multi nic support anyways ya?18:41
pabelangerI think so18:41
*** dtantsur|afk has quit IRC18:41
mordredyah - that's just what nics is there for. now, that said - we could also add support to shade to handle network as a list with not much effort18:42
mordredsince the input format of nics is annoying18:42
pabelangerI would not complain about that18:43
*** ddieterly[away] is now known as ddieterly18:44
openstackgerritMonty Taylor proposed openstack-infra/shade: Support more than one network in create_server  https://review.openstack.org/35937818:45
openstackgerritMonty Taylor proposed openstack-infra/shade: Support more than one network in create_server  https://review.openstack.org/35937818:46
mordredsorry, updated docstring18:46
*** zul has quit IRC18:46
mordredI should really make that check to see if any of the entries are already dictlike ...18:46
openstackgerritMonty Taylor proposed openstack-infra/shade: Support more than one network in create_server  https://review.openstack.org/35937818:48
*** _nadya_ has joined #openstack-infra18:48
mordredthere. that's more complete and also now not a syntax error :)18:48
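The shape of "accept one network or a list, and pass through entries that are already dict-like" can be sketched as below. This is an illustration only, not the code in the shade review; `build_nics` and the `resolve_id` callback are hypothetical, and the one assumption about the target format is that each nic is a dict keyed by `net-id`:

```python
def build_nics(network, resolve_id):
    """Turn a network name, a list of names, or nic dicts into a nics list.

    resolve_id is a caller-supplied function mapping a network name to
    its id, returning None when the name is unknown.
    """
    if isinstance(network, str):
        network = [network]  # single name: the convenience case
    nics = []
    for entry in network:
        if isinstance(entry, dict):
            nics.append(entry)  # already in nics form, keep as-is
            continue
        net_id = resolve_id(entry)
        if net_id is None:
            raise ValueError(
                "Network {0} is not a valid network".format(entry))
        nics.append({"net-id": net_id})
    return nics
```

Passing `["GATEWAY_NET_V6", "GATEWAY_NET"]` would yield two nic entries in that order, so the NIC ordering question raised earlier is controlled by the order of the list.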
*** nwkarsten has quit IRC18:48
mordred(you should still hack around this in launch_node for now, because gate length today)18:49
*** nwkarsten has joined #openstack-infra18:49
pabelangerSure, that is no issue18:49
pabelangerbut I'll update launch-node.py to use that now18:49
*** _nadya_ has quit IRC18:52
openstackgerritMarc Aubry proposed openstack-infra/project-config: Add python34-jobs and python35-jobs to Almanach  https://review.openstack.org/35934518:53
*** nwkarste_ has joined #openstack-infra18:54
*** nwkarsten has quit IRC18:54
*** raunak has joined #openstack-infra18:55
*** raunak has quit IRC18:56
*** javeriak has quit IRC18:56
*** dtardivel has quit IRC18:57
*** ilyashakhat has quit IRC18:57
*** markvoelker has joined #openstack-infra18:58
fungiit's that (weekly infra team meeting) time again! find us in #openstack-meeting for the next hour19:00
*** hasharAway is now known as hashar19:00
*** zul has joined #openstack-infra19:01
*** markvoelker has quit IRC19:01
*** Hal has quit IRC19:02
jeblairfungi: if you're on the waiting list, does that mean you have to stay in the beer garden?19:02
*** tqtran has joined #openstack-infra19:04
*** ddieterly is now known as ddieterly[away]19:04
*** piet has joined #openstack-infra19:05
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: Implement non-ovb overcloud update job - Newton -> Newton  https://review.openstack.org/35133019:05
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: WIP - Implement undercloud upgrade job - Mitaka -> Newton  https://review.openstack.org/34699519:06
fungijeblair: that sounds like a compelling argument in favor of procrastination19:06
openstackgerritEmilien Macchi proposed openstack-infra/tripleo-ci: WIP - Implement overcloud upgrade job - Mitaka -> Newton  https://review.openstack.org/32375019:06
*** kushal has quit IRC19:09
*** dizquierdo has joined #openstack-infra19:11
*** tonytan_brb has joined #openstack-infra19:11
*** DrifterZA has quit IRC19:12
*** spzala has joined #openstack-infra19:13
*** ansmith has joined #openstack-infra19:13
*** tonytan4ever has quit IRC19:14
*** Hal has joined #openstack-infra19:15
*** jtomasek has joined #openstack-infra19:18
pabelangernew address for osic-cloud1 mirror19:20
*** jtomasek has quit IRC19:20
*** jtomasek_ has joined #openstack-infra19:20
pabelangerneed to confirm IPv6 security group access19:20
*** pcaruana has quit IRC19:20
*** andymaier_ has joined #openstack-infra19:21
pabelangeractually, cloud_launcher can do that19:22
clarkbpabelanger: I want to say I always open both even if cloud doesnt have v6 for precisely this reason19:22
clarkband if it isnt open already then we should definitely do it that way19:22
irtermiteHeads up... doing SSL renew now19:22
dmsimardIs review.o.o working for you guys ?19:23
pabelangerclarkb: ya, added the rule into cloud_launcher last week, just need to confirm openstackci-osic-cloud1 is there19:24
*** ddieterly[away] is now known as ddieterly19:24
pabelangerirtermite: yup, just had a big spike in errors in osic-cloud119:24
dmsimardUnable to reach review.o.o on my end.. http://paste.openstack.org/show/562542/19:25
irtermitepabelanger: thank you. all good now19:25
irtermitecheck again pabelanger19:26
sdakegerrit seems pokey19:26
sdakeis it just me?19:26
pabelangerirtermite: http://grafana.openstack.org/dashboard/db/nodepool-osic19:26
dmsimardsdake: I'm seeing it too but openstack-infra is in a meeting19:26
pabelangerirtermite: what I was watching19:26
sdakedmsimard roger19:26
*** kaisers_ has joined #openstack-infra19:26
irtermitesmall error... selected the old intermediate with the new cert by mistake. fixed it right away, pabelanger19:26
*** gyee has quit IRC19:27
irtermiteannnnd spike gone, pabelanger19:27
irtermitesorry about that19:27
fungii seem to be getting to gerrit okay, though it is a mite slow19:28
*** salv-orlando has quit IRC19:28
fungimight check http://cacti.openstack.org/ for anything anomalous?19:28
dmsimardirtermite: that spike you mention, was that anything to do with review.o.o ?19:28
rcarrillocruzdmsimard: it was fair, it took a long time for me to load up19:28
*** salv-orlando has joined #openstack-infra19:28
*** hockeynut has quit IRC19:28
pabelangerirtermite: yup, will keep an eye on it19:28
irtermitedmsimard: nope... that was us updating the ssl cert19:29
irtermiteit's better now   >.<19:29
dmsimardirtermite: ok19:29
irtermitedmsimard: accidentally added the wrong intermediate but quickly fixed19:29
*** kaisers_ has quit IRC19:30
irtermitealways safe to blame me when things break here...19:32
*** salv-orlando has quit IRC19:32
*** ewindisch_ has joined #openstack-infra19:35
*** ewindisch has quit IRC19:36
*** ewindisch_ is now known as ewindisch19:36
*** piet has quit IRC19:36
*** esikachev has quit IRC19:38
mordredclarkb, pabelanger: https://review.openstack.org/#/c/358232/ made it through the gate gauntlet and is ready for re-review19:41
mordredShrews: ^^19:41
openstackgerritMarc Aubry proposed openstack-infra/project-config: Add python34-jobs and python35-jobs to Almanach  https://review.openstack.org/35934519:42
*** markusry has quit IRC19:44
*** raunak has joined #openstack-infra19:45
*** markusry has joined #openstack-infra19:45
openstackgerritAndreas Florath proposed openstack/diskimage-builder: Refactor: block-device handling (local loop)  https://review.openstack.org/31959119:45
*** markusry has quit IRC19:45
*** andymaier_ has quit IRC19:46
dougwigfungi: another node that went offline: telnet -6 2001:4800:1ae1:18:f816:3eff:fe0b:a9e0 1988519:47
dougwigfungi: just re-queued, so it might still be around.19:47
fungiif only i weren't chairing a meeting19:47
dougwigfungi: ok, we'll catch another later.19:48
*** Apoorva has quit IRC19:48
*** andymaier_ has joined #openstack-infra19:49
fungiit's still live (288b718b-f453-4f4c-8460-9475f3af02fd in osic) but i'm spacing on the openstack server console syntax (that isn't it apparently)19:49
*** xarses has quit IRC19:50
*** Na3iL has quit IRC19:50
*** andymaier_ has quit IRC19:50
*** xarses has joined #openstack-infra19:51
pabelangerclarkb: next issue with osic-cloud1 mirror replacement, only 1 interface was configured by cloud-init, meaning we are still missing ipv619:51
pabelangerclarkb: will dive into it shortly19:51
clarkbpabelanger: wow fun19:51
*** xarses has quit IRC19:51
*** andymaier_ has joined #openstack-infra19:51
*** xarses has joined #openstack-infra19:52
*** Apoorva has joined #openstack-infra19:52
pabelangerconfig-drive is enabled19:52
clarkbpabelanger: there should be a cloud init log somewhere iirc19:52
clarkbpabelanger: maybe that will say what it did and why19:53
mordredclarkb, pabelanger: I think the stock ubuntu images are only configured with one nic listening to dhcp19:53
*** asettle has joined #openstack-infra19:53
mordredand I don't think cloud-init is involved19:54
mordredI'm pretty sure if you just add another auto dhcp line to /etc/network/interfaces19:54
mordredand do an ifup eth119:54
mordredit _should_ work19:54
clarkbwell it shouldnt dhcp there19:54
clarkbI dont think19:54
russellbbrad_behle: we just compile the ovs kernel module from the ovs tree for networking-ovn jobs.  are you saying that's not good enough for some cases?19:54
clarkbprobably just need to ifup it19:54
clarkbbecause ipv6 magic19:55
krotscheckAny cores around to assist with a few npm-based build updates? Adding docs builds, devstack, etc. https://review.openstack.org/#/q/topic:npm+status:open19:55
mordredclarkb: ++19:55
mordredclarkb: yah - whatever the magic is - I mostly think we just need to enable the nic19:55
brad_behlerussellb: From what I understand, some of the SNAT and Floating IP work for OVN requires some very recent linux kernel enhancements (I think in conntrack)19:56
russellbbrad_behle: yes, but all of that is backported into the ovs kernel module from the ovs git tree19:56
pabelangermordred: clarkb: Ya, adding configuration for it works. So, we'll need to add this to system-config19:56
mordredpabelanger: cool19:56
mordredpabelanger: I support adding such config to system-config19:57
*** markvoelker has joined #openstack-infra19:57
pabelangerokay, let me get the volume going again19:57
krotscheckAJaeger: Thanks :D19:57
russellbbrad_behle: https://github.com/openvswitch/ovs/blob/master/FAQ.md ... see "Q: Are all features available with all datapaths?" and the "Linux OVS tree" column19:57
*** ddieterly is now known as ddieterly[away]19:58
*** asettle has quit IRC19:58
russellbclarkb: i think i caught a question about vxlan and ipv6 in scrollback a couple days ago?  the link above covers that same thing -- vxlan ipv6 is available for ovs in 4.3+ or if you compile the ovs kernel module from the ovs tree (compatible back to 3.11 or something)19:58
clarkbrussellb: so if the job builds the module and loads it is good?19:58
russellbclarkb: yes19:59
russellbthat's what all OVN jobs do today19:59
clarkbrussellb: ya we dont have ovs nearly that new19:59
* russellb nods19:59
*** amotoki has quit IRC20:00
fungiokay, so it's `openstack console log show` apparently, not namespaced under the server subcommand tree?20:01
clarkbrussellb: mostly just mind boggled ipv6 is such a second class citizen with newish networking tools20:01
*** xarses_ has joined #openstack-infra20:01
dhellmannfungi : regarding your scheduling question; a maintenance window the day after milestone 3 feels a bit tight. I guess we're running out of good times at this point in the cycle, though.20:02
fungidhellmann: so on the scheduling... do you feel like there'll be spillover into friday for the milestone releases?20:02
mordredclarkb: perhaps we should get newer ovs?20:02
*** dizquierdo has quit IRC20:02
dhellmannfungi : possibly in the morning us eastern if we have late tag submissions the day before20:02
clarkbmordred: that seems like a lot of work for all the distros20:02
mordredclarkb: oh. right. nevermind20:02
clarkbmordred: since we need it for the underlying multinode network stuff20:02
mordredclarkb: let's not do that20:02
*** ansmith has quit IRC20:02
mordredI rescind my thought20:03
russellbcompile it from master!20:03
*** xarses has quit IRC20:03
russellbwhat could go wrong20:03
fungidhellmann: if it helps, we're scheduling the window for a much longer period than we expect to actually need (this is our first time trying online reindexing in production), but we're also talking about starting the work at 18:00 utc20:03
mordredclarkb: https://review.openstack.org/#/c/358232/ nudge20:03
pabelangermirror.regionone.osic-cloud1.openstack.org replacement online, both with ipv4 / ipv620:04
pabelangerwill be updating DNS in a minute20:04
mordredpabelanger: woot!20:04
irtermitewoot pabelanger !20:04
ianwfungi | anyone : here's launch-node.py atm -> http://paste.openstack.org/show/562545/ ... that node comes up, has an ipv6 address in interfaces but it's not on eth020:04
irtermiteuh, jinx, mordred ?20:04
dhellmannfungi : hmm, that's around noon US eastern20:04
mordredianw: looking20:04
fungidhellmann: 2pm us eastern according to my clock?20:04
* dhellmann looks again20:05
dhellmannoh, 18 not 1620:05
rcarrillocruzwell done pabelanger ;-)20:05
brad_behlerussellb: Okay, I'll take a look at that FAQ.  I'm in the process of building the ubuntu-trusty image that the gate uses, I plan to use that image to run the latest tests, to see if any fail due to needing an updated kernel20:05
dhellmannfungi : ok, that's a little better. let me confer with my veteran expert, but I think if we end up with a late tag we can wait and apply it monday20:05
brad_behlerussellb, that would be great if we didn't require anything extra from the base image20:05
dhellmannfungi : I'll track down ttx here at the conference and see what he thinks and let you know20:06
fungidhellmann: we can also delay/abort if it looks like release work is still underway. it's not terribly critical that we do it next week, it just seemed like one of the sooner possibilities20:06
dhellmannfungi : ok, let's say a tentative yes for now20:06
fungithanks dhellmann!20:06
russellbbrad_behle: OK, let me know what you find, i'm not aware of any reason we need a custom image20:06
brad_behlerussellb: Will do, thanks!20:06
pabelangerand DNS update20:06
pabelangeripv6 traffic already20:07
mordredianw: that seems like a potential bug in the rackspace nova-agent or something, and should be solvable by rebooting?20:07
*** ilyashakhat has joined #openstack-infra20:07
ianwmordred: yes, or just ifdown && ifup ... but that doesn't really help finish launch-node.py :)20:07
ianwi mean, i could easily add that, if we just think it's something silly upstream & not my fault20:07
pabelanger#status log mirror.regionone.osic-cloud1.openstack.org upgraded to support both ipv4 / ipv6. DNS has also been updated.20:07
openstackstatuspabelanger: finished logging20:07
openstackgerritMerged openstack-infra/project-config: Created DSVM Job for NPM Projects  https://review.openstack.org/34805620:08
mordredianw: I'm not sure it's a thing launch-node is doing though :(20:08
mordredianw: I mean, we could put in a launch-node "ifdown && ifup" ...20:08
ianwmordred: yeah, that's what i mean i could do, if it's not user-error on my behalf20:09
mordredianw: I do not believe it is20:09
mordredianw: I expect the things you did to result in a working in-guest network20:09
ianwyeah, i'm surprised nobody has noticed their rax vm's not booting with a working ipv6 address20:10
*** esikachev has joined #openstack-infra20:10
mordredthis does seem to be new behavior20:10
mordredthe last rax servers we spun up did not exhibit this to my knowledge20:11
*** bhunter71 has quit IRC20:12
*** e0ne has quit IRC20:12
*** edmondsw has quit IRC20:13
rcarrillocruzianw: yeah, that's new20:13
rcarrillocruzcos i created firehose not that long ago20:13
rcarrillocruzand i do remember running ipv6 dns commands20:13
phschwartzjeblair: you around? I am getting a zuul error and wanted to see if you have seen it http://paste.openstack.org/show/562547/20:15
phschwartzjeblair: been happening since early this morning. It has our whole set of queues blocked20:15
ianwmordred | rcarrillocruz : ok, trying with an ifdown && ifup in there20:15
openstackgerritJames E. Blair proposed openstack-infra/zuul: Re-enable the shared/independent queue test  https://review.openstack.org/35801020:17
jeblairphschwartz: looking20:17
phschwartzjeblair: I have never seen working projects start to throw project.name doesn't exist out of nowhere. We are tracking master in this env so not sure if anything changed, figured you would have the quickest insight20:18
phschwartzjeblair: and to note, whatever is causing this is causing zuul to push the cpu to 100% and eat almost all ram20:20
*** esikachev has quit IRC20:21
*** ddieterly[away] is now known as ddieterly20:21
*** ansmith has joined #openstack-infra20:21
dhellmannfungi : I've talked with ttx, and we think we'll be ok with that gerrit maintenance window. We'll make the final call on any late tags earlier that day at the release team meeting and should have time to process them all before you start.20:22
*** kgiusti has quit IRC20:22
jeblairphschwartz: that doesn't ring a bell.  has anything changed in your gerrit or zuul configuration recently?20:24
*** e0ne has joined #openstack-infra20:25
AJaegerinfra-root, fungi wrote a change to update zuul-cloner envs for system-config, please review https://review.openstack.org/359352 - this fixes translation sync with constraints on the proposal slave.20:26
fungidhellmann: ttx: perfect. as i said, let us know that day if release work is still underway and we can delay starting or reschedule as needed20:26
dhellmannfungi : yep, I'll leave myself a note to do that20:26
fungidhellmann: i'll try to remember to check in with #openstack-release before we start as well20:27
sdakepabelanger - say I had to rebase https://review.openstack.org/#/c/349278/20:27
sdakeand your +2 fell off20:27
sdakeany chance you could reapply it20:27
*** hockeynut has joined #openstack-infra20:27
mordredclarkb, Shrews: while I've got you: https://review.openstack.org/#/c/359378/ and https://review.openstack.org/#/c/315697 are ready to fly (trying to burn down shade patches in the outstanding queue as quickly as the gate gives a green)20:28
sdakemordred - 12 pages of patches to review!!20:28
sdake7 days to do it in20:28
sdakemission impossible?20:28
ianwmordred: ok, new error, from ansible ... -> http://paste.openstack.org/show/562549/20:28
phschwartzjeblair: one project has been added, but it was working after it was.20:28
*** sigmavirus is now known as sigmavirus|away20:29
jeblairphschwartz: did you reconfigure zuul after adding the project?20:29
*** NobodyCam has quit IRC20:29
phschwartzjeblair: yeah, it auto reconfigures every 15 min20:29
pabelangerianw: make sure you run launch-node.py as root, there is a bug where we don't copy hieradata properly20:29
*** NobodyCam has joined #openstack-infra20:30
phschwartzjeblair: only change since then could have been a new pull of master of zuul20:30
ianwpabelanger: ok ... should we add that to readme?20:30
jeblairphschwartz: you restarted zuul?20:30
fungiour beaker jobs for centos-7 seem to have started failing with a bundler error... anybody already looking into it? http://logs.openstack.org/94/358194/3/check/gate-openstackci-beaker-centos-7/5f54990/console.html#_2016-08-23_18_56_02_29750520:30
pabelangerianw: we do have a patch to fix it, it hasn't landed yet20:30
jeblairpabelanger: how about we land it?20:31
phschwartzjeblair: looks like it was restarted last night. issues started after that20:31
phschwartzjeblair: also seeing http://paste.openstack.org/show/562550/ in the logs. Just noticed it20:31
ianwyeah, i'd prefer that than running as root ... too much chance i destroy something else :)20:31
phschwartzI might move us to a pin of the last stable release of zuul20:31
*** coolsvap has quit IRC20:31
pabelangerianw: I think the issue you are having with ansible-playbook has been fixed in shade, but we don't have a new release yet20:32
anteayafungi: it looks to me like the gem rubocop-rspec failed to download: http://logs.openstack.org/94/358194/3/check/gate-openstackci-beaker-centos-7/5f54990/console.html#_2016-08-23_18_56_01_97606720:33
mordredpabelanger: getting closer20:33
mordredwe're on the final burn-down list :)20:33
*** itisha has joined #openstack-infra20:33
*** pt_15 has joined #openstack-infra20:33
fungianteaya: indeed. i wonder if something's wrong with a very recent release of that gem or something20:34
pabelangerianw: http://paste.openstack.org/show/562551/ should work around the failure20:34
openstackgerritIan Wienand proposed openstack-infra/system-config: launch-node.py: restart interface  https://review.openstack.org/35941620:34
openstackgerritIan Wienand proposed openstack-infra/system-config: launch-node.py : save key when failing early  https://review.openstack.org/35941720:34
jeblairphschwartz: it might be helpful to know the sequence of relevant changes to the configuration of gerrit and zuul that preceded the first occurrences of those errors, as well as more log context around them.20:34
anteayafungi: it was released Aug 320:35
anteayasurely we have run beaker tests since that20:35
jeblairphschwartz: basically, they are both errors that should never happen, so in order to track them down, we'll need a lot more data20:35
fungianteaya: hrm... almost three weeks ago20:35
fungianteaya: yeah, the job was passing earlier today20:35
anteayaor maybe not: rubocop-rspec requires Ruby version >= 2.2.020:35
anteayawe are ruby 1.9 are we not?20:35
anteayaor did we move to ruby 2+20:35
fungianteaya: on centos 7? not sure20:35
openstackgerritIan Wienand proposed openstack-infra/system-config: launch-node.py: save key when failing early  https://review.openstack.org/35941720:36
fungianteaya: there's a warning further up about "Rubygems 2.0.14 is not threadsafe" so i'm guessing it's that20:36
phschwartzjeblair: what could I provide to get some help. I am stuck and our zuul is too. lol20:36
anteayafungi: yeah I'm just seeing that20:36
anteayanot sure how we could have been passing tests for the last 3 weeks if that is the error yet failing today20:37
mordredphschwartz: did you see: 20:34:36          jeblair | phschwartz: it might be helpful to know the sequence of relevant changes to the configuration of gerrit and zuul that preceded the first occurrences of those errors, as well as more log20:37
mordred                          | context around them.20:37
*** gyee has joined #openstack-infra20:37
*** csomerville has quit IRC20:37
*** e0ne has quit IRC20:37
phschwartzmordred: no, didn't get that log line. yay irc that hates me20:38
anteayain any case, my read is that upgrading to ruby 2.2 or downgrading rubocop-rspec gem to maybe 1.5.1 is the route20:38
mordredphschwartz: there was also: 20:35:15          jeblair | phschwartz: basically, they are both errors that should never happen, so in order to track them down, we'll need a lot more data20:38
ianwpabelanger: ok, trying again ...20:38
phschwartzjeblair: so, no changes to gerrit have been made. the only changes to zuul were moving a job from being a single job called twice to having a passed suffix on the jjb template, and then moving the job from experimental to being in the check queue20:39
pabelangeranteaya: fungi: I believe EmilienM is also fighting that fire now too.20:39
anteayaEmilienM: awesome20:39
jeblairphschwartz: you said you added a project to gerrit?20:39
phschwartzjeblair: not gerrit, to zuul20:39
anteayaI can't find the version notes on the gems so I have no idea why 3 versions were released the first week of august20:40
* AJaeger waves good night20:40
anteayaAJaeger: good night20:40
EmilienMwe're doing https://review.openstack.org/#/c/359385/20:40
anteayaAJaeger: thank you for a great day20:40
phschwartzjeblair: pm'ed you 2 links20:40
jeblairphschwartz: can you paste a much longer log segment that preceded the first error?  and let me know what the names of the jobs and projects you changed are?20:40
AJaegerthanks, anteaya ! Enjoy the rest of the day!20:40
EmilienManteaya: you need to do the same for https://github.com/openstack-infra/puppet-openstack_infra_spec_helper20:41
phschwartzjeblair: let me get that paste for you20:41
EmilienMnibalizer: fyi ^20:41
anteayaAJaeger: thank you20:41
anteayaEmilienM: thank you20:41
ianwpabelanger: same issue20:41
EmilienMjeblair: an FYI about https://review.openstack.org/#/c/356675/ -- I would like you to look when you have time, in the commit message you'll find out a nice feature we could have in zuul later20:42
nibalizerEmilienM: whats up?20:42
EmilienMnibalizer: all puppet syntax jobs fail, https://tickets.puppetlabs.com/browse/MODULES-377620:42
anteayanibalizer: we need https://review.openstack.org/#/c/359385/1 in infra spec helper20:43
EmilienMnibalizer: we're solving the problem with https://review.openstack.org/#/c/359385/ for now, you'll need to do the same for infra modules20:43
EmilienMnibalizer: not sure all infra modules use infra spec helper though20:43
ianwalso, why so much ... [WARNING]: log file at /var/log/ansible.log is not writeable and we cannot create it, aborting20:43
jeblairEmilienM: that will be easier in zuulv3 :)20:43
pabelangerianw: log?20:43
pabelangerianw: file permissions issues20:43
ianwpabelanger: yeah ... but seems like we should not be logging to that?20:44
phschwartzjeblair: http://paste.openstack.org/show/562552/ http://paste.openstack.org/show/562553/20:44
phschwartzjeblair: large chunk of both debug.log and zuul.log20:44
EmilienMjeblair: great, please comment then, I'm curious how :)20:44
*** ansmith has quit IRC20:45
ianwpabelanger: same as in http://paste.openstack.org/show/562549/ ...error while accessing the file /etc/puppet/hieradata/production/common.yaml20:45
nibalizerEmilienM: oh hahhahah rip20:45
jeblairphschwartz: thanks -- can you grab a chunk of lines before the first instance of those errors?20:45
*** raildo has quit IRC20:46
nibalizerjesusaur: do we still need https://review.openstack.org/#/c/350835/ ?20:46
pabelangerianw: right, we need to land https://review.openstack.org/#/c/32664920:47
*** priteau has joined #openstack-infra20:47
pabelangerianw: that fixes the first problem20:48
phschwartzjeblair: let me dig, there are thousands of those errors. 10-15 going in a min20:48
pabelangerianw: the patch I linked to you, you might not need20:48
jesusaurnibalizer: I'm not sure why that affected gozer and not infra, but it doesn't seem like there have been any issues with that upstream20:48
jeblairEmilienM: overrides of job attributes via project-local job-specification -- http://specs.openstack.org/openstack-infra/infra-specs/specs/zuulv3.html#jobs20:48
EmilienMjeblair: that's awesome :)20:48
openstackgerritSpencer Krum proposed openstack-infra/puppet-openstack_infra_spec_helper: Pin puppetlabs-spec-helper  https://review.openstack.org/35942120:49
*** kzaitsev_mb has joined #openstack-infra20:49
nibalizerjesusaur: EmilienM   https://review.openstack.org/35942120:49
EmilienMnibalizer: +120:49
ianwpabelanger: ok, cool, let's start with that then ...20:49
jesusaurnibalizer: also today there was a rubocop upgrade that now requires ruby ~> 2.020:50
*** markvoelker has quit IRC20:52
phschwartzjeblair: It will be a couple more min. There are 400k lines of that error in the log20:52
jeblairphschwartz: you will probably need to restart zuul to fix this20:53
*** ilyashakhat has quit IRC20:53
openstackgerritMerged openstack-infra/shade: Add support for fetching console logs from servers  https://review.openstack.org/35823220:54
*** eggshell has joined #openstack-infra20:54
*** eggshell has quit IRC20:54
*** eggshell has joined #openstack-infra20:55
openstackgerritEmilien Macchi proposed openstack-infra/project-config: TripleO scenario001 experimental job  https://review.openstack.org/35667520:55
fungithanks EmilienM!20:55
EmilienMfungi: well, all credits go to mwhahaha (Alex)20:55
fungithanks, mwhahaha!20:56
anteayayeah should we credit mwhahaha on that patch commit message, nibalizer20:56
fungiand nibalizer!20:56
ianwpabelanger: so we set log_path in /etc/ansible/ansible.cfg -- perhaps we shouldn't and just let syslog handle it -> http://docs.ansible.com/ansible/intro_configuration.html#log-path20:56
ianwor open permissions on /var/log/ansible.log, that seems bad20:57
nibalizeranteaya: ok20:57
anteayanibalizer: thank you20:57
*** gouthamr has quit IRC20:58
pabelangerianw: or have launch-node.py handle it.20:58
*** jkilpatr has quit IRC20:58
mordredclarkb: how's your upper-constraints/devstack zen?20:58
openstackgerritSpencer Krum proposed openstack-infra/puppet-openstack_infra_spec_helper: Pin puppetlabs-spec-helper  https://review.openstack.org/35942120:58
anteayacan constraints and zen be used in the same sentence?20:58
pabelangerianw: we can add it to JobDir(), thats how we do it with zuul-launcher20:58
*** xarses_ is now known as xarses20:59
*** ccarmack has joined #openstack-infra21:00
*** jswarren has joined #openstack-infra21:00
dougwigfungi: just reset again, but missed grabbing the link. and given the overall runtime of most of the check queue, i think there's something weird going on.21:01
*** amotoki has joined #openstack-infra21:01
jeblairdougwig: i'm not following21:01
*** rfolco has quit IRC21:01
*** tonytan_brb has quit IRC21:01
pabelangerdougwig: it is likely ansible losing connection with the node the job runs on21:02
*** asettle has joined #openstack-infra21:02
rm_workpabelanger: well it makes sense that ansible would lose connection, because the nodes are becoming totally unconnectable21:02
pabelangerthere is some logic in zuul-launcher to requeue the job if network is unavailable21:02
rm_workthat's the issue we're seeing21:02
dougwigjeblair: dsvm nodes seem to be resetting intermittently, only seen so far on nodes with ipv6 addresses. instead of a runtime of about an hour, we're at nearly 3 and counting.  and this one review isn't alone.21:02
dougwigjeblair: i've watched one job almost "finish" and reset three times now.21:02
rm_workbasically the nodes become completely unresponsive, thus causing a reset21:02
rm_workit's happening on more than just our change, I think21:03
ianwpabelanger: that makes sense, i'll do that21:03
jeblairdougwig: what jobs are involved?21:03
*** jcoufal has quit IRC21:03
dougwigjeblair: i have to step away, but rm_work can give details.  back in 30.21:04
rm_workI'm looking for other examples21:04
rm_workso far i've seen it happen specifically on the octavia gate with:21:04
openstackgerritMonty Taylor proposed openstack-infra/nodepool: Directly use pip instead of setup_develop in plugin  https://review.openstack.org/35942521:05
mordredclarkb, jeblair, pabelanger: ^^ sigh - getting caught by devstack's defaulting to using upper-constraints in our shade-nodepool job21:05
anteayarm_work: is it just affecting octavia patches?21:05
rm_workbut I think it's probably doing it in other projects/CRs as well21:05
rm_workbecause I have seen others have jobs reset21:06
phschwartzjeblair: Here is one around the first error http://paste.openstack.org/show/562556/21:06
rm_workand look at the queue length at the moment...21:06
*** amotoki has quit IRC21:06
phschwartzjeblair: working on second21:06
mordredShrews: you may also want to look at that, although you may not21:06
anteayarm_work: can you recall which projects?21:06
rm_worki'm looking for other examples at the moment21:06
anteayarm_work: thanks21:06
jeblairrm_work: the queue length is being reduced at the rate i would expect21:06
pabelangerjeblair: I am seeing a lot of exit code 3 on zl01: http://paste.openstack.org/show/562557/21:06
anteayarm_work: having the check queue long the week prior to feature freeze is expected21:06
rm_workthere's a good number of CRs with 6+h runtimes, where some jobs are JUST starting21:07
rm_workwhich to me indicates they were reset21:07
jeblairrm_work: that is likely yes21:07
rm_worksimilar to what we were seeing happen to us21:07
rm_workthat's what i mean by "look at the length of the queue"21:07
rm_workbut give me a moment, i'm manually searching for cases21:07
openstackgerritMonty Taylor proposed openstack-infra/shade: Support dual-stack neutron networks  https://review.openstack.org/35751721:07
* mordred cries21:08
openstackgerritJohn L. Villalovos proposed openstack-dev/hacking: Add documentation about off-by-default options  https://review.openstack.org/35942721:08
*** psilvad has quit IRC21:08
rm_workI think this is one: telnet 2001:4800:1ae1:18:f816:3eff:fe64:b2bf 1988521:08
rm_workthat's from a tempest CR21:08
jeblairrm_work: okay, i understand what you mean now and agree -- it's just that 'length of the queue' did not convey that to me, as the length is nominally what i would expect and decreasing at a reasonable rate.  but we can move on.  :)21:09
rm_workright yeah, realized it was ambiguous21:09
clarkbmordred: and use pip -e just to maintain compat with the old develop stuff?21:10
jeblairrm_work, pabelanger: so there's likely something in those jobs that borks the (ipv6?) network.  it might help to get a full list of the affected jobs and triangulate that way.21:10
openstackgerritMichael Krotscheck proposed openstack-infra/project-config: Removed directory changes in npm-dsvm-macro  https://review.openstack.org/35942821:10
jeblairpabelanger: your grep for exit codes and x-referencing by node id to find what job ran might help.21:10
pabelangerjeblair: yes, I am going to try and find if there is a pattern21:10
mordredclarkb: yah21:10
rm_workjeblair: those jobs have passed in previous and later runs21:10
rm_workjeblair: and some of them wouldn't even be touching the network21:10
*** _ari_ has quit IRC21:10
jeblairrm_work: well, every dsvm job touches the netwerk, right?21:10
clarkbjeblair: pabelanger are they running on trusty or xenial? we may not have the ipv6 private stuff in trusty yet21:11
jeblairclarkb: http://paste.openstack.org/show/562557/ says both21:11
rm_worki guess technically, but minimally and in a way that's been pretty well tested?21:11
rm_workso let me see21:11
phschwartzjeblair: I think nibalizer tracked it down. Looks like we have reviews with a depends-on another review that is not in zuul so zuul is bombing on it21:11
rm_workif there are any non-dsvm21:11
jeblairphschwartz: that case should be covered21:11
rm_workyeah ok, it is quite possible it's DSVM jobs only21:12
phschwartzjeblair: hmm, it seems to get that error in the one I just sent every .002 seconds which causes cpu and mem usage to spike21:12
mordredclarkb: I mean - we'll see if it works21:12
*** julim has quit IRC21:12
rm_workso maybe you are correct that it's something with the dsvm process hosing the network, but I don't think it has anything to do with the specific code in the CRs21:12
rm_workjeblair: ^^21:12
*** michauds has joined #openstack-infra21:13
mordredclarkb: this is what failure looks like: http://logs.openstack.org/17/357517/5/check/gate-dsvm-nodepool-src-shade/dbbbdfc/logs/screen-nodepool.txt.gz21:13
mordredclarkb: error of not having new enough os-client-config, even though the requirements.txt file has it21:13
mordredSO - if we don't get that error, then the nodepool change works :)21:13
clarkbmordred: ya I think that your change should fix that IF the plugin is evaluated after everything else installing occ21:14
clarkbmordred: I do not know if that is the case but +2 for now and the self testing should find out :)21:14
*** bhunter71 has joined #openstack-infra21:15
jeblairphschwartz: can you provide, say, a few hundred more lines of logs before that point?21:15
*** ldnunes has quit IRC21:15
*** kaisers_ has joined #openstack-infra21:15
*** david-lyle has quit IRC21:15
*** spzala has quit IRC21:16
*** dizquierdo has joined #openstack-infra21:16
*** esikachev has joined #openstack-infra21:17
*** spzala has joined #openstack-infra21:17
*** ddieterly is now known as ddieterly[away]21:17
*** ddieterly[away] is now known as ddieterly21:17
openstackgerritJulia Kreger proposed openstack-infra/glean: Add logging around interface carrier detection  https://review.openstack.org/35943021:18
*** david-lyle has joined #openstack-infra21:18
openstackgerritVasyl Saienko proposed openstack-infra/project-config: Switch ironic-multinode job to wholedisk agent_ssh  https://review.openstack.org/35943121:19
clarkbit's also worth noting that I saw similar behavior with ansible from puppetmaster to other dfw hosts21:20
rm_workjeblair / fungi: it SEEMS like it's always the same cloud(s) that fails on the dsvm jobs and requeues21:20
*** kaisers_ has quit IRC21:20
rm_workit's one of the one(s?) that is ipv6 only21:20
clarkbpabelanger: ^ that is what prompted me to switch how the ansible launchers restart playbook worked21:20
clarkband there is ipv6 in rax too21:20
*** dizquierdo has quit IRC21:21
clarkbpabelanger: and you reported it was rock solid from home which was ipv4 only at the time (I am guessing based on your talk of new HE tunnel)21:21
*** dimtruck is now known as zz_dimtruck21:21
clarkbrm_work: they are all clouds that have ipv621:21
clarkboh wait no there is a bluebox in there which is ipv4 only21:21
rm_workclarkb: I mean *only* ipv621:21
*** spzala has quit IRC21:21
*** esikachev has quit IRC21:21
rm_workbecause if it supports both, you display the ipv4 link for telnet, right?21:21
pabelangerclarkb: Ah, right21:22
clarkbrm_work: yes I understood you but there is more than osic with the error and osic is our only ipv6 only cloud21:22
rm_workah i've only seen it happen with the ipv6 telnet'd jobs21:22
clarkbrm_work: see jeblair's paste above21:22
rm_workyou have a better way of identifying the affected nodes?21:22
jeblair(it's pabelanger's paste ftr :)21:22
pabelangerI should have a sample of jobs from zl01 in a few minutes21:22
rm_workwasn't sure if the exit-code-3 thing was related21:23
pabelangerA lot of dsvm job21:23
*** hashar has quit IRC21:23
*** zz_dimtruck is now known as dimtruck21:23
rm_workcool, so you're probably close (at least you can identify the nodes with issues)21:23
jeblairit's identifying all network errors though -- there could be multiple underlying causes21:23
*** matt-borland has quit IRC21:23
clarkbjeblair: good point21:23
rm_worki'll leave it to you guys i guess then, you seem to have it covered21:23
rm_workbut i'll be around for a while21:24
jeblairrm_work: i won't be fixing this :)21:24
*** pradk has quit IRC21:24
rm_workwhere do you think the problem is?21:24
jeblairrm_work: the assumption i'm working from is that a class of devstack jobs borks the network on nodes that only have ipv621:24
openstackgerritMerged openstack-infra/puppet-openstack_infra_spec_helper: Pin puppetlabs-spec-helper  https://review.openstack.org/35942121:25
*** ccarmack has left #openstack-infra21:25
*** cody-somerville has joined #openstack-infra21:25
rm_workyeah, seems right to me21:25
clarkbmight be worth filtering out the jobs affected21:25
clarkbeg is it always with neutron and never with nova network etc21:25
rm_workthat paste does indicate it's PRIMARILY osic nodes, the non-osic nodes showing up may be anomalies unrelated21:25
rm_workI saw one with *tempest*21:25
jeblairi think the next step is to collect the list of those jobs (pabelanger is doing this) so we can look for a pattern and maybe take a guess as to what's going wrong21:25
clarkbyup ++21:26
*** csomerville has joined #openstack-infra21:26
*** esikachev has joined #openstack-infra21:26
rm_workah, yeah21:26
pabelangerlist of failures for today from zl0121:26
rm_workall i can easily do is manually look around the zuul status page, not so helpful :P21:26
rm_workah cool21:26
*** dprince has quit IRC21:26
ianwdo we know about this error -> Bundler::GitError: The git source https://git.openstack.org/openstack-infra/puppet-openstack_infra_spec_helper is not yet checked out. Please run `bundle install` before trying to start your application21:26
anteayaianw: yes21:27
anteayathe fix is in the gate21:27
ianwanteaya: cool, thanks21:27
anteayasorry merged: https://review.openstack.org/#/c/359421/21:27
*** eharney has quit IRC21:27
jeblairphschwartz: can you ack or nak my request?21:27
clarkbjust a quick scan of that list looks like its predominantly tests that use neutron21:27
pabelangerclarkb: jeblair: need to step away for family time, but will poke more at it later tonight21:28
ianwahh, ok just missed it.  will recheck21:28
clarkbneutron itself, ironic, nodepool, etc all use neutron21:28
phschwartzjeblair: sorry was restarting zuul. Will grab from the log21:28
rm_workclarkb: i was seeing that too but i think it's just an issue of percentage of all tests that use neutron skews things :P21:28
jeblairphschwartz: thx21:28
rm_workah, yeah i mean almost anything in openstack uses neutron in some capacity <_<21:28
clarkbrm_work: I think we still have a higher percentage that use nova net fwiw21:29
clarkbrm_work: so I don't know that that is the case21:29
rm_workthat's surprising21:29
clarkbrm_work: yes because it is/was the default21:29
clarkband is/was more reliable21:29
rm_workaren't they actually finally *deleting that code* in the next cycle or something? lol21:29
*** cody-somerville has quit IRC21:29
rm_workguess maybe not >_>21:29
*** Swami has quit IRC21:29
clarkbrm_work: they only just got things switched in the last couple weeks21:30
anteayaianw: sounds good21:30
clarkbthis very problem could be related :P I am trying to see if any of those in pabelanger's list are nova net jobs to rule it out21:30
zigopabelanger: https://review.openstack.org/#/c/358819/ <--- Could you +2 adding deb-python-fixtures again please?21:30
*** hockeynut has quit IRC21:31
rm_workagain, i should just let you guys handle this probably :P i'm just distracting21:31
clarkboh we have kolla jobs in there which don't devstack21:31
jeblairmordred: http://logs.openstack.org/25/359425/1/check/gate-dsvm-nodepool/4896a4b/logs/devstacklog.txt.gz#_2016-08-23_21_29_04_118  needs a sudo?21:31
mordredjeblair: yah. thanks21:32
clarkbso probably not related to that unless they do similar things21:32
*** salv-orlando has joined #openstack-infra21:32
michaudsIs this the proper channel to report an issue with gerrit?21:32
clarkbmichauds: yes21:32
openstackgerritMonty Taylor proposed openstack-infra/nodepool: Directly use pip instead of setup_develop in plugin  https://review.openstack.org/35942521:32
phschwartzjeblair: http://paste.openstack.org/show/562758/21:33
clarkbjeblair: pabelanger and this error means the ssh poll for async ansible failed to connect right?21:33
jeblairclarkb: yes it's "AnsibleHostUnreachable"21:34
michaudsclarkb: I can't seem to POST data to https://review.openstack.org/gerrit_ui/rpc/AccountSecurity to update my offline contact information.21:34
*** gouthamr has joined #openstack-infra21:34
*** piet has joined #openstack-infra21:35
clarkbmichauds: make sure that you have set up your accounts properly. Did you follow http://docs.openstack.org/infra/manual/developers.html#account-setup to set that up?21:35
*** jheroux has quit IRC21:35
rm_workclarkb: the kolla jobs I see are "dsvm" jobs though?21:36
jeblairphschwartz: it looks like there may be lines missing between those 2 pastes?21:36
clarkbrm_work: I don't think they run devstack, they just use dsvm for supporting setup21:36
jeblairphschwartz: maybe there's a limit for how many lines in a paste?21:36
jeblairphschwartz: or overall size i guess21:37
phschwartzjeblair: not sure I did 1k lines starting from the last line of the first AttributeError21:37
clarkbspot checking a rax xenial host we have the correct ipv6 tempaddr value of 021:37
michaudsclarkb: yes, I have a Launchpad account and am also part of Openstack Foundation :)21:38
michaudsclarkb: Oh I need to upgrade to Foundation Member21:40
*** vhosakot has quit IRC21:40
*** vhosakot has joined #openstack-infra21:41
dougwigrm_work, jeblair, fungi: back.21:41
openstackgerritVasyl Saienko proposed openstack-infra/project-config: Switch ironic-multinode job to wholedisk agent_ssh  https://review.openstack.org/35943121:42
*** Goneri has quit IRC21:43
*** thorst has quit IRC21:44
*** sdake has quit IRC21:45
*** asettle has quit IRC21:45
*** vhosakot has quit IRC21:45
*** sdake has joined #openstack-infra21:45
*** sdake has quit IRC21:45
*** sdake has joined #openstack-infra21:46
*** thorst has joined #openstack-infra21:47
*** thiagop has quit IRC21:47
*** eranrom has quit IRC21:47
*** eranrom has joined #openstack-infra21:48
*** larainema has quit IRC21:49
clarkbok sitting with greghaynes and we seem to lose the most time doing the mv of the image from chroot to the bind mounted image dest21:49
*** piet has quit IRC21:49
clarkbhe thinks what is happening is the apt cache never gets cleaned up so we are copying more and more and more files21:50
clarkbalso possibly we need to pack/gc our git repo cache21:50
clarkbso the dib caching needs curating21:50
*** dfflanders has joined #openstack-infra21:50
*** vhosakot has joined #openstack-infra21:51
clarkbwith that understood back to ansible network fails21:51
zigopabelanger: I'm having a very hard time to build webkit2gtk, which is needed to build sphinx. It seems it takes a huge amount of time to build. I wonder if we could just "cheat" here, and just download it from official jessie-backports. There shouldn't be many packages like that, just this one, hopefully. Your thoughts?21:51
*** thorst has quit IRC21:51
*** esikachev has quit IRC21:53
*** spzala has joined #openstack-infra21:53
*** csomerville has quit IRC21:54
*** ddieterly is now known as ddieterly[away]21:57
*** mariojv has joined #openstack-infra21:59
*** eggshell has quit IRC22:00
*** piet has joined #openstack-infra22:01
*** tonytan4ever has joined #openstack-infra22:02
*** amotoki has joined #openstack-infra22:02
*** xarses has quit IRC22:03
*** mariojv has left #openstack-infra22:04
*** priteau has quit IRC22:05
*** Gorian|work has joined #openstack-infra22:06
*** tonytan4ever has quit IRC22:07
*** amotoki has quit IRC22:07
*** vhosakot has quit IRC22:09
*** ddieterly[away] is now known as ddieterly22:09
*** notmorgan is now known as morganfainberg22:09
*** morganfainberg is now known as morgan22:09
*** morgan is now known as notmorgan22:11
*** nwkarsten has joined #openstack-infra22:12
*** nwkarsten has quit IRC22:12
*** nwkarsten has joined #openstack-infra22:12
dougwigproject-config and devstack-gate cores, lbaas v1 delete is just about ready to merge, but will break these two things (nodepool default service list, and devstack-gate default features): https://review.openstack.org/#/c/358257/ https://review.openstack.org/#/c/358258/22:13
dougwigboth still reference q-lbaas, even though all CI jobs now use the neutron-lbaas devstack plugin22:13
jeblairphschwartz: your paste has 479 lines, which is why i think you hit the paste limit.  can you re-paste as multiple chunks22:14
*** piet has quit IRC22:14
*** baoli has quit IRC22:14
clarkbdougwig: you will need to update devstack-gate first22:14
clarkbdougwig: otherwise you will break everyone22:14
dougwigclarkb: aye, that's what i'm trying to do.  https://review.openstack.org/#/c/358258/22:15
*** andymaier_ has quit IRC22:15
*** nwkarste_ has quit IRC22:15
clarkbpabelanger: rm_work: in this setup ansible sshs over and over again to poll the build status. It does seem odd that it would fail to ssh after it has succeeded many times unless something has changed on the test node22:16
*** nwkarsten has quit IRC22:17
rm_workyeah, the whole thing seems odd to me22:18
clarkbjeblair: pabelanger and we are not using paramiko correct? zuul launchers are creating the ssh subprocess?22:20
rm_workI have to go for a bit...22:20
rm_worki'll be back around later though :/22:21
jeblairclarkb: zuul -> ansible -> openssh22:21
jeblairclarkb: also, worth noting that ansible uses controlmaster, so there is a persistent connection22:21
jeblair(which apparently dies)22:21
clarkboh its not making successive connections to poll? /me looks up controlmaster22:21
mordredmaybe we need to add a thing to .ssh/config to send keepalives?22:22
*** AnarchyAo has joined #openstack-infra22:22
*** fguillot has joined #openstack-infra22:23
clarkbwhat keeps the master running? looks like it will only reuse if it already exists?22:23
jeblairmordred: no -- there is activity every few seconds thanks to the ansible polling22:23
jeblairclarkb: ansible starts it for each host it connects to22:23
clarkbah and the 60s ControlPersist says wait 60s before you die if you were the master22:24
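[editor's note] The connection sharing jeblair and clarkb describe above can be sketched as a set of OpenSSH client options. This is a minimal illustration, not the launcher's actual code: the helper name `controlmaster_ssh_args` is hypothetical, the 60s persist value comes from clarkb's comment, and the `ansible-ssh-%h-%p-%r` socket naming is inferred from the process listing jeblair pastes in the next message.

```python
from typing import List


def controlmaster_ssh_args(control_dir: str, persist_seconds: int = 60) -> List[str]:
    """Build ssh options that multiplex sessions over one master connection.

    Hypothetical helper for illustration only.
    """
    return [
        # Reuse an existing master socket if present, otherwise become the master.
        "-o", "ControlMaster=auto",
        # Keep the master alive this long after the last client session closes.
        "-o", f"ControlPersist={persist_seconds}s",
        # Socket location; %h, %p and %r expand to host, port and remote user.
        "-o", f"ControlPath={control_dir}/ansible-ssh-%h-%p-%r",
    ]


if __name__ == "__main__":
    print(" ".join(controlmaster_ssh_args("/home/zuul/.ansible/cp")))
```

With options like these, each poll reuses the master's socket instead of opening a new TCP connection, so a failure in the middle of a job suggests the master connection itself went away.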
jeblairclarkb: if you want to see them, the processes on the launcher look like: zuul     31088  0.2  0.0  44588  1608 ?        Ss   22:23   0:00 ssh: /home/zuul/.ansible/cp/ansible-ssh-2001:4800:1ae1:18:f816:3eff:feab:c8ca-22-jenkins [mux]22:24
openstackgerritMerged openstack-infra/project-config: Remove q-lbaas from the nodepool pre-configured list  https://review.openstack.org/35825722:24
mgagne_clarkb: how long does it take for a new CI mirror to be built from scratch?22:26
clarkbthat sort of error implies that the master was not there for some reason though because a new connection failed to connect. /me begins to understand the setup22:26
*** sdague has joined #openstack-infra22:26
clarkbmgagne_: its fast since its mostly just an http server in front of an afs cache22:26
jeblairmgagne_: maybe 30 mins?22:26
mgagne_clarkb: right so afs just needs to heat up its cache to be effective?22:27
jeblairmgagne_: my union has a 4 hour minimum though ;)22:27
clarkbmgagne_: yup,22:27
jeblairmgagne_: yeah, after a couple of slow-ish jobs, it'll be warm.  someone suggested pip installing the upper-constraints file to make that happen out of band; dunno if that's been tried22:28
clarkbmgagne_: which can be done if you pip download only the global reqs and equivalent type things for ubuntu/debian/centos mirrors22:28
*** dfflanders has quit IRC22:28
clarkbjeblair: ya not sure if anyone has gone through the trouble since it hasn't seemed to be necessary22:28
*** xarses has joined #openstack-infra22:29
mgagne_good to know22:29
*** esberglu has quit IRC22:29
mgagne_clarkb, jeblair: we would be ready to offer new resources for CI infra which would be hosted in a different region than the current one22:30
*** dimtruck is now known as zz_dimtruck22:30
*** sdague has quit IRC22:31
jeblairmgagne_: cool -- do you want us to decrease/stop nyj01 or use both?22:31
mgagne_jeblair: I suggest we use both for now until someone asks us to stop using nyj01 for "reasons"22:31
*** ddieterly is now known as ddieterly[away]22:31
jeblairmgagne_: sure thing -- is the new one ready now?22:32
mgagne_jeblair: yes, ready like 5m ago22:32
mgagne_jeblair: mtl01 is the region name22:32
mgagne_jeblair: for "management" account, there is a public /28 available22:33
clarkbjeblair: is it possible we are hitting the 108 byte socket path limit? my quick check for a max length ipv6 addr representation is 85 bytes though22:33
clarkbbased on the control path in use22:33
jeblairclarkb: it's dying in the middle of the job i believe22:33
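[editor's note] clarkb's 85-byte figure can be reproduced with a short calculation. This is a sketch under assumptions: the path layout is copied from the process listing earlier in the log, the function names are hypothetical, and 108 is the size of `sun_path` on Linux including the trailing NUL byte.

```python
UNIX_SOCKET_PATH_LIMIT = 108  # sizeof(sun_path) on Linux, including the trailing NUL


def control_path(host: str, port: int = 22, user: str = "jenkins",
                 control_dir: str = "/home/zuul/.ansible/cp") -> str:
    """Return the control socket path for a host (layout assumed from the log)."""
    return f"{control_dir}/ansible-ssh-{host}-{port}-{user}"


def fits_in_socket_path(path: str) -> bool:
    """True if the path plus its NUL terminator fits in sun_path."""
    return len(path.encode()) + 1 <= UNIX_SOCKET_PATH_LIMIT


if __name__ == "__main__":
    # A fully expanded IPv6 address is the longest textual form: 39 characters.
    longest = control_path("ffff:ffff:ffff:ffff:ffff:ffff:ffff:ffff")
    print(len(longest), fits_in_socket_path(longest))
```

The longest case leaves some headroom under the limit, which is consistent with jeblair's point that the failure happens mid-job rather than when the socket is first created.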
mgagne_jeblair: for nodepool, quota will be over 120 instances at least, maybe more22:34
*** michauds has quit IRC22:36
*** thorst_ has joined #openstack-infra22:36
*** xyang1 has quit IRC22:36
jeblairmgagne_: cool, thanks! i'm about to eod -- maybe another infra-root can start the work to add it22:36
*** ddieterly[away] has quit IRC22:36
mgagne_jeblair: I will be off to the ops meetup too, so I might not be responsive this week. This is just a heads up about what's coming for our contribution.22:37
*** thorst_ has quit IRC22:39
*** ccamacho has quit IRC22:42
anteayamorning jhesketh22:43
*** sdake has quit IRC22:44
*** sdake has joined #openstack-infra22:44
clarkbjeblair: looking at logs on zl01 it is dying after copying the main sh file it looks like I assume the next zuul_runner there is attempting to run the script22:44
clarkbso yes I agree its dying in the middle of the job22:44
*** yamahata has quit IRC22:47
*** spzala has quit IRC22:48
*** esikachev has joined #openstack-infra22:49
*** yamamoto has joined #openstack-infra22:51
*** Hal has quit IRC22:52
openstackgerritChris Krelle proposed openstack-infra/glean: Adjust wait time for interfaces to become available  https://review.openstack.org/35947122:52
*** esikachev has quit IRC22:54
*** rbrndt has quit IRC22:55
clarkbpabelanger: jeblair I have held 3803271 which is an osic job running a neutron tempest test. Not sure how frequent these "fails" are but hopefully will catch one if we open the nets22:55
jeblairclarkb: you going to go ahead and open an ssh connection?22:56
*** fguillot is now known as fguillot_afk22:57
clarkbjeblair: ya22:58
*** fguillot_afk is now known as fguillot22:58
clarkbalso tailing zl01's log to see if I can hold one fast enough if I see it22:58
jeblairclarkb: probably not :(  but it might be worth setting an auto-hold for gate-tempest-dsvm-neutron-full-ubuntu-xenial ?22:59
jeblairclarkb: (okay, i guess if you're faster than the publisher playbook, you might be able to make it :)23:00
openstackgerritMerged openstack-infra/nodepool: Include ip address for ssh_connect exception  https://review.openstack.org/35936923:00
clarkbya we'll just have to see, might not be possible23:00
*** spzala has joined #openstack-infra23:00
clarkbjust held 380238623:01
clarkbit failed to ssh23:01
clarkband 380247123:01
clarkband 3802440, let's see if nodepool doesn't delete any of those23:02
clarkbnow one thing I am noticing is they all come in clumps23:02
clarkbwhich maybe implies its not a specific job side thing breaking us23:02
*** thorst_ has joined #openstack-infra23:03
*** chlong has quit IRC23:03
*** tonytan4ever has joined #openstack-infra23:03
*** larainema has joined #openstack-infra23:04
clarkboh cool one of the fails was on the host I had held earlier. Unfortunately my ssh connection to it is gone gone gone23:04
clarkbcloudnull: you don't happen to be around do you?23:04
* clarkb checks console logs23:04
*** spzala has quit IRC23:05
clarkbconsole log shows nothing23:05
*** thorst_ has quit IRC23:05
*** thorst_ has joined #openstack-infra23:06
clarkband we aren't copying job logs because ssh doesn't work right?23:07
clarkbthis is mysterious and fun23:07
* clarkb attempts to get in via the mirror on the 10 net23:08
*** tonytan4ever has quit IRC23:08
*** vhosakot has joined #openstack-infra23:08
*** nwkarsten has joined #openstack-infra23:08
ianwjeblair: in http://git.openstack.org/cgit/openstack-infra/zuul/tree/zuul/launcher/ansiblelaunchserver.py#n113 ; the ansible.cfg in JobDir is found because that's the pwd for the ansible-playbook call?  ie. no ANSIBLE_CONFIG setting23:09
*** yamahata has joined #openstack-infra23:09
clarkbI'm in \o/23:09
clarkbso that does work, the ipv4 stack is still there23:10
ianwjeblair: ahh, doh, yeah, cwd= in the call, now i see it23:10
*** ddieterly has joined #openstack-infra23:10
*** thorst_ has quit IRC23:10
*** ddieterly is now known as ddieterly[away]23:11
*** rhallisey has quit IRC23:11
clarkbhrm doesn't look like 3803271 has been detected as a fail by ansible yet23:12
clarkblet me just double check that the one I caught on zl03 exhibits the same behavior23:12
*** Hal has joined #openstack-infra23:12
*** ddieterly[away] is now known as ddieterly23:13
*** nwkarsten has quit IRC23:13
clarkbI has more datas I can hit the ipv6 addr from the mirror host23:14
clarkbbut not from rackspace23:14
clarkb(I am tunneling through my irc box for ipv6)23:15
mordredclarkb: oh - that's weird23:18
clarkbpabelanger: jeblair cloudnull ubuntu-trusty-osic-cloud1-3802440 with uuid d44117b0-4835-4910-990a-fa1157d54dc8 and IPs 2001:4800:1ae1:18:f816:3eff:fe45:ab1b, is the one I have "captured"23:18
clarkbmordred: ^ you too23:18
clarkbssh from the mirror in osic cloud1 can hit both those IPs23:19
clarkbI cannot hit the IPv6 from rackspace and the ipv4 is local only23:19
clarkbthis makes me think its maybe not a problem with the instance itself23:19
clarkband instead may be something in the cloud?23:19
mordredclarkb: the 'public' ipv4?23:19
mordredclarkb: or the 'private' ipv423:20
clarkbmordred: the test instances only have the private ipv423:20
mordredah. nod23:20
clarkbso I did ssh from home to mirror host via public v4, then ssh to test instance ipv6 and private ipv4 and both work23:21
pabelangerjust catching up on backscroll23:21
clarkbI double checked that sshing from my rackspace irc screen box to the test instance ipv6 fails23:21
mordredclarkb: I can verify that I can get in from the mirror host23:21
mordrednot that I was doubting you23:21
clarkbmordred: can you double check you can't hit it from your "local" ipv6?23:22
mordredclarkb:  I do not have "local" ipv623:22
clarkbits possible this is a rax to osic ipv6 issue23:22
pabelangermgagne_: jeblair: are we using the same credentials for mtl01?23:22
clarkbsince my irc box and the zuul launcher are all in rax23:22
clarkbpabelanger: maybe you can test ^23:22
mordredoh - my irc box is in vexxhost ...23:22
mordredone sec23:22
clarkbmordred: kk thanks23:22
*** Julien-zte has joined #openstack-infra23:22
clarkbif it is broken for vexxhost too then I think its likely in osic proper23:23
pabelangerclarkb: not yet, still don't have broker setup properly23:23
mordredclarkb: I cannot get there from vexxhost either23:23
mordredcloudnull: ^^23:23
pleia2I have native v6 here, can't get to it23:23
clarkbI think the next thing would be for cloudnull to examine networking in osic for us23:24
nibalizerjesusaur: so looks like this failed around rubocop ? https://review.openstack.org/#/c/350835/23:24
mordredclarkb: so - I was afk for a sec ... other than "this ipv6 doesn't work" ... what's the problem we're trying to sort ... is this the "ssh to nodes breaks" problem?23:25
*** Gorian|work has quit IRC23:25
mordredclarkb: so.... there are two ipv6 addresses on eth023:25
mordred    inet6 2001:4800:1ae1:18:f816:3eff:fe45:ab1b/64 scope global dynamic23:25
mordred       valid_lft 2589662sec preferred_lft 602462sec23:25
mordred    inet6 fe80::f816:3eff:fe45:ab1b/64 scope link23:25
mordred       valid_lft forever preferred_lft forever23:25
mordredis that 'normal' ?23:25
mordredyah. vexxhost does that. nm23:26
clarkbya the scope link is your I forget the name for it address23:26
pleia2fe80 is local23:26
openstackgerritMerged openstack-infra/tripleo-ci: Remve old cache files on the mirror server  https://review.openstack.org/34880623:26
*** hongbin has quit IRC23:26
clarkband the other one is the globally addressable one23:26
* pleia2 nods23:26
clarkband your routes are smart enough to use the proper src addr when doing things23:26
mordredyah. soo ... route -6n looks a little odd to me - but I'm still not fully used to looking at v6 route tables23:27
mordredI guess my question is "is something about the v6 address on br-ex messing up v6 routing"23:27
clarkb::/0                           ::                         !n   -1  1  9265 lo23:27
clarkbthat actually might be what is breaking us?23:27
jesusaurnibalizer: oh rubocop-rspec... that's not the same but likely very related23:28
mordredthat was my next question23:28
clarkbI think that says default route is out lo23:28
clarkb2001:4800:1ae1:18::/64         ::                         UAe  256 1     0 eth0 is what makes it work in the cloud from the mirror host23:28
mordredclarkb: because we can ssh to these when nodepool creates them23:28
mordredit's just after stuff runs on them that something goes south23:28
anteayajesusaur: rubocop-rspec uses rubocop23:28
pabelangerclarkb: what if you remove it?23:28
clarkbpabelanger: I think we have to update it to say eth023:28
clarkbI want to look at a host that works really quick23:29
openstackgerritJohn L. Villalovos proposed openstack-infra/yaml2ical: Update hacking test-requirement  https://review.openstack.org/35947823:29
clarkb::/0                           fe80::def                  UGDAe 1024 4    42 ens323:29
*** yuanying has joined #openstack-infra23:30
clarkbwe have ^ on a xenial host where things are working23:30
clarkblet me compare trusty to trusty23:30
*** fguillot has quit IRC23:30
clarkb::/0                           fe80::def                  UGDAe 1024 2     0 eth023:30
clarkbthats trusty23:30
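[editor's note] The comparison clarkb makes by eye above (a `::/0` entry flagged `!n` on `lo` on the broken host, versus `::/0` via `fe80::def` on `eth0`/`ens3` on working hosts) can be sketched as a small check over `route -6n` output. This is an illustration under assumptions: the function name is hypothetical, the seven-column layout is taken from the lines pasted in this log, and reading `!` as a reject route follows the net-tools flag legend as I understand it.

```python
def default_route_is_usable(route_output: str) -> bool:
    """Return True if some ::/0 entry is up, not a reject route, and not on lo."""
    for line in route_output.splitlines():
        fields = line.split()
        # Expected columns: destination, next hop, flags, metric, ref, use, iface.
        if len(fields) != 7 or fields[0] != "::/0":
            continue
        flags, iface = fields[2], fields[6]
        if "U" in flags and "!" not in flags and iface != "lo":
            return True
    return False


if __name__ == "__main__":
    broken = "::/0    ::          !n    -1   1 9265 lo"
    working = "::/0    fe80::def   UGDAe 1024 2    0 eth0"
    print(default_route_is_usable(broken), default_route_is_usable(working))
```

Run against the held node, a check like this would flag the missing default route even though the on-link `2001:4800:1ae1:18::/64` route still lets the mirror host reach it.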
mordredGAH STAB STAB23:31
mordredthe devstack plugin stuff totally didn't work23:31
mordredbecause it installs python-openstackclient after running the plugin23:31
clarkbmordred: I was worried about that23:31
jesusaurnibalizer: so I guess that change needs to pin rubocop-rspec too?23:32
clarkbwe are using RAs here?23:33
pabelangerclarkb: ya, looks like you are on the right track23:33
*** spzala has joined #openstack-infra23:34
mordredclarkb: https://review.openstack.org/35947923:34
clarkbAug 23 23:28:56 ubuntu dhclient: Error printing text. is a fun one in syslog23:34
anteayajesusaur: https://review.openstack.org/#/c/359421/23:35
clarkbpabelanger: mordred pleia2 so I think either the cloud is sending us a new bad RA or maybe neutron is?23:35
jesusaurnibalizer: looks like rubocop 1.5.0 is the last version to support ruby 1.923:36
*** salv-orlando has quit IRC23:36
clarkblike we aren't isolating neutron's RAs for client networks from our actual interfaces maybe23:36
clarkbbut I am still sort of swimming through logs trying to figure out when that route is updated23:36
jesusauranteaya: nibalizer: oh i was confused about what change i was looking at23:37
anteayajesusaur: so the fix merged an hour after your change failed23:37
anteayaso maybe recheck?23:37
*** fguillot has joined #openstack-infra23:37
mordredclarkb: if that doesn't work, I'm going to see about making the nodepool plugin install into a virtualenv23:37
mordredin fact, I may just do that23:38
openstackgerritJohn L. Villalovos proposed openstack-infra/yaml2ical: Manual sync to global-requirements  https://review.openstack.org/35948023:38
jesusauranteaya: yeah23:38
clarkbpeople that know ipv6 better than I do, does linux log RA updates anywhere?23:38
*** spzala has quit IRC23:39
*** hexlibris has joined #openstack-infra23:40
clarkblike dhcp keeps everything in /var/lib/dhcp23:40
openstackgerritSagi Shnaidman proposed openstack-infra/tripleo-ci: Use cached images  https://review.openstack.org/35948123:40
pleia2hm, I've only ever seen updates in syslog, but I'm not sure if that was some kind of debug level23:41
clarkbneutron does have radvd processes running but reading their configs they are scoped to a neutron addr which should be "detached" from eth023:43
*** paulobanon has quit IRC23:43
clarkbsupposedly net.ipv6.conf.eth0.accept_ra = 1 means accept RAs if forwarding is disabled and we also have net.ipv6.conf.eth0.forwarding = 123:45
openstackgerritMonty Taylor proposed openstack-infra/nodepool: Install nodepool and shade into a virtualenv  https://review.openstack.org/35942523:45
mordredclarkb: are we only seeing the detach failures on neutron devstacks?23:46
clarkbon a host that works we have net.ipv6.conf.eth0.forwarding = 023:46
openstackgerritDmitry Ilyin proposed openstack-infra/project-config: Enable voting checks for the Fuel unit tests Puppet 4.5  https://review.openstack.org/35733523:46
clarkbmordred: I wasn't able to confirm that but its heavily neutron from the list23:46
jeblairmordred: normally i'm all ick on venvs, but this seems like a good pattern -- we're not interested in being part of an openstack deployment, we just happen to need one.23:47
clarkbmordred: anyways I think maybe something is flipping that sysctl and that may be breaking us23:47
mordredjeblair: yah. that was my thinking23:47
mordredclarkb: ++23:47
*** Julien-zte has quit IRC23:47
mordredlib/neutron_plugins/services/l3:        sudo sysctl -w net.ipv6.conf.all.forwarding=123:47
clarkbmordred: yay?23:47
pabelangerthere we go23:47
* clarkb goes to grab a beer now23:47
mordredI think that would be it23:47
mordredI mean, it should stop doing that :)23:47
jeblairmordred: you beat me23:47
mordredjeblair: git grep ftw23:48
clarkbmordred: I agree23:48
mordredjeblair: (I may have already been hacking in devstack in a window)23:48
mordredclarkb: now ... how do we communicate "hey devstack, this is an interface that you should not enable forwarding on"23:48
mordredsc68cal: ^^ any chance you're around?23:49
clarkbmordred: another option is we can set net.ipv6.conf.eth0.accept_ra = 1 to 2 which means always accept ras23:49
clarkbregardless of forwarding state23:49
clarkb(which is apparently not in line with rfcs but they are only meant to be read not followed right?)23:49
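[editor's note] The interaction between `accept_ra` and `forwarding` that clarkb describes can be written down as a small truth function. This is a sketch of the semantics as stated in the discussion (0 = never accept, 1 = accept only when forwarding is off, 2 = accept regardless); the function names are hypothetical, and `read_ipv6_setting` assumes the usual `/proc/sys/net/ipv6/conf/<iface>/` layout on Linux.

```python
def accepts_router_advertisements(accept_ra: int, forwarding: int) -> bool:
    """Model whether an interface honours RAs for a given pair of sysctl values."""
    if accept_ra == 0:
        return False            # RAs are ignored outright
    if accept_ra == 1:
        return forwarding == 0  # accepted only while the host is not forwarding
    return True                 # 2: accepted even with forwarding enabled


def read_ipv6_setting(iface: str, name: str) -> int:
    """Read a per-interface IPv6 sysctl (Linux only; path layout assumed)."""
    with open(f"/proc/sys/net/ipv6/conf/{iface}/{name}") as handle:
        return int(handle.read().strip())


if __name__ == "__main__":
    # Working host in the log: accept_ra=1, forwarding=0.
    print(accepts_router_advertisements(1, 0))
    # Broken host in the log: accept_ra=1, forwarding=1.
    print(accepts_router_advertisements(1, 1))
```

Under this model, flipping `net.ipv6.conf.all.forwarding` to 1 is enough to make an `accept_ra = 1` interface stop honouring the RAs that supply its default route, which matches what the held node shows.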
* sc68cal connects and reads scrollback23:49
mordredsc68cal: tl;dr - devstack may be hosing the ipv6 connections on our ipv6 only cloud23:50
anteayasc68cal: it started this morning23:50
clarkbI think that is the wrong solution since this means that devstack would hose other people's machines in a similar situation23:50
pabelangerclarkb: agree23:50
clarkbso should fix in devstack with its forwarding flippage23:50
*** esikachev has joined #openstack-infra23:50
anteayasc68cal: or better, we started looking at it this morning23:50
jeblairclarkb: i have a slight concern about the number of hits in that codesearch query23:51
* clarkb pulls it up23:51
clarkboh wow23:51
clarkbwe might have to do the other thing too :/23:51
mordredjeblair: most of the hits seem to be in charms23:51
jeblairmordred: let me rephrase -- i have a concern about the breadth of projects that have similar looking settings :)23:52
jeblairlike -- it might be some voodoo that has been copied around prolifically23:52
*** _sarob has quit IRC23:53
sc68calok, I think I follow what's going on.23:54
*** sarob has joined #openstack-infra23:54
pabelangerjeblair: are you able to catch me up on the (new?) mtl01 region in internap?  Is that a new region we are launching or just moving an AFS mirror to it?23:55
*** rlandy has quit IRC23:55
*** esikachev has quit IRC23:55
sc68calis this for rfc6204w3 ?23:56
clarkbsc68cal: this is for run neutron in a cloud that is ipv6 only23:56
openstackgerritMerged openstack-infra/puppet-openstack_infra_spec_helper: Pin json_pure gem for ruby1.9 support  https://review.openstack.org/35083523:56
jeblairpabelanger: new region, we'll use both.  i don't know the answer about creds -- probably worth just trying :)23:56
*** spzala has joined #openstack-infra23:56
clarkbsc68cal: so the instances we get from that cloud get RAs, then neutron shows up and says no don't do that and now we can't route23:56
sc68calclarkb: right, but you have an interface that is receiving RAs from an upstream source, and also advertises to links connected to the node23:57
sc68calbasically https://tools.ietf.org/html/rfc620423:57
pabelangerjeblair: sure, I'll poke around a bit tonight.23:57
mordredsc68cal: http://paste.openstack.org/show/562777/ is the interfaces on a borked host and also the routes23:57
clarkbsc68cal: we have an interface that receives RAs because neutron is the underlying cloud. Then neutron in that instance does whatever the heck neutron does23:57
sc68calit's the same issue that customer premise equipment has in ISPs23:58
clarkbwell the issue is neutron should not touch eth0 in the gate23:58
clarkbever please don't do it23:58
*** zhurong has joined #openstack-infra23:58
sc68calcustomer router receives RAs from ISP equipment but also needs to distribute RAs to boxes connected to it23:58
clarkb(replace eth0 with whatever the actual interface is)23:58
*** dingyichen has joined #openstack-infra23:58
*** AnarchyAo has quit IRC23:59
clarkbso I think the issue here is the assumption that it can edit the all settings in sysctl rather than the subset it needs23:59
mordredcurrent theory is that we think that the neutron running on the host is also managing to distribute RAs to the host itself, yeah?23:59
openstackgerritJohn L. Villalovos proposed openstack-dev/hacking: Fix issues detected by pycodestyle  https://review.openstack.org/35948723:59
clarkbmordred: maybe? I checked the radvd config and its scoped to an interface23:59
clarkband that interface shouldn't have a path to eth023:59
