Wednesday, 2016-03-02

mnasergot it, thanks :)00:00
mtreinishpleia2: ^^^ finally merged00:00
anteayathank you00:00
anteayano idea when I will be in montreal next00:00
anteayabut now you are in my address book :)00:00
mtreinishonce that gets applied we can run it for a little while and see if it gives us more clues on what's going on00:00
*** yamahata has joined #openstack-infra00:00
pleia2mtreinish: yay, let me fire this up again00:01
pleia2(once it gets to the server, that is)00:01
anteayamnaser: out of curiosity, how many vms do you think vexxhost will donate when we are at quota?00:02
mnasertough question to answer but the answer is as much as we can, but initially we'd like to grow it up to 100 and see how things are working00:03
EmilienManteaya: you have to ping me if you're in MTL00:03
mnaseri recently discovered an interesting cool local montreal openstack meetup00:04
anteayaEmilienM: I will indeed00:04
*** sflanigan has quit IRC00:05
*** pahuang has quit IRC00:05
*** camunoz has quit IRC00:05
*** aeng has quit IRC00:05
anteayamnaser: fair enough, and it might not have been fair of me to ask so sorry to put you on the spot00:05
anteayamnaser: nice00:05
anteayado share a report after you attend an event00:05
mnaserhttp://montrealopenstack.org/en/ -- cool group of people, so try to see if you can drop by :)00:05
openstackgerritGrant Murphy proposed openstack-infra/project-config: Add bandit gate job for Octavia  https://review.openstack.org/28691200:05
mnaseri attended the most recent one, met some cool people with interesting ideas.  dmsimard was there too to talk rdo :)00:06
mnaseri'll have to say he tried to convince me to bring resources so credit goes to him for bringing up the idea again :p00:06
clarkbmnaser: ok I think I have a quick canary tempest run going which should give us an idea of relative performance (which I don't expect to be a real problem)00:06
*** ybathia has joined #openstack-infra00:06
*** sridhar_ram has quit IRC00:06
mnasernice, hopefully it runs quickly :)00:07
EmilienMmnaser: you in mtl? nice, i lived there before - in qc now00:07
mtreinishclarkb: oh, you just reminded me I need to work on adding the 2nd page performance graphs to o-h (once we get the collection working again)00:07
*** annegentl_ has quit IRC00:08
clarkbmtreinish: sorry I haven't been ignoring your subunit things just mostly head down on trying to add clouds00:08
clarkbmilestones and all that00:08
mnaserEmilienM: yep!  been here for most of my life, nothing but winter and cold complaints :P00:08
mtreinishclarkb: no worries, I realize that there are other priorities right now00:09
*** sridhar_ram has joined #openstack-infra00:09
clarkbmtreinish: that and taxes00:09
mtreinishI honestly had forgotten about your request for run_time graphs on the 2nd page though00:09
clarkbI got a discount for doing them a month and a half early00:09
pleia2mtreinish: ok, new subunit-gearman-worker.py is in place, I'm going to start it now and collect some logs00:09
mtreinishoh, and damn my taxes. I need to do that soon too00:09
mtreinishpleia2: cool, thanks00:10
smarcetfungi: the fix is taking its time to be merged00:11
*** yamahata has quit IRC00:11
dimsmtreinish : oh man it's Mar 1 already00:12
fungismarcet: yes, all but one job completed, but gate-infra-puppet-apply-bare-trusty is taking longer than usual00:12
*** ashleighfarnham has quit IRC00:12
smarcetfungi: thx for info :)00:12
clarkbI need to pick the don't clone all the things work back up again in d-g00:12
mtreinishpleia2: well it looks like we added 1 run and no others, so I'm guessing there is a nice stack trace in there already :)00:13
clarkbtripleoci? ya that is setup for every d-g job00:13
clarkbwould be an easy one to split off like grenade00:13
fungiclarkb: do the repeated loops through /usr/zuul-swift-logs-env in https://jenkins05.openstack.org/job/gate-infra-puppet-apply-bare-trusty/826/console look normal to you?00:13
pleia2mtreinish: just stopped it, grabbing log00:14
clarkbfungi: ya I think its a new connection for each file00:14
clarkbfungi: so we get the warning over and over and over00:14
fungiclarkb: oh, so this is all it trying to upload logs after completing the job?00:15
fungiit's taking, like, forever00:15
clarkbyup00:15
fungiand from rax-dfw, which is where the swift container is being hosted00:15
pleia2mtreinish: erg, not sure how helpful this is http://paste.openstack.org/show/488878/ (I'll email the full log too)00:15
fungiso extra crazy on the crazy00:15
fungii wonder if rackspace's swift is struggling?00:16
*** pahuang has joined #openstack-infra00:17
mtreinishpleia2: well that's the extra logging I was looking for, between here: http://git.openstack.org/cgit/openstack-infra/puppet-subunit2sql/tree/files/subunit-gearman-worker.py#n18700:17
mtreinishand here: http://git.openstack.org/cgit/openstack-infra/puppet-subunit2sql/tree/files/subunit-gearman-worker.py#n19500:17
mtreinishsubunit_v2 is getting closed00:17
mtreinishstill have no clue why, but it narrows it down quite a bit00:17
pleia2oh good, I'm glad it means something to you :)00:17
clarkbpleia2: nibalizer: want to review the vexxhost creds change?00:18
*** zz_dimtruck is now known as dimtruck00:18
pleia2sure00:18
mtreinishpleia2: also looking at the full log I can confirm we're not reusing any IO objects (the addrs are all different)00:19
pleia2+2, but I'll leave for a couple minutes in case nibalizer wants to look too00:19
clarkbpleia2: danke00:19
mtreinishbut, I still have no idea why it's a closed object when we process it00:20
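The symptom mtreinish describes (a stream that is already closed by the time it is processed) can be reproduced in isolation. This is only a minimal sketch with Python's standard `io.BytesIO`; `read_stream` is a hypothetical helper for illustration and is not taken from subunit-gearman-worker.py.

```python
import io


def read_stream(stream):
    """Return the bytes in ``stream``, or None if it was already closed."""
    if stream.closed:
        return None
    return stream.read()


open_stream = io.BytesIO(b"subunit bytes")
closed_stream = io.BytesIO(b"subunit bytes")
closed_stream.close()

first = read_stream(open_stream)     # stream is open, so its bytes come back
second = read_stream(closed_stream)  # guarded: a closed stream yields None

# Without the guard, reading a closed stream raises ValueError.
try:
    closed_stream.read()
    raised = False
except ValueError:
    raised = True
```

Checking `stream.closed` only reports the problem earlier; it does not explain which layer closed the object.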
fungii may kill this job and see if it completes any faster on another worker00:20
fungii'm not sure how many more files it has to upload00:20
nibalizerclarkb: sure link?00:21
clarkbnibalizer: https://review.openstack.org/28689500:21
nibalizerwhat kind of quota did they give us?00:21
*** sflanigan has joined #openstack-infra00:21
*** sflanigan has joined #openstack-infra00:21
clarkbnibalizer: mnaser wants to start smaller like 25 test instances then ramp up from there00:21
nibalizercool00:21
nibalizerthanks vexxhost!00:21
clarkbmnaser: sounds like you are ok with the eventual goal of 100 just want to not do it all at once and break things00:22
clarkbmnaser: which is perfectly fine by me00:22
nibalizerclarkb: im realizing that we should have a contacts file00:22
nibalizerso that we know who to poke for what00:22
*** aeng has joined #openstack-infra00:22
*** camunoz has joined #openstack-infra00:22
*** aeng has quit IRC00:22
clarkbnibalizer: if you peruse the passwords file I have done this00:22
funginibalizer: well, it's in the passwords list00:22
clarkbnibalizer: :)00:22
nibalizeroh nice00:22
fungiyeah, that00:22
nibalizerty00:22
anteayaclarkb: is there a project config patch up yet?00:22
smarcetfungi: build failed00:22
mnasernp :) and yep, that's the eventual target00:22
smarcetPatch Set 2: Verified-200:22
*** aeng has joined #openstack-infra00:22
smarcetBuild failed (gate pipeline).  F00:22
clarkbanteaya: not yet, I want to get the base thing in and do initial testing then add the nodepool change in project-config00:23
fungismarcet: i know, i killed that job and am sending it back in to hopefully get a faster one00:23
smarcetoh ok00:23
smarcetgot it :)00:23
nibalizerclarkb: lgtm00:23
anteayaclarkb: sounds good00:24
*** gildub has joined #openstack-infra00:25
*** Sukhdev has quit IRC00:25
fungismarcet: i was hoping it would just rerun the job when i killed the worker, but in this case it unfortunately marked the job as failed. still should go back through pretty quickly this time unless we get really, really unlucky00:25
fungiyeah, i just watched another change running the same job upload its files in a few seconds00:27
fungiso something was definitely wrong with the worker where we ended up before00:27
smarcetfungi: got it00:27
fungizuul says estimated time to merge 286896,2 is now 3 minutes00:29
fungiso should hopefully end up in the 00:45 puppet pulse out to the server00:29
openstackgerritMerged openstack-infra/system-config: Add vexxhost cloud credentials  https://review.openstack.org/28689500:30
mnaserbtw, clarkb, you'll notice that right now some servers are not running at full speed (processor speed).  that's being resolved tonight so cpu speed for those affected will go  from 1.6 to 2.93 per core00:30
mnaserwhich will probably halve the runtime00:30
*** thorst_ has joined #openstack-infra00:30
*** AndyU has quit IRC00:31
mnaserprocessor clock speed that is00:31
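A rough check of the "probably halve the runtime" estimate, assuming the jobs are entirely CPU-bound and that runtime scales inversely with clock speed (both are simplifying assumptions, not claims from the log):

```python
def scaled_runtime(runtime, old_ghz, new_ghz):
    """Estimate runtime after a clock change, assuming purely CPU-bound work."""
    return runtime * old_ghz / new_ghz


# Fraction of the original runtime expected after going from 1.6 to 2.93 GHz.
ratio = scaled_runtime(1.0, 1.6, 2.93)
```

Under those assumptions the ratio comes out a little above one half, so "halve" is a fair approximation; I/O-bound portions of a job would not speed up this way.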
*** ianw has joined #openstack-infra00:32
pabelanger\o/ canadians00:34
ianwlinode grumble "maintenance" grumble "everything's fine status page"00:34
fungismarcet: okay, this is absurd. we've somehow hit another worker taking forever to upload files after the job succeeds. i'm going to go ahead and merge this change manually so it's not delayed further00:34
*** Qiming has quit IRC00:34
smarcetfungi: thx00:34
*** piet has joined #openstack-infra00:34
anteayapabelanger: hehehe00:35
openstackgerritMerged openstack-infra/system-config: OpenstackId relase 1.0.13  https://review.openstack.org/28689600:36
fungismarcet: ^ there we go00:36
ianwclarkb: i'm new to these many layers.  with your change, i do see the swift client object change between uploads.  however, the x-auth-token does *not* seem to change.  i think it's effectively cached by os-client-config?  unless we create another client-object00:36
*** erlon has quit IRC00:36
fungijhesketh: any guesses why i've seen two gate-infra-puppet-apply-bare-trusty jobs running in rax-dfw take seemingly forever to upload logs to swift?00:37
smarcetfungi: thx a lot for all your help :)00:37
*** keedya has joined #openstack-infra00:37
*** rockyg has quit IRC00:37
fungijhesketh: current example (still running) is https://jenkins04.openstack.org/job/gate-infra-puppet-apply-bare-trusty/816/console00:37
anteayaI'm out first thing tomorrow morning, back late morning00:37
* jhesketh looks00:37
*** rbrndt has joined #openstack-infra00:37
*** thorst_ has quit IRC00:38
fungijhesketh: i watched one in another rax region upload logs in a few seconds00:38
fungisame job00:38
clarkbianw: I think oscc "caches" it but swiftclient detects that it is old and will try to make a new one that it uses00:38
*** thorst_ has joined #openstack-infra00:38
*** derekh has quit IRC00:38
fungiand, amusingly, this is taking forever in the region where we're actually hosting the swift container00:38
jheskethfungi: yeah, nothing obvious is jumping out at me...00:39
jheskethother jobs seem to be uploading okay00:39
ianwclarkb: with what i was doing before, uploading two images back to back, wouldn't you expect to see a new token used between them?00:39
ianws/used/issued/00:39
fungijhesketh: i can ssh into that client and see if, say, there's lots of packet loss or something00:39
jheskethfungi: the number of files could be part of it00:40
jheskethpossibly a good idea00:40
clarkbianw: only if the first token had expired00:40
fungijhesketh: well, as i said, saw the exact same job go quickly in another rax region, so same number of log files00:40
*** thorst_ has quit IRC00:40
*** thorst_ has joined #openstack-infra00:41
ianwclarkb: but i thought the issue was swiftclient can't reissue tokens?00:41
clarkbianw: it can now that oscc has been updated to pass all the auth info it has through00:41
clarkbianw: let me go grab the traceback paste again00:41
jheskethfungi: right, but we've seen 2 gate-infra-puppet-apply-bare-trusty take a while and we haven't experienced this on other jobs?00:41
fungijhesketh: is there a particular swift endpoint hostname i should ping?00:41
fungijhesketh: right, so far my sample size is 200:41
fungiand they were both the same job00:42
*** sarob has joined #openstack-infra00:42
fungi172.99.116.7 is this worker00:42
clarkbianw: http://paste.openstack.org/show/488685/ that is it trying to reauth00:42
jheskethfungi: the endpoints are here https://jenkins04.openstack.org/job/gate-infra-puppet-apply-bare-trusty/816/parameters00:42
*** keedya has quit IRC00:42
*** keedya has joined #openstack-infra00:43
*** ajmiller has quit IRC00:43
fungi[Wed Mar  2 00:41:40 2016] iptables dropped: IN=eth0 OUT= MAC=bc:76:4e:06:30:ef:e4:c7:22:6c:6e:41:08:00 SRC=193.105.134.220 DST=172.99.116.7 LEN=40 TOS=0x00 PREC=0x00 TTL=238 ID=36025 PROTO=TCP SPT=49975 DPT=8080 WINDOW=1024 RES=0x00 SYN URGP=000:43
fungioh, that's something trying to hit 8080/tcp on the worker00:43
fungiso unrelated00:44
jheskethfungi: I wonder if we're hitting some rate limiting given the number of concurrent threads we have uploading00:44
fungino obvious packet loss00:45
fungiyeah, maybe00:45
fungidoesn't immediately look like network performance issues anyway (0% loss, rtt<2ms)00:45
jhesketh100 write operations per second per container00:45
jhesketh If you reach this rate limit, Cloud Files slows the processing of write requests for the container to 100 write operations per second per container.00:46
fungiooh, so maybe multiple jobs uploading logs simultaneously00:46
*** jlk has quit IRC00:46
jheskethmaybe, but would we be doing more than that?00:46
*** sarob has quit IRC00:46
*** jlk has joined #openstack-infra00:46
*** dimtruck is now known as zz_dimtruck00:46
*** jlk has quit IRC00:46
*** jlk has joined #openstack-infra00:46
fungidunno. depends on how many complete at the same time i guess00:46
*** annegentl_ has joined #openstack-infra00:47
openstackgerritMatthew Treinish proposed openstack-infra/subunit2sql: Use instance variable in ByteStreamToStreamResult init  https://review.openstack.org/28692500:47
jhesketherror 429 is too many requests, so if it fails with that we'll have a good idea00:47
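If the uploads did turn out to be failing with 429 responses (not confirmed at this point in the discussion), one common way to cope is to retry with an increasing delay. The sketch below is generic: `TooManyRequests` and `upload_with_backoff` are hypothetical names, and nothing here is taken from the actual log upload script.

```python
import time


class TooManyRequests(Exception):
    """Hypothetical stand-in for an HTTP 429 response from the object store."""


def upload_with_backoff(upload, retries=5, base_delay=0.5, sleep=time.sleep):
    """Call ``upload()``; on TooManyRequests wait, double the delay, retry."""
    delay = base_delay
    for attempt in range(retries):
        try:
            return upload()
        except TooManyRequests:
            if attempt == retries - 1:
                raise  # out of attempts, surface the error
            sleep(delay)
            delay *= 2


# Usage: an upload that is rate limited twice before it succeeds.
calls = {"count": 0}


def flaky_upload():
    calls["count"] += 1
    if calls["count"] < 3:
        raise TooManyRequests()
    return "uploaded"


waits = []  # record the delays instead of really sleeping
result = upload_with_backoff(flaky_upload, sleep=waits.append)
```

Backing off lowers the request rate per uploader, which helps only if the limit is being hit by many concurrent uploads rather than by a slow backend.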
*** Swami has quit IRC00:48
mtreinishpleia2: just for a sanity check which version of subunit2sql is installed on the gearman worker? I'm starting to think the bug is in the lower layers00:51
pleia2elizabeth@subunit-worker01:~$ /usr/local/bin/subunit2sql --version00:53
pleia21.3.000:53
*** rossella_s has quit IRC00:54
mtreinishpleia2: ok, that's the latest00:54
*** rossella_s has joined #openstack-infra00:54
fungiugh, it looks like we're still beset by git remote timeouts. this one just tripped a gate reset https://jenkins03.openstack.org/job/gate-cinder-python27/3128/console00:55
fungiat least zuul-cloner properly used the cache00:55
*** annegentl_ has quit IRC00:56
fungihttps://review.openstack.org/282099 would be awesome to have00:58
*** zhurong has joined #openstack-infra00:59
*** smarcet has quit IRC00:59
*** tphummel has quit IRC01:00
*** rbrndt has quit IRC01:00
fungijhesketh: so that puppet apply job finally completed successfully, just spent >30 minutes uploading logs when it was done01:01
*** rbrndt_ has joined #openstack-infra01:02
*** zz_dimtruck is now known as dimtruck01:02
fungijob ran for 35 minutes in total, 90% of which was spent on log uploading01:02
*** sdake has joined #openstack-infra01:03
*** achanda has quit IRC01:03
*** rbrndt_ has quit IRC01:05
openstackgerritMatthew Treinish proposed openstack-infra/puppet-subunit2sql: Switch use of cStringIO.StringIO to io.BytesIO  https://review.openstack.org/28692601:06
jheskethfungi: yeah and the other puppet apply is still uploading :-s01:06
fungioh, you found another one?01:07
*** chenli has joined #openstack-infra01:09
openstackgerritMatthew Treinish proposed openstack-infra/subunit2sql: Remove unused flush from read_subunit()  https://review.openstack.org/28692801:09
*** dchen has joined #openstack-infra01:13
*** tphummel has joined #openstack-infra01:13
*** ybathia has quit IRC01:15
*** sdake has quit IRC01:16
clarkbfungi: mnaser http://paste.openstack.org/show/488885/ that is tempest's results01:19
clarkbnow to figure out how that compares to other clouds01:19
*** Qiming has joined #openstack-infra01:23
*** Sukhdev has joined #openstack-infra01:23
jheskethfungi: sorry, missed your question: I was watching https://jenkins02.openstack.org/job/gate-infra-puppet-apply-bare-trusty/705/console but it finished01:23
jheskethit did, however, also take 30min01:23
*** cody-somerville_ has joined #openstack-infra01:23
clarkbfungi: mnaser http://paste.openstack.org/show/488886/ rax there seems to be on the quick end and ovh in the middle and this is on par with ovh01:23
jheskethand also in dfw01:23
clarkbtherefore full steam ahead :)01:24
*** cody-somerville has quit IRC01:24
clarkbI am going to attempt adding a mirror nowish01:25
clarkbhrm01:25
clarkbmaybe I should check quotas first /me does this01:25
*** jamesmcarthur has joined #openstack-infra01:25
*** kzaitsev_mb has joined #openstack-infra01:26
*** tphummel has quit IRC01:26
*** tphummel has joined #openstack-infra01:27
*** piet has quit IRC01:29
clarkbonce I figure out how to do it01:29
clarkbok looks like openstackjenkins has a rough instance quota of 25 which is spot on for the initial request from mnaser. Perfect01:31
*** sdake has joined #openstack-infra01:31
*** sdake has quit IRC01:33
* craige prods jeblair if he's still about, for a fresh load of storyboard, at his convenience.01:33
*** sputnik13 has quit IRC01:33
*** _amrith_ is now known as amrith01:34
*** sam_wan has joined #openstack-infra01:36
*** beisner has quit IRC01:38
*** Daisy has joined #openstack-infra01:38
openstackgerritTristan Cacqueray proposed openstack-infra/shade: Add get_stack_outputs method  https://review.openstack.org/27648701:40
*** sridhar_ram has quit IRC01:40
*** beisner has joined #openstack-infra01:40
*** fawadkhaliq has quit IRC01:42
*** yamamoto has joined #openstack-infra01:45
fungicraige: i can mysqldump it. where were you grabbing the result from? anywhere in particular?01:46
jeblairfungi, craige: there is a script on the host01:46
jeblairi wrote it so anyone could easily do it01:46
fungiexcellent. doing now01:46
openstackgerritTristan Cacqueray proposed openstack-infra/shade: Add environment_files to stack_create  https://review.openstack.org/27755901:46
fungii manually did one for SotK not long ago, not realizing we had it scripted01:46
jeblairmake-db-dump.sh01:47
craige^^01:47
*** tiswanso has joined #openstack-infra01:47
craigeand I hit it from a URL I have bookmarked, fungi01:47
jeblaircraige: next time ask roots to run "sudo make-db-dump.sh" :)01:47
craigeWill do jeblair. I kept forgetting the script name and couldn't find the repo it lived in01:48
*** tiswanso has quit IRC01:48
fungiseems to be /root/make-db-dump.sh01:48
*** apoorvad has quit IRC01:48
*** tiswanso has joined #openstack-infra01:48
fungijust found it in root's shell history myself01:48
jeblairer yeah; "sudo /root/make-db-dump.sh" then01:48
*** tiswanso has quit IRC01:48
*** tiswanso has joined #openstack-infra01:49
fungihrm, sudo -H probably01:49
*** rhallisey has joined #openstack-infra01:49
*** tiswanso has quit IRC01:49
fungiahh, nope, still prompts for a password. i bet the ~root/.my.cnf needs work01:49
jheskethyeah I thought there would be a script on the host but my hesitancy was not knowing the sensitivity of the data, if it needed scrubbing or the security of distributing it01:49
*** sripriya has quit IRC01:49
jeblairjhesketh: the script scrubs the data01:49
fungihrm, nope, sudo -H mysql connects fine01:49
*** tiswanso has joined #openstack-infra01:50
jheskethjeblair: ah okay, I didn't even log in to check01:50
craigeAdded the script to my etherpad jeblair01:50
jeblairfungi: to be fair, i probably ran it with "sudo su -; ./make-db-dump.sh"01:50
clarkbfwiw I am currently working out how to boot this mirror, its first time with my cinder stuff so different and I have to figure out how to specify the az01:50
clarkbbut it is in progress!01:50
clarkbjeblair: you said that the 3.16 kernels on 14.04 don't work with afs right?01:50
* clarkb finds an image with 3.13 kernel01:51
*** gongysh has joined #openstack-infra01:51
*** achanda has joined #openstack-infra01:51
fungioh, it's explicitly prompting with -p01:51
jeblairclarkb: i don't recall the versions; it was the ones with the later archive thingy (which is in the ubuntu server image but not the ubuntu cloud image)01:51
*** mtanino has quit IRC01:51
clarkbjeblair: thanks I think that is 3.16 and normal is 3.1301:52
jeblairfungi: ah, might be able to drop that then :)01:52
jeblairclarkb: could probably check a working mirror to be sure01:52
*** esikachev has joined #openstack-infra01:52
fungijeblair: i have made a working version by dropping -u, -p and -h01:52
fungithe original has a .old suffix appended01:53
fungicraige: i've run it now01:53
jeblairclarkb: if you are stuck with an image with the archive thingy, our puppet will remove the archive, and then some amount of bashing "apt-get remove" at various kernel things with higher numbers should get it back into a state01:53
jeblairfungi: ++01:53
*** chenli has quit IRC01:53
*** chenli has joined #openstack-infra01:54
*** jaypipes has quit IRC01:54
*** jpr has quit IRC01:54
clarkbjeblair: I found one. vexxhost does both the .2 .3 .4 .5 images which have new kernels and up to date unpointed release images which have old kernels01:54
*** Daisy has quit IRC01:54
jeblairoh interesting01:54
jeblair(old, but maintained!, kernels, for the record)01:55
*** Daisy has joined #openstack-infra01:55
*** ybathia has joined #openstack-infra01:55
craigethanks fungi, nabbing it01:55
clarkbnow I just need to sort out how to pass the availability zone01:55
*** Daisy has quit IRC01:55
clarkb(the az stuff is documented in the passwd file not sure if there is a better place for that but cinder needs a specific az)01:55
*** Daisy has joined #openstack-infra01:55
craigeGot fungi. Thanks fungi jeblair01:55
craigeUm, "got it, fungi"01:56
clarkblooks like create_server takes an arg01:56
*** thorst_ has quit IRC01:56
*** esikachev has quit IRC01:56
*** thorst_ has joined #openstack-infra01:57
openstackgerritMatthew Treinish proposed openstack-infra/puppet-subunit2sql: Remove inaccurate comment  https://review.openstack.org/28693201:57
mriedemianw: can we get a reply on this old patch? https://review.openstack.org/#/c/215929/01:57
*** Sukhdev has quit IRC01:58
fungiit's my new ad campaign: "got fungi?"01:58
fungii stole it from an antifungal manufacturer01:58
* Clint twitches.01:59
mriedemianw: actually, nevermind, we should probably just ignore that one01:59
*** pvaneck has quit IRC02:00
*** bpokorny has quit IRC02:01
*** gokrokve has quit IRC02:01
ianwmriedem: in general, i still prefer to avoid things that override project defaults, because the project changes and devstack never does and then we're not testing what we think we're testing.  i'd have to context switch back to that issue though02:01
mriedemianw: yeah i've lost it more or less,02:01
mriedemit was for an lxc job02:02
mriedemthomas maddox can pick it up if he wants since he's pursuing that job now02:02
*** jamesmcarthur has quit IRC02:03
*** ashleighfarnham has joined #openstack-infra02:03
*** kzaitsev_mb has quit IRC02:05
*** thorst_ has quit IRC02:05
*** woodster_ has quit IRC02:06
clarkbthis is the longest launch node command I have ever used but it is doing things02:07
fungiheh02:07
fungidue to the alternate az for the cinder volume?02:07
clarkband needing to specify all the cinder volume details (label, mount path, etc)02:07
clarkbalso vexxhost uses 'public' for the public IPs so I think existing nodepool will just work02:08
*** rfolco_ has joined #openstack-infra02:08
*** rfolco_ has quit IRC02:08
clarkbthough launch node isn't saying that it has ssh'd in to do things so may be more work to do there02:09
clarkboh! could be a security group thing I only checked the one account02:10
clarkbthat is dod02:10
mordredclarkb: did we get vexxhosts?02:11
*** rhallisey has quit IRC02:11
fungimordred: get, got and gotted02:11
mordredWOOT!02:11
* mordred hands mnaser a pony02:11
*** Daisy has quit IRC02:12
fungiif clarkb doesn't get waylaid by babies soon, we may have vexxhost in nodepool before the night is out02:12
* mnaser is literally standing behind the very servers you're using02:12
mnaser:p02:12
mordredmnaser: nice!02:12
*** Daisy has joined #openstack-infra02:12
mnaserthanks mordred02:12
mordredmnaser: be careful - he only eats at Joe Beef02:13
clarkbmordred: if you stick around we might just get nodepool using it tonight02:13
mnaserso the cpu scaling stuff is solved, so these bad boyz are going to go from 1.6ghz per core to 2.93ghz and "turbo" at 3.33ghz02:13
clarkbmnaser: wow nice02:14
mordredooh! turbo!02:14
mnaseranyone else think its funny how we had hyperthreading and "turbo" in the pentium 4 times02:14
mnaserand now that's all back02:14
* fungi remembers that button on his computer02:14
mnaser:p02:14
clarkbok I think security groups are fixed now, time to wait for shade launch node to either fail or move on02:14
clarkb(this way I don't have to do manual cleanup)02:14
clarkbI will also need to add a patch for az support to shade launch node02:14
mnaseranyways most of my dc work is done \o/ i'll bbl, if you need anything please let me know02:14
clarkbI mean I wrote it, I need to push it up once it works02:14
fungiall the button did was change the 2x7-segment led on the case to change from 25 to 5002:15
fungior if you were lucky, 33 to 6602:15
mordredI so wanted a 33/6602:15
mnaserbtw, clarkb -- there is much more capacity at ca-ymq-1 but no cinder, less capacity at ca-ymq-2 but faster procs, 10gig internal networking and cinder02:15
mnaserbut we're adding more in ca-ymq-2 as we go02:15
mnaserthe reason we have less capacity is people just end up using it because it's better, but the tests should be able to run in any az02:16
clarkbmnaser: we will only do the explicit az where we need cinder02:16
mnaseryep thats ideal02:16
clarkbmnaser: which is one host today so thats fine02:16
fungithough that does mean all our workers are connecting from az1 to the mirror hosted in az202:16
mnasertheres a private line between them02:17
clarkbfungi: I think its mostly logical and not a big deal02:17
clarkbya that02:17
*** Daisy has quit IRC02:17
mnaseryou should get gige speeds02:17
fungiso if there are serious bandwidth bottlenecks (they'd have to be pretty terrible) we might notice02:17
clarkbif that becomes an issue we can just use a bigger flavor for the mirror host and use local disk02:17
*** ybathia has quit IRC02:17
fungigig-e is way more than sufficient for mirror traffic02:17
clarkbok time out for the failed ssh is forever I am going to manually cleanup and try again02:17
mnaseralso be good if mirrors ran over tenant network ip @ ca-ymq-2 so any workers @ ca-ymq-2 will use 10gig and ca-ymq-1 will be over 1g02:17
*** aeng has quit IRC02:18
mnasertraffic over public runs @ 1gig vs tenant networks @ 10gig02:18
*** camunoz has quit IRC02:18
fungireally the amount of network traffic these generate should be pretty minimal. we try to avoid network use as much as possible in jobs and pre-cache as much as we can on the images02:18
mnaseryep thats ideal02:19
fungiso actually, our image uploads (once a day) are probably more aggregate network transfer than our workers and mirror syncs02:19
clarkbya its mostly just to get an up to date apt package list and a few small python wheels02:19
clarkbshould be small02:19
clarkb(also I may have derped the cinder attachment by getting impatient. that will learn me)02:19
mnaser:P02:19
*** sflanigan has quit IRC02:19
mnaserill bbl, driving home now02:19
fungithanks again mnaser!02:20
*** keedya_ has joined #openstack-infra02:23
clarkbmordred: another openstackclient thing, it tells  you instance name not uuid for volume attachment02:23
*** keedya has quit IRC02:23
clarkband instance names need not be unique02:24
mordredclarkb: that's awesome02:24
mordredclarkb: (you know these days most of the time I just fire up a python repl and import shade :) )02:24
clarkbmordred: I am really trying to use this openstackclient thing because everyone keeps deprecating the old stuff02:25
clarkbmordred: fwiw I think my current thing is hanging on waiting for the volume to attach02:25
clarkbmordred: anything obviously wrong with https://review.openstack.org/#/c/285477/5/launch/shade-launch-node.py ?02:26
clarkbperhaps the az thing is tripping shade up?02:26
mordredclarkb: looking02:27
mordredclarkb: it shouldn't be - but maybe?02:27
mordredI'm pretty sure that's going to be working by id which shouldn't be a problem02:27
clarkbstatus is in-use so I think its all attached02:28
clarkbbut shade-launch-node doesn't seem to be moving forward02:28
mordredclarkb: if you put at the top of the script "shade.simple_logging(debug=True)" (or setup logging module logging) shade will tell you every api call it makes just like nodepool02:28
mordredclarkb: although that won't tell you if it's not seeing the .... OH02:28
mordredclarkb: I think I may know what it is02:28
clarkbmordred: cool I await for magicallness02:28
mordredoh - nope. nevermind.02:29
clarkbboo02:29
mordredyou're not running with caching on02:29
mordredI was thinking cache invalidation logic02:29
mordredbut nope02:29
clarkbat this point I may just wait for timeout to happen to confirm it is indeed the volume thing02:29
*** gokrokve has joined #openstack-infra02:29
mordredclarkb: actually, shade.simple_logging(debug=True, http_debug=True) will have it log the requests stuff02:29
clarkbI will add that for the next run02:30
mordredcool02:30
mordredit'll be chatty and stuff, but should show you if it's the status02:30
clarkbya02:30
*** camunoz has joined #openstack-infra02:31
*** sflanigan has joined #openstack-infra02:31
* Shrews needs a way to disable highlighting on "shade" in the evenings02:31
*** aeng has joined #openstack-infra02:32
ianwclarkb: i have a theory -> https://etherpad.openstack.org/p/raxuploadissues ; it may be hard to follow02:32
clarkboh! shade doesn't have a timeout by default it will just go forever so I should kill this and try again with debugging in place02:33
*** gokrokve has quit IRC02:34
ianwi think shade might create a swiftclient.service.SwiftService object via auth_token ... maybe it expires, but then the client goes and tries to _retry when it gets an auth failure, at that point it doesn't have auth info and hits the "i have no tenant" error02:35
clarkbmordred: ya I think it is spinning on volume_attach wait=True02:36
*** yamamoto has quit IRC02:36
clarkbmordred: the volume attach was the last thing it did and now its waiting for 2 seconds over and over02:36
clarkbmordred: my best theory is az required for that op?02:36
*** tphummel has joined #openstack-infra02:37
openstackgerritMerged openstack-infra/nodepool: Enhance message for image ssh auth  https://review.openstack.org/22256502:38
clarkbmordred: though I am not seeing repeated requests for the volume so there is some weight to your cache theory02:39
clarkbmordred: how do I disable caching?02:39
*** sripriya has joined #openstack-infra02:39
*** Daisy has joined #openstack-infra02:40
clarkbactually I am pretty convinced this is the problem02:40
*** Daisy has quit IRC02:42
*** Daisy has joined #openstack-infra02:42
clarkbya thats it, the cache is only invalidated the first time around02:43
clarkbneeds to be invalidated each time around02:43
clarkblet me see if I can install a local version and test it and confirm it working then push patch to shade02:43
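The two problems clarkb describes (a wait loop with no default timeout, and a cache that is invalidated only on the first pass) can be sketched in isolation. This is not shade's code: `CachedVolumes` and `wait_for_status` are hypothetical names, and the loop only illustrates invalidating on every iteration and giving up after a deadline.

```python
import time


class CachedVolumes:
    """Toy cache standing in for a client-side volume cache (hypothetical)."""

    def __init__(self, fetch):
        self._fetch = fetch
        self._cached = None

    def invalidate(self):
        self._cached = None

    def get(self):
        if self._cached is None:
            self._cached = self._fetch()
        return self._cached


def wait_for_status(cache, wanted, timeout=10.0, interval=2.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Poll until the cached status equals ``wanted`` or the timeout passes."""
    deadline = clock() + timeout
    while True:
        cache.invalidate()  # drop stale state on every pass, not only the first
        if cache.get() == wanted:
            return True
        if clock() >= deadline:
            return False
        sleep(interval)


# Usage: the backend reports "attaching" twice, then "in-use".
states = iter(["attaching", "attaching", "in-use"])
cache = CachedVolumes(lambda: next(states))
attached = wait_for_status(cache, "in-use", sleep=lambda _: None)
```

If `invalidate()` were called only once before the loop, `get()` would keep returning the first "attaching" value and the loop would spin until the timeout, or forever when no timeout is set.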
*** gokrokve has joined #openstack-infra02:45
*** mriedem has quit IRC02:46
*** Daisy has quit IRC02:47
*** rguillebert has quit IRC02:47
*** dimtruck is now known as zz_dimtruck02:48
*** jamesmcarthur has joined #openstack-infra02:49
clarkbmordred: confirmed local patch got past that02:49
clarkbwill push in a few02:49
*** cody-somerville_ is now known as cody-somerville02:52
*** cody-somerville has joined #openstack-infra02:52
*** rlandy has quit IRC02:54
clarkbnow running into problems with device enumeration02:55
*** annegentl_ has joined #openstack-infra02:56
stevemarclarkb: we're more than happy to see osc fixes :)02:58
stevemarclarkb: but i do appreciate you giving osc an honest try02:59
clarkbstevemar: ya I am trying to use it and more likely to fall back on old clients to get things done currently but if I find time will swing around and write some patches03:00
stevemarclarkb: thats more than most people are willing to do :)03:00
clarkbmordred: so on vexxhost /dev/vdb is your config drive but thats where we try to make swap and that explodes03:00
* clarkb makes a hack in make_swap.sh03:01
*** kzaitsev_mb has joined #openstack-infra03:01
*** annegentl_ has quit IRC03:02
*** thorst_ has joined #openstack-infra03:04
clarkband curiously cinder reports it also attached to /dev/vdb03:04
clarkb/dev/vdb is popular tonight03:04
*** chenli has quit IRC03:05
clarkband confirmed that according to blkid /dev/vdb is actually the config drive03:05
clarkbmnaser: ^ not sure if you are still awake but it seems like cinder doesn't agree with the nova instance on what the device name for attached volumes is03:06
clarkbmnaser: I probably won't debug that more tonight since dinner just showed up03:06
*** kzaitsev_mb has quit IRC03:06
*** zz_dimtruck is now known as dimtruck03:08
fungii've heard tell of auto-attach selecting the wrong block device name, and someone suggested you're better off explicitly feeding it one instead03:09
fungii have not substantiated this as a viable option, however03:09
*** alivigni has joined #openstack-infra03:10
*** thorst_ has quit IRC03:10
clarkbfungi: the problem with that is you can conflict, but I guess that's no worse than getting the wrong mapping03:10
fungilbyl i guess03:10
clarkbmnaser: consider this an informal bug report, /dev/vdb is config drive and /dev/vdc is the attached volume but /dev/vdb is what cinder/nova seem to agree on as the attached volume03:11
clarkbfungi: I can add explicit device support and just hardcode to /dev/vdc which should get past this03:11
clarkbwhich I am tempted to do, however it seems like something openstack should be handling for me03:11
*** gokrokve has quit IRC03:12
fungii couldn't agree more03:13
fungiit's a bug for sure03:13
*** annegentl_ has joined #openstack-infra03:13
mnaserhello im here03:15
mnaserthat's an openstack bug clarkb03:15
mnaseryou want to know the next fun part?03:15
mnaserreboot the machine03:15
mnaserconfigdrive goes to /dev/vdc03:15
*** dims has quit IRC03:15
*** sdake has joined #openstack-infra03:15
mnaserthe device goes to /dev/vdb03:15
* mnaser sighs03:15
clarkbmnaser: wow03:16
clarkbmnaser: thats amazing,03:16
clarkbI will go ahead and add explicit device path support03:16
clarkbthingee: ^ how to fix?03:16
mnaser#justopenstackthings03:16
mnaserclarkb: as a workaround, do you need configdrive or is 169.254.169.254 sufficient?03:16
*** gyee has quit IRC03:16
mnaseryou could add --config-drive=false to your boot command and as far as i know, that'll force it to not boot from configdrive03:17
*** dimtruck is now known as zz_dimtruck03:18
clarkbmnaser: we don't need config drive but generally we prefer it because it is more reliable than the metadata service03:18
clarkbmnaser: I think if I explicitly attach at /dev/vdc it will work fine03:18
clarkbabout to test this03:18
mnaseryep, i should file a bug about this in cinder/nova03:18
mnaserran into this issue with some customers in the past03:18
clarkb++ please do03:18
clarkband maybe we can rope thingee into helping with fixing it03:18
mnaseri wonder if its an image problem too03:19
*** chenli has joined #openstack-infra03:20
mnaseraka images+cloud-init should relabel /dev/disk/by-label/config-2 to /dev/vdz or something, ill do some searching03:20
* thingee was having dinner, and catching up03:21
clarkbhrm it seems to be ignoring me when I specify /dev/vdc and using /dev/vdb anyways03:21
*** jamesmcarthur has quit IRC03:21
mnaserclarkb: http://docs.openstack.org/developer/heat/template_guide/openstack.html#OS::Cinder::VolumeAttachment-prop-mountpoint03:21
mnaserunrelated, but kinda related.03:22
clarkbthingee: tl;dr is / is on /dev/vda, config drive is on /dev/vdb, and an attached cinder volume is at /dev/vdc. but cinder and nova both agree that the volume is attached at /dev/vdb03:22
*** weshay has quit IRC03:22
clarkbthingee: so then you attempt to format it and do things and it explodes because the poor vfat thing that is there for config drive is all sad03:22
*** yamamoto has joined #openstack-infra03:22
*** gokrokve has joined #openstack-infra03:23
*** david-lyle has joined #openstack-infra03:24
mnaserclarkb: volumeid "96507b68-5fdb-47a1-8653-f1c1d7d7eed4" is located at /dev/disk/by-id/virtio-96507b68-5fdb-47a1-803:25
mnaserthat may be a much more stable way of determining where it is... until cinder+nova stuff are settled03:25
clarkbmnaser: are you retrieving that via a cinder or nova api?03:26
clarkboh its the uuids I see03:26
openstackgerritYAMAMOTO Takashi proposed openstack-infra/project-config: Tweak regex for doc-only changes  https://review.openstack.org/28695603:26
*** sarob has joined #openstack-infra03:26
mnaseryeah the cinder volume uuid, not sure if you need it inside the machine (but i figure if you know from advance its /dev/vdb .. you should be able to get the id)03:27
clarkbyup I can get the id from where I am querying03:27
clarkbthen use that to determine the device path03:27
*** larainema has quit IRC03:27
mnaseryep, looks like its trimmed only the first 20 characters03:27
*** gokrokve has quit IRC03:27
clarkbI wonder if that is consistent across clouds though03:28
mnaserim checking where that's coming from right now, clarkb :) im curious03:29
clarkbmnaser: also is /dev/disk/by-uuid not going to have the cinder uuid I bet03:29
clarkbmnaser: because that would be even easier03:29
mnaserit doesnt, i checked :(03:29
mnaser"in a world..."03:29
*** larainema has joined #openstack-infra03:29
clarkbok so that might be a good feature request :)03:30
*** annegentl_ has quit IRC03:30
clarkbI think I found the bug in my explicit device path request, if this works I will go with it for now03:30
*** sarob has quit IRC03:31
*** camunoz has quit IRC03:32
*** sflanigan has quit IRC03:33
clarkbthat didn't work, it appears that cinder/nova are really convinced that it is mounted at /dev/vdc03:33
clarkber vdb03:33
thingeeclarkb: so cinder only cares about /dev/disk ,not what the vm has on /dev/sdb03:33
*** zz_dimtruck is now known as dimtruck03:33
*** aeng has quit IRC03:33
*** tphummel has quit IRC03:34
thingeeclarkb: so technically not a consistent thing between cinder and nova.03:34
*** sigmavirus24 is now known as sigmavirus24_awa03:34
clarkbthingee: but the cinder api reports where it is attached as /dev/vdb03:34
clarkbthingee: so it must care03:34
thingeethat's what nova gives it though. :)03:34
thingeecinder is not on the vm03:35
fungiso it's more that nova is making wild accusations about devnames03:35
mnasernova should know there is a configdrive attached at /dev/vdb though.03:35
mnaseralso, i think its supposed to be /dev/disk/by-id/virtio-serial .. where serial is the uuid of the volume, but it seems like it's being trimmed off at 20 characters03:36
mnaserudevadm info /sys/class/block/vdb | grep ID_SERIAL => E: ID_SERIAL=96507b68-5fdb-47a1-803:36
mnaserbut in the libvirt definition, <serial>..</serial> contains the full uuid03:36
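A small sketch of the lookup discussed here, assuming the guest exposes the volume as /dev/disk/by-id/virtio-<serial> and that the serial is the cinder volume UUID cut to its first 20 characters, as observed on this cloud. Whether that holds on other clouds is an open question in this conversation, and `virtio_by_id_path` is a hypothetical helper name.

```python
# Serial length observed in this log (ID_SERIAL was cut at 20 characters).
VIRTIO_SERIAL_LEN = 20


def virtio_by_id_path(volume_id, serial_len=VIRTIO_SERIAL_LEN):
    """Return the expected by-id path for a volume UUID (assumption:
    the serial is the UUID truncated to serial_len characters)."""
    return "/dev/disk/by-id/virtio-" + volume_id[:serial_len]


# The volume UUID quoted earlier in the log.
path = virtio_by_id_path("96507b68-5fdb-47a1-8653-f1c1d7d7eed4")
print(path)
```

Resolving that symlink inside the guest (for example with `os.path.realpath`) would then give the real /dev/vdX node without trusting the device name reported by the API.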
thingeeclarkb, mnaser also cryptsetup which is what nova is doing for encryption of block devices has some limit with the device path. I looked into it at one point, but couldn't figure out the code.03:36
thingeeclarkb, mnaser need that fixed before we can add uuids to diskpath03:36
clarkbprogress! now I fail on pvcreate03:37
clarkb(it isn't installed)03:37
mnaseroh heh03:38
thingeewhen I was working at a block storage vendor, I had to have them trim the iqn down because of the cryptsetup limit :P03:38
mnaserthingee: that sounds awfully fun to do, lol03:38
* thingee looks at nova code03:38
mnaserclarkb: feel free to upload any images.  the ones we use might be a bit weird because we use an agent to config network and stuff (but we're moving to qemu-guest-agent soon)03:38
*** sdake has quit IRC03:39
mnaserso if the vanilla ubuntu ones work for you, you can use them too03:39
clarkbok that is probably my best bet03:39
thingeeoh wow, I forgot the number of "ebs" mentions in this code.03:41
*** kaisers has quit IRC03:42
*** fawadkhaliq has joined #openstack-infra03:43
clarkb'ubuntu-trusty' image upload is in progress03:43
*** kaisers has joined #openstack-infra03:44
*** yuanying has quit IRC03:44
*** amrith is now known as _amrith_03:44
thingeeclarkb: so the image was deployed by image that has these partitions?03:44
fungiclarkb: this may be our first attempt at using a dib worker image for a long-lived server, so experience could involve new and exciting discoveries03:45
fungithingee: instance booted with rootfs, nova attaches configdrive as a second block device, calls to cinderclient (via openstackclient i guess?) create third block device which are then attached via novaclient (openstackclient?) calls03:46
fungithat last step, nova is attaching and attempting to use the same device name it already used when adding the configdrive03:47
clarkbfungi: I am using the canonical published image actually since that is apparently what we did in bluebox and infracloud03:47
*** dimtruck is now known as zz_dimtruck03:47
fungiclarkb: oh, so canonical's ubuntu trusty cloud image, not our dib-built "ubuntu-trusty" image03:47
clarkbthingee: right the image doesn't have a vdb or vdc, what fungi describes is what happens03:47
clarkbfungi: ya03:47
clarkbif my 10mbps upload ever finishes (if I was smart I wouldnt do this from home)03:48
openstackgerritIan Wienand proposed openstack-infra/shade: Also reset swift server client  https://review.openstack.org/28696403:48
*** annegentl_ has joined #openstack-infra03:48
clarkbcomcast's 300gb per month quota is  just a suggestion right?03:48
mnaserclarkb thingee - https://github.com/qemu/qemu/blob/master/hw/block/virtio-blk.c#L561-L569 and https://github.com/qemu/qemu/blob/master/include/standard-headers/linux/virtio_blk.h#L5203:48
ianwclarkb: ^ so this is my best guess here.  essentially a SwiftService object has a client object inside it, so we need to refresh it in the same way03:48
mnaserso qemu truncates serials to first 20 characters by default03:49
*** camunoz has joined #openstack-infra03:49
*** sflanigan has joined #openstack-infra03:49
*** sflanigan has joined #openstack-infra03:49
*** aeng has joined #openstack-infra03:49
mnaserclarkb: just an fyi, the images we use are actually published by ubuntu (we're a certified ubuntu cloud) .. they just have some extra stuff that i know will come in the way for your use case instead of being useful03:49
clarkbmnaser: gotcha03:50
openstackgerritIan Wienand proposed openstack-infra/shade: Also reset swift service object at upload time  https://review.openstack.org/28696403:50
mnasernova has this poopy limitation where it caches images locally based on uuid03:50
mnaserif it didnt do that, we'd have this thing where we have a "master" uuid that always deploys latest ubuntu release from http url as location in glance03:51
openstackgerritIan Wienand proposed openstack-infra/shade: Also reset swift service object at upload time  https://review.openstack.org/28696403:51
mnaserbut that wouldn't work because nova assumes that if uuid doesnt change, the image doesnt change.03:51
*** gokrokve has joined #openstack-infra03:51
clarkbmnaser: apparently pvcreate is no longer included (or maybe was never included) on ubuntu's base images03:52
mnaseri think its not anymore probably because they don't use lvm in the images?03:52
clarkbwith that I think I really need to stop working for that day, I have a viable workaround for the device listing thing03:52
*** sdake has joined #openstack-infra03:53
clarkbI just need to figure out how I want to get lvm tooling on the images03:53
*** esikachev has joined #openstack-infra03:53
clarkbmnaser: that would probably explain it03:53
fungiclarkb: do we try to add the volume before puppeting?03:53
*** jpr has joined #openstack-infra03:53
fungiclarkb: otherwise, the lvm2 package seems like something we should be declaring in puppet anyway03:53
clarkbfungi: yes, because puppet slaps stuff down in the volumes03:54
fungiaha03:54
fungicatch-2203:54
fungigood call, sleep on it03:54
clarkbthis is actually an excellent case for building our own images specifically for this03:54
clarkbalso as an alternative I could just use the bigger flavor which will have bigger local disk and use that instead03:55
* fungi is pretty sure our dib-built ubuntu-trusty images have lvm2 installed03:55
clarkbrather than try to rely on cinder and images and all that working out03:55
*** gokrokve has quit IRC03:55
fungibut also glean, so new unknowns03:55
openstackgerritMerged openstack/requirements: Bump to final Mitaka python-saharaclient  https://review.openstack.org/28650303:57
*** esikachev has quit IRC03:57
thingeeif I had to guess why nova and cinder have /dev/sdb for the attached volume is because of the device being reserved in the second attach not completing when the third volume/create//attach was happening.03:58
*** jpr has quit IRC03:58
*** sflanigan has quit IRC03:58
fungithat would be odd since the configdrive is potentially needed during boot03:58
fungiis our launch-node script attaching the new volume after the instance boots, or adding it in the boot call?03:59
clarkbfungi: it is a separate attach after boot04:00
thingeeah k04:00
*** chenli has quit IRC04:01
mnaserif you created the volume and booted it attached, i suspect you won't run into this issue04:01
*** zz_dimtruck is now known as dimtruck04:01
*** kzaitsev_mb has joined #openstack-infra04:02
mnasernova boot ... --block-device source=volume,id=<..>,dest=volume,shutdown=preserve ...04:03
mnaserthat might prevent the issue from happening04:03
*** aeng has quit IRC04:04
thingeemnaser: isn't that what's happening now since fungi said the config drive is needed for boot? or do you mean attach the third one at boot?04:05
*** sam_wan has quit IRC04:05
mnaserthingee: right now, i believe that a nova boot then nova volume-attach is happening (2 steps) -- what im suggesting is to boot directly with the volume attached04:05
mnaseri suspect this may just boot it with the right order as it will be aware that it has to add the volume04:06
clarkbyes it is two steps right now04:06
thingeemnaser: again clarifying, attach which? config? the third volume?04:06
*** yuanying has joined #openstack-infra04:07
thingeeas I understood there are two attaches happening04:07
mnaserconfigdrive isn't attached, it's always setup as the last device on the instance04:07
*** kzaitsev_mb has quit IRC04:07
thingeeI thought fungi said it's needed for boot though04:07
mnaserso if you boot a machine, local storage gets /dev/vda, configdrive is automatically setup (no manual attach) as /dev/vdb .. once volume-attach is called, it is "appended" to the config so the volume appears at /dev/vdc04:07
mnaserwhen rebooting, it creates the libvirt config with /dev/vda for local, /dev/vdb for attached storage, and /dev/vdc (last one) for configdrive04:08
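The ordering described here can be illustrated with a toy model, assuming the guest simply names virtio disks vda, vdb, vdc in the order they are presented. This is only a sketch of the behaviour reported in this conversation, not nova or libvirt code, and `guest_device_names` is a hypothetical helper.

```python
import string


def guest_device_names(disks):
    """Map each disk to /dev/vdX in the order the guest sees them."""
    return {
        disk: "/dev/vd" + string.ascii_lowercase[index]
        for index, disk in enumerate(disks)
    }


# Boot, then hot-attach: the volume is appended after the configdrive.
after_hot_attach = guest_device_names(["root", "configdrive", "volume"])

# After a reboot: the config is regenerated with the configdrive last.
after_reboot = guest_device_names(["root", "volume", "configdrive"])

print(after_hot_attach["volume"], after_reboot["volume"])
```

The same volume lands on a different device name before and after the reboot, which is why a name-based fstab entry breaks and why a by-id path is the more stable handle.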
thingeehuh weird04:08
fungior at least that last step is what we want, however for some reason even without reboot it's attaching the new cinder volume at /dev/vdb even though the configdrive is already using /dev/vdb04:08
*** thorst_ has joined #openstack-infra04:08
*** sripriya_ has joined #openstack-infra04:09
fungilast step of the first description i mean04:09
fungiafaik this is all happening without a reboot, correct clarkb?04:09
mnaserwhen using it in nova boot, i suspect this will work around it because when nova will deploy the new instance, it'll attach the volume at /dev/vdb and toss configdrive at the last position /dev/vdc04:10
*** sripriya has quit IRC04:10
fungiso nova is indicating it attached the cinder volume at /dev/vdb even though it was already using /dev/vdb for the configdrive04:10
calebbfungi: clarkb says "Yes no reboot, I can't computer, too much hamburger"04:10
fungimmmhamburger04:10
fungialso hey calebb!04:10
calebbhi04:10
mnaserfungi: yes, it's a bit complicated because what happens is that when the disk gets attached, openstack tells libvirt to attach a drive, the newly attached drive will appear as /dev/vdc in the OS (because /dev/vdb already is used by configdrive)04:11
mnaserwe've ran into this a few times with customers, its quite an annoyance04:11
*** annegentl_ has quit IRC04:11
*** sflanigan has joined #openstack-infra04:11
mnaserespecially when they update their fstab files, reboot and are greeted with a machine that refuses to continue to boot because /dev/vdc is no longer the device they thought it once was04:11
fungiyeah, essentially nova is confused about the device name the volume actually ended up at in the guest04:12
mnaserif you ask me, nova should never be involved in device names04:13
mnaserbecause so many different operating systems can deal with this in many different ways04:13
fungiit does seem overreaching to me too, yes04:13
mnaser /dev/vdb makes no sense for a windows server04:13
fungiif anything, it should keep to biosdevnames04:14
fungisince it really has no visibility above the hypervisor level04:14
*** gokrokve has joined #openstack-infra04:15
mnaseryep.  hell libvirt doesnt even know its at /dev/vdb04:15
mnaserlooking at the libvirt config for the config drive.. i notice this04:15
mnaser      <target dev='vdz' bus='virtio'/>04:15
mnaseryet in the os... it's located on /dev/vdc not /dev/vdz04:15
*** thorst_ has quit IRC04:16
*** annegentl_ has joined #openstack-infra04:17
*** aeng has joined #openstack-infra04:17
clarkbok I couldn't not try this real quick and added a step to script to install lvm204:18
clarkbif this works I can sleep a happy person04:18
clarkbif it breaks then tomorrow morning I have my mission04:18
mnasercool clarkb let me know04:19
mnaserhttps://bugs.launchpad.net/ubuntu/+source/libvirt/+bug/103536404:19
openstackLaunchpad bug 1035364 in libvirt (Ubuntu) "guest display wrong disk name than expected" [High,Invalid]04:19
mnaserthis kinda sums it up04:19
clarkbwoot! puppet is running and doing stuff04:25
*** ianw has quit IRC04:25
clarkbthe reboot test on device order will be important04:25
mnaserthe order should be ok post-reboot by my experience but we shall see!04:26
openstackgerritClark Boylan proposed openstack-infra/shade: Invalidate volume cache when waiting for attach  https://review.openstack.org/28697704:27
clarkbmordred: ^ that's one of the patches04:27
*** ianw has joined #openstack-infra04:28
*** Jeffrey4l has joined #openstack-infra04:28
ianwtesting04:30
clarkbianw: testing fix for swift uploads thing?04:30
ianwclarkb: sorry, no, testing that my linode irc bouncer is back04:30
clarkbah04:30
clarkbtest is successful then04:31
ianwclarkb: but i did test with that change, and it worked04:31
ianwbut i haven't figured out how to really expire a token04:31
ianw(yet)04:31
*** chenli has joined #openstack-infra04:32
clarkbhttp://162.253.54.85/ /me does a happy dance04:33
*** cody-somerville has quit IRC04:33
clarkbadding to dns now, this means tomorrow we can nodepool this cloud \o/04:33
mnaserwoo04:34
*** annegentl_ has quit IRC04:34
clarkbhttp://mirror.ca-ymq-1.vexxhost.openstack.org will be a thing as soon as dns update propagates04:34
*** cody-somerville has joined #openstack-infra04:35
mnaserclarkb: cool, we also have a local ubuntu mirror -- http://ubuntu.mirror.vexxhost.com/ -- if you want to use that instead of maintaining one04:36
*** annegentl_ has joined #openstack-infra04:36
clarkbmnaser: we already maintain one that we expose out via afs everywhere04:36
mnaseroh i see04:36
clarkbmnaser: we need to do it for various other cloud regions so its a thing04:36
mnaserhttp://mirror.ca-ymq-1.vexxhost.openstack.org/ is loading for me so that's sweet04:36
clarkbmnaser: the big thing in that mirror is the wheel cache though04:36
fungiat this point the ubuntu mirror is baked into the same afs volumes we use for pypi mirroring, pre-built wheel mirroring, soon rh/epel and npm04:37
fungieasier if we just drop in one that serves it all04:37
mnaseri see, anyways, on that note, i have to head off to bed, long day tomorrow deploying a bunch of clouds04:37
fungithanks again and have a great night!04:38
mnasernight folks, i should be around on irc tomorrow if anything04:38
mnaserno problem04:38
*** oomichi_ has joined #openstack-infra04:40
*** yamahata has joined #openstack-infra04:41
*** gokrokve_ has joined #openstack-infra04:44
*** gokrokve has quit IRC04:47
*** tiswanso has quit IRC04:49
openstackgerritClark Boylan proposed openstack-infra/system-config: Add support to shade-launch-node for cinder attach  https://review.openstack.org/28547704:50
clarkband thats the rest of my patches to make this work04:50
*** annegentl_ has quit IRC04:50
*** links has joined #openstack-infra04:51
openstackgerritIan Wienand proposed openstack-infra/shade: Also reset swift service object at upload time  https://review.openstack.org/28696404:51
clarkband now for the add to nodepool patch04:51
*** camunoz has quit IRC04:52
*** rossella_s has quit IRC04:53
openstackgerritArmando Migliaccio proposed openstack-infra/project-config: Skip long-running jobs on neutronclient doc changes  https://review.openstack.org/28698204:54
*** rossella_s has joined #openstack-infra04:54
*** Jeffrey4l has quit IRC04:55
openstackgerritIan Wienand proposed openstack-infra/shade: Also reset swift service object at upload time  https://review.openstack.org/28696404:56
openstackgerritClark Boylan proposed openstack-infra/project-config: Add vexxhost to nodepool with max servers 1  https://review.openstack.org/28698304:56
*** dchen has quit IRC04:56
clarkbif we get ^ in tonight it will be the fastest new cloud ever04:56
clarkbbut I can't pay attention to it so probably not a great idea04:56
*** Sukhdev has joined #openstack-infra04:57
*** fawadkhaliq has quit IRC04:58
*** pahuang has quit IRC04:58
fungii'm around and will keep an eye on it04:59
fungiso approved04:59
fungimordred: ^ you may also be awake04:59
fungiand i assume jhesketh is awake since this is, like, lunchtime for him or some such05:01
*** ashleighfarnham has quit IRC05:01
*** dimtruck is now known as zz_dimtruck05:02
clarkbfirst thing it will do is upload images05:03
*** kzaitsev_mb has joined #openstack-infra05:03
clarkbthen it should try booting things which should just work because vexxhost uses 'public' which is the magic string nodepool understands for IPs05:04
fungior shade understands05:04
fungii never can keep straight where that logic extraction line was drawn05:04
*** kzaitsev_mb has quit IRC05:08
*** sdake has quit IRC05:09
jheskethfungi: yep, I'm around, happy to keep an eye on it05:09
jheskethclarkb: ^05:09
clarkbjhesketh: cool, I expect worst case that it just doesn't work for some reason which will prevent jobs on it and should be fine05:09
clarkbit should fail safe05:09
*** pahuang has joined #openstack-infra05:10
*** dchen has joined #openstack-infra05:10
jheskethclarkb: yeah, or nodepool gets stuck on a bad cloud like over the weekend05:12
jheskethbut that's easy to rollback05:12
*** thorst_ has joined #openstack-infra05:14
*** dchen has quit IRC05:15
*** amotoki has joined #openstack-infra05:16
openstackgerritMerged openstack-infra/project-config: Add vexxhost to nodepool with max servers 1  https://review.openstack.org/28698305:17
*** salv-orl_ has joined #openstack-infra05:19
clarkbwell thats exciting, looks like openstack ansible inventory may be broken?05:19
clarkband I broke it \o/05:19
clarkbInventory script (/etc/ansible/hosts/openstack) had an execution error: Error fetching server list on openstackci-osic:RegionOne:05:20
clarkbhttps://review.openstack.org/#/c/286881/1 is the fix I think05:21
*** thorst_ has quit IRC05:21
clarkbrechecked it as the one job there timed out05:21
fungiosic keystone endpoint craziness?05:21
clarkbfungi: I got project_name and project_id mixed up :/05:22
*** salv-orlando has quit IRC05:22
fungioh05:22
clarkbhttps://review.openstack.org/#/c/286881/1 sorts it out05:22
fungithat's easy to do, yeah05:22
clarkblooking at the timing though I don't understand why it started when it did05:25
fungicache refresh?05:25
clarkboh maybe ya05:26
*** piet has joined #openstack-infra05:26
clarkboh and that won't apply without manual intervention05:28
clarkbfun05:28
clarkbI will edit ansible-clouds.yaml by hand now05:28
fungiand possibly also delete the cache (or i guess there is no cache update happening yet)05:29
*** harlowja_at_home has joined #openstack-infra05:29
clarkbI see why it broke when it did. I updated hiera to point at new key but puppet still uses old key which results in empty string05:29
clarkbjhesketh: if you can review https://review.openstack.org/#/c/286881/1 that would be plenty helpful in correcting the puppetmaster weirdness05:31
jheskethclarkb: looking05:33
jheskethclarkb: +w05:34
clarkbjhesketh: thank you05:34
*** sridhar_ram1 has joined #openstack-infra05:34
jheskethclarkb: you might need to edit it by hand to get it to apply, but you seem to have done that already05:34
clarkbyup05:35
clarkbprepared to do it again too05:35
*** achanda has quit IRC05:35
clarkbI did confirm that my manual edit got puppet running on the last cron fire05:35
clarkbbut I think it will rebreak things because that patch hasnt merged yet05:35
*** pcaruana has joined #openstack-infra05:36
jheskethclarkb: the inventory is cached for something like 24 hours, so you probably won't need to do it again unless you need to pick up more changes05:36
jheskethie the clouds files will update, but the ansible inventory won't look or care about it05:36
clarkbah right05:36
jheskethwe should possibly get puppet to clear that cache on changes to the cloud files05:37
*** ayoung has quit IRC05:38
openstackgerritgreghaynes proposed openstack/diskimage-builder: Fix spurios = in dib-python readme  https://review.openstack.org/28699105:38
*** abregman has joined #openstack-infra05:38
*** achanda has joined #openstack-infra05:39
*** achanda has quit IRC05:39
openstackgerritMerged openstack-infra/system-config: Use project_name not _id with OSIC  https://review.openstack.org/28688105:41
*** abregman has quit IRC05:42
*** piet has quit IRC05:42
clarkboh right the nodepool builder does not reload its config05:43
* clarkb will restart it05:43
*** abregman has joined #openstack-infra05:44
clarkband with that uploads are uploading05:44
openstackgerritgreghaynes proposed openstack/diskimage-builder: Reorder developer quickstart docs  https://review.openstack.org/28699705:45
clarkbI should work on fixing that05:45
greghaynesclarkb: oops05:45
clarkbgreghaynes: basically have to restart service whenever we change cloud provider details05:45
greghaynesclarkb: it should be easy enough if we can set a timeout on waiting for a gear job05:45
clarkbya05:46
*** jcooley_ has joined #openstack-infra05:47
*** jcooley_ has left #openstack-infra05:47
*** jcooley_ has joined #openstack-infra05:47
*** vikram_ has joined #openstack-infra05:47
vikram_I need some real openstack deployment examples.. Anyone can help?05:48
clarkbvikram_: for deploying clouds? devstack, the openstack puppet modules, openstack ansible modules, chef modules, etc. should all provide examples05:48
*** pcaruana has quit IRC05:49
vikram_clarkb: We need to deploy for one of our customers05:49
vikram_clarkb: Need to know what's industry best practice05:49
openstackgerritgreghaynes proposed openstack/diskimage-builder: Fix cloud-init-disable-resizefs README title  https://review.openstack.org/28699905:49
*** gokrokve has joined #openstack-infra05:50
clarkbvikram_: I think that largely depends on your specific needs. There are definitely some things that people seem to choose consistently though. KVM, rabbitmq, mysql for example05:50
vikram_clarkb: Precisely, what & where services need to run to have better perf and scalability05:50
vikram_clarkb: Any live deployment scenario I can find somewhere for reference05:51
*** rcernin has joined #openstack-infra05:51
clarkbvikram_: http://docs.openstack.org/arch-design/ may help05:52
openstackgerritMerged openstack/requirements: Bump up glance_store to 0.12.0  https://review.openstack.org/28627205:52
*** gokrokve_ has quit IRC05:53
vikram_clarkb: Ok.. Will have a look .. Thanks!05:53
*** gokrokve_ has joined #openstack-infra05:53
openstackgerritMerged openstack/requirements: bump aodhclient upper-constraints  https://review.openstack.org/28626305:53
*** zz_dimtruck is now known as dimtruck05:53
*** esikachev has joined #openstack-infra05:54
*** sarob has joined #openstack-infra05:54
clarkbOpenStackCloudException: Image creation failed: Error finding address for https://image-ca-ymq-1.vexxhost.net/v2/images/1b1e8768-df21-4252-87c2-89ee76d2a7a8/file: Unable to establish connection to https://image-ca-ymq-1.vexxhost.net/v2/images/1b1e8768-df21-4252-87c2-89ee76d2a7a8/file05:55
*** gokrokve has quit IRC05:55
clarkbnot sure if that is consistent yet05:55
clarkbI confirmed that locally I can establish a tcp connection to that host on port 44305:55
*** gokrokve has joined #openstack-infra05:56
openstackgerritgreghaynes proposed openstack/diskimage-builder: Add ability to check for mount leaks  https://review.openstack.org/28001805:56
clarkblooks like it just happened again so we may want to consider setting max-servers to -1 while we figure that out05:57
*** gokrokve_ has quit IRC05:57
*** Sukhdev has quit IRC05:57
*** Sukhdev has joined #openstack-infra05:58
*** esikachev has quit IRC05:58
clarkbbut I really need to not computer anymore. If we don't set max-servers to -1 we will just attempt to upload images in a tight loop over and over assuming the fail is consistent05:59
*** sarob has quit IRC05:59
clarkbmax-servers: -1 will stop uploads05:59
*** achanda has joined #openstack-infra06:00
*** camunoz has joined #openstack-infra06:02
openstackgerritClark Boylan proposed openstack-infra/project-config: Disable image uplaods to vexxhost  https://review.openstack.org/28700306:03
clarkbjhesketh: ^ if you wish to disable it06:03
clarkbI did manage to upload successfully using openstackclient too so dunno what is going on yet06:04
jheskethclarkb: lets disable it for now06:04
*** kzaitsev_mb has joined #openstack-infra06:04
jheskethclarkb: will it still try and build images even when it's disabled though?06:04
clarkbjhesketh: dib will build images because the other clouds need them, nodepool just won't upload the results to vexxhost with max-servers set to -106:05
clarkbif we set max-servers to 0 it will upload images but not boot any test slaves06:05
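The max-servers behaviour described here can be summarised in a few lines. This is a sketch of what is stated in this conversation only (-1 stops uploads, 0 still uploads but boots nothing); `provider_behaviour` is a hypothetical helper and not nodepool code.

```python
def provider_behaviour(max_servers):
    """Summarise what a provider does for a given max-servers value,
    per the description in this log (an assumption, not nodepool's API)."""
    return {
        # Images are still uploaded at 0, but not at -1.
        "upload_images": max_servers >= 0,
        # Test nodes are only booted when at least one server is allowed.
        "boot_nodes": max_servers > 0,
    }


for value in (-1, 0, 1):
    print(value, provider_behaviour(value))
```

Image builds are unaffected in every case, since the other clouds still need the built images.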
*** jaosorior has joined #openstack-infra06:06
jheskethah cool, so we don't need to remove it from the image providers06:06
jhesketh(or the images from the provider)06:06
clarkbcorrect06:06
clarkbI wonder if this is some form of rate limiting as upload for the first image in the list seems to still be going strong06:06
*** kzaitsev_mb has quit IRC06:09
clarkbjhesketh: I actually have a hunch it is a rate limiting strategy since the first upload seems to be going strong and only the others fail06:10
jheskethrate limit to 1 though?06:10
clarkbjhesketh: we can likely confirm that with mnaser tomorrow (I would still go ahead and merge the max-servers: -1 change)06:10
clarkbjhesketh: ya one upload at a time seems like06:11
jheskethclarkb: already +w'd it06:11
jheskeththat'd be a surprising value, but maybe06:11
*** binbincong has quit IRC06:13
*** dimtruck is now known as zz_dimtruck06:17
*** thorst_ has joined #openstack-infra06:19
*** dingyichen has quit IRC06:22
*** gnuoy_ has joined #openstack-infra06:23
*** gongysh has quit IRC06:24
*** binbincong has joined #openstack-infra06:24
*** exploreshaifali has joined #openstack-infra06:25
*** gongysh has joined #openstack-infra06:26
*** thorst_ has quit IRC06:26
*** kushal has joined #openstack-infra06:28
*** gnuoy_ has quit IRC06:30
*** HeOS has joined #openstack-infra06:32
*** amotoki has quit IRC06:33
*** amotoki has joined #openstack-infra06:34
*** armax has quit IRC06:35
*** amotoki has quit IRC06:36
*** krtaylor has quit IRC06:37
*** jamielennox is now known as jamielennox|away06:38
openstackgerritOpenStack Proposal Bot proposed openstack-infra/project-config: Normalize projects.yaml  https://review.openstack.org/28701106:41
*** gnuoy_ has joined #openstack-infra06:43
*** ybathia has joined #openstack-infra06:44
*** gnuoy_ has quit IRC06:44
*** HeOS has quit IRC06:45
openstackgerritMerged openstack-infra/project-config: Disable image uplaods to vexxhost  https://review.openstack.org/28700306:47
*** ybathia has quit IRC06:47
*** zeih has joined #openstack-infra06:47
*** jamielennox|away is now known as jamielennox06:49
*** krtaylor has joined #openstack-infra06:49
*** hichihara has joined #openstack-infra06:49
openstackgerritOpenStack Proposal Bot proposed openstack/requirements: Updated from generate-constraints  https://review.openstack.org/28590106:53
*** sridhar_ram1 is now known as sridhar_ram06:54
*** gokrokve has quit IRC06:56
*** shardy has joined #openstack-infra07:00
*** chenli has quit IRC07:01
*** kzaitsev_mb has joined #openstack-infra07:05
jamespageyolanda, when you're around - could you take a peek at https://review.openstack.org/#/c/28685307:05
jamespagebeisner's CI for charms needs to be able to provide verification feedback...07:06
jamespagethanks :-)07:06
*** scheuran has joined #openstack-infra07:06
*** e0ne has joined #openstack-infra07:06
*** jamielennox is now known as jamielennox|away07:08
*** kzaitsev_mb has quit IRC07:10
*** gildub has quit IRC07:10
*** nmagnezi has joined #openstack-infra07:12
*** amotoki has joined #openstack-infra07:14
*** oomichi_ has quit IRC07:16
yolandahi jamespage, going to take a look07:18
yolandaah, AJaeger was faster07:19
*** jtomasek has joined #openstack-infra07:19
jamespagelol - thanks anyway yolanda!07:22
*** thorst_ has joined #openstack-infra07:23
*** maishsk has joined #openstack-infra07:24
yolandanext time :)07:26
*** thorst_ has quit IRC07:31
*** pahuang has quit IRC07:33
*** amotoki has quit IRC07:35
*** HeOS has joined #openstack-infra07:36
*** [1]Thelo has joined #openstack-infra07:37
openstackgerritMerged openstack-infra/project-config: add release notes jobs for solum  https://review.openstack.org/24330107:38
*** Thelo has quit IRC07:40
*** [1]Thelo is now known as Thelo07:40
openstackgerritMerged openstack-infra/project-config: Enable verified label for charms  https://review.openstack.org/28685307:42
*** dingyichen has joined #openstack-infra07:42
*** amotoki has joined #openstack-infra07:43
AJaegerSorry, yolanda ;)07:45
AJaegerlifeless: yes, it's basically just a tox change; see glance, cinder or nova for how it was done there.07:46
*** mikelk has joined #openstack-infra07:47
*** amotoki has quit IRC07:48
*** esikachev has joined #openstack-infra07:50
*** harlowja_at_home has quit IRC07:55
openstackgerritMerged openstack-infra/project-config: Normalize projects.yaml  https://review.openstack.org/28701107:56
*** kingia_ has quit IRC07:56
*** kingia has joined #openstack-infra07:56
*** markvoelker has quit IRC07:58
*** jcoufal has joined #openstack-infra07:58
*** e0ne has quit IRC07:59
*** pcaruana has joined #openstack-infra08:02
*** achanda has quit IRC08:03
*** hichihara has quit IRC08:04
lifelessAJaeger: so I'm not sure what you're waiting on me to email ?08:05
*** kzaitsev_mb has joined #openstack-infra08:06
*** exploreshaifali has quit IRC08:06
*** mrmartin has joined #openstack-infra08:07
AJaegerlifeless: I think we should write an email to tell projects how to move forward with constraints - with just updating tox.ini.08:07
openstackgerritGiulio Fidente proposed openstack-infra/tripleo-ci: Use netiso in the ha job  https://review.openstack.org/27342408:07
AJaegerThat's what we discussed with sdague last week or so - and concluded with postponing it to first figure out whether everything works08:07
*** achanda has joined #openstack-infra08:07
AJaegerOr what do you see as best next step forward?08:08
*** Sukhdev has quit IRC08:09
*** vgridnev has joined #openstack-infra08:09
*** sridhar_ram has quit IRC08:10
*** Sukhdev has joined #openstack-infra08:10
*** kzaitsev_mb has quit IRC08:11
*** zz_dimtruck is now known as dimtruck08:11
*** maishsk_ has joined #openstack-infra08:12
*** Sukhdev has quit IRC08:13
*** maishsk has quit IRC08:13
*** maishsk_ is now known as maishsk08:13
*** matrohon has joined #openstack-infra08:14
jamespagebeisner, the swift-proxy reverse-mirror to bzr should be fixed now08:15
jamespageit's a workaround..08:15
*** [HeOS] has joined #openstack-infra08:15
lifelessAJaeger: I was encouraging Nakato to write such a mail, but then the style changed with the discussion w/nova about target namings08:15
*** dingyichen has quit IRC08:15
openstackgerritThomas Herve proposed openstack-infra/devstack-gate: Remove double timestamp from console logs  https://review.openstack.org/28613608:16
AJaegerlifeless: I see. Will she follow up now that this is settled?08:17
lifelessAJaeger: let's ask :)08:18
lifelessNakato: ^08:18
AJaegerlifeless, Nakato: both of you did a great job getting us to this point, thanks!08:18
*** HeOS has quit IRC08:18
*** achanda has quit IRC08:18
*** dingyichen has joined #openstack-infra08:19
*** korzen has joined #openstack-infra08:19
*** sarob has joined #openstack-infra08:21
*** esp has joined #openstack-infra08:25
*** jamielennox|away is now known as jamielennox08:25
*** vincentll has joined #openstack-infra08:25
*** dingyichen has quit IRC08:25
*** tcammann_ has joined #openstack-infra08:25
*** jlanoux has joined #openstack-infra08:26
AJaegerlifeless: can you discuss with her later, please? Getting the three of us together at one time seems to be hard with time zones ;/08:26
*** sarob has quit IRC08:27
*** NobodyCa1 has joined #openstack-infra08:28
*** thorst_ has joined #openstack-infra08:29
*** lxsli0 has joined #openstack-infra08:32
*** esp has quit IRC08:32
*** bastelfreak has joined #openstack-infra08:32
*** tcammann has joined #openstack-infra08:32
*** NobodyCa1 has quit IRC08:33
*** NobodyCa1 has joined #openstack-infra08:34
*** arxcruz has joined #openstack-infra08:35
*** thorst_ has quit IRC08:35
*** Daisy has joined #openstack-infra08:36
*** kushal has quit IRC08:36
*** dizquierdo has joined #openstack-infra08:37
*** lxsli0 has quit IRC08:38
*** tcammann_ has quit IRC08:38
*** esikachev has quit IRC08:38
openstackgerritStefan Andres proposed openstack-infra/shade: Fix lookup for network parameter in neutron_available_floating_ips  https://review.openstack.org/28017808:38
*** tcammann has quit IRC08:39
*** chenli has joined #openstack-infra08:39
*** NobodyCa1 has quit IRC08:39
*** ifarkas has joined #openstack-infra08:40
*** flepied1 has quit IRC08:48
*** peterlisak has quit IRC08:48
*** onovy has quit IRC08:48
*** exploreshaifali has joined #openstack-infra08:48
*** fhubik has joined #openstack-infra08:49
*** esikachev has joined #openstack-infra08:50
*** hashar has joined #openstack-infra08:50
*** ihrachys has joined #openstack-infra08:51
*** rossella_s has quit IRC08:54
*** dimtruck is now known as zz_dimtruck08:54
*** yaume has joined #openstack-infra08:54
*** rossella_s has joined #openstack-infra08:54
*** markvoelker has joined #openstack-infra08:58
openstackgerritJF Taltavull proposed openstack-infra/project-config: Add irc channel for the i18n french team  https://review.openstack.org/28704808:59
openstackgerritBogdan Dobrelya proposed openstack-infra/project-config: Adjust acls for fuel-noop-fixtures  https://review.openstack.org/28610909:00
*** zeih has quit IRC09:04
openstackgerritMerged openstack-infra/project-config: Skip long-running jobs on neutronclient doc changes  https://review.openstack.org/28698209:04
*** gokrokve has joined #openstack-infra09:05
*** asettle has joined #openstack-infra09:05
openstackgerritMerged openstack-infra/project-config: Skip dsvm jobs on release note only tempest changes  https://review.openstack.org/28683109:05
openstackgerritMerged openstack-infra/project-config: update django_openstack_auth to use publish-to-pypi jobs  https://review.openstack.org/28674709:05
*** lucas-dinner is now known as lucasagomes09:06
*** kzaitsev_mb has joined #openstack-infra09:07
*** daemontool_ has joined #openstack-infra09:08
*** daemontool_ is now known as daemontool09:08
*** gokrokve has quit IRC09:09
*** derekh has joined #openstack-infra09:10
*** Daisy has quit IRC09:11
*** daemontool__ has quit IRC09:11
*** Daisy has joined #openstack-infra09:11
*** esker has quit IRC09:11
*** esker has joined #openstack-infra09:12
*** kzaitsev_mb has quit IRC09:12
*** arxcruz has quit IRC09:13
*** chenli has quit IRC09:14
*** chenli has joined #openstack-infra09:15
*** Daisy has quit IRC09:16
openstackgerritAndreas Jaeger proposed openstack-infra/project-config: Sort projects in big skip list  https://review.openstack.org/28652709:18
*** kzaitsev_mb has joined #openstack-infra09:18
*** zhurong has quit IRC09:19
AJaegeryolanda: could you review the change above (286527) and also 285356 as time permits, please?09:19
*** jistr has joined #openstack-infra09:19
yolandasure09:19
*** zhurong has joined #openstack-infra09:20
*** vgridnev has quit IRC09:20
AJaegerthanks09:21
*** _degorenko|afk is now known as degorenko09:21
*** gongysh has quit IRC09:21
*** BobBall has joined #openstack-infra09:22
*** flepied has joined #openstack-infra09:22
*** maishsk has quit IRC09:26
*** arxcruz has joined #openstack-infra09:27
*** dizquierdo has quit IRC09:28
*** e0ne has joined #openstack-infra09:28
*** chenli has quit IRC09:29
openstackgerritC.J. Collier proposed openstack-infra/gerritbot: Improve gerritbot error handling  https://review.openstack.org/28705609:29
tonybPolicyish question ... I need to test a devstack plugin on the experimental queue.  does that plugin need to be on git.o.o? or can it be on github for now?09:30
*** maishsk has joined #openstack-infra09:30
*** vgridnev has joined #openstack-infra09:30
tonybclearly if it's on github that adds an additional failure path but is that a problem?09:31
*** Daisy has joined #openstack-infra09:31
*** [HeOS] is now known as HeOS09:31
*** sorantis has joined #openstack-infra09:32
*** x00350071 is now known as xiangxinyong09:33
*** thorst_ has joined #openstack-infra09:33
*** pblaho has joined #openstack-infra09:33
AJaegertonyb: if you use export DEVSTACK_LOCAL_CONFIG="enable_plugin networking-midonet git://git.openstack.org/openstack/networking-midonet"09:33
AJaegernote the comment before in jenkins/jobs/networking-midonet.yaml:09:33
*** dtardivel has joined #openstack-infra09:33
AJaeger"caches in nodepool, however make it a valid url for documentation purposes."09:34
AJaegerAnd we only cache git.openstack.org repositories...09:34
AJaegerSo, short answer: Needs to be on git.o.o09:34
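AJaeger's rule of thumb (only git.openstack.org repositories are cached on the nodepool images) can be expressed as a quick check. This is a minimal sketch, assuming the usual `enable_plugin <name> <url> [ref]` line format; the helper name `plugin_repo_is_cached` is hypothetical and nothing here is part of devstack or devstack-gate.

```python
from urllib.parse import urlparse

# Per the chat, only repositories on this host are cached in the images.
CACHED_HOST = "git.openstack.org"


def plugin_repo_is_cached(enable_plugin_line):
    """Return True if an enable_plugin line points at the cached git host."""
    parts = enable_plugin_line.split()
    if len(parts) < 3 or parts[0] != "enable_plugin":
        raise ValueError("expected: enable_plugin <name> <url> [ref]")
    return urlparse(parts[2]).hostname == CACHED_HOST


line = ("enable_plugin networking-midonet "
        "git://git.openstack.org/openstack/networking-midonet")
print(plugin_repo_is_cached(line))
```

A plugin URL on any other host would make the check return False, which matches the "needs to be on git.o.o" answer.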
tonybAJaeger: Okay thanks.09:34
openstackgerritFausto Marzi proposed openstack-infra/project-config: Dedicated specs repo for Freezer  https://review.openstack.org/28030709:35
tonybI knew about the cache in the nodepool image but I didn't think they were that tied.09:35
* tonyb goes and does the new project dance.09:35
*** e0ne has quit IRC09:37
*** thorst_ has quit IRC09:40
*** exploreshaifali has quit IRC09:41
*** zhurong has quit IRC09:44
*** kashyap has joined #openstack-infra09:44
*** fabio_ has joined #openstack-infra09:45
*** kzaitsev_mb has quit IRC09:45
*** lane_kong is now known as lynn09:45
kashyapHeya, both the Zuul urls mentioned in the "Before You Start" section are 404 - http://docs.openstack.org/infra/manual/creators.html09:45
*** Qiming has quit IRC09:45
*** kzaitsev_mb has joined #openstack-infra09:45
*** lynn is now known as Guest2997009:46
*** Guest29970 is now known as lynn_kong09:46
AJaegerkashyap: Oops, my fault ;( Do you speak RST and want to send a patch for infra-manual?09:47
kashyapAJaeger: Sure, rST is just all indentation and a tiny bit of syntax :-)09:48
*** vincentll has quit IRC09:48
* kashyap thought I'd begin my day with debugging a Ceph problem...09:48
* AJaeger screwed up that tiny bit ;)09:49
AJaegerkashyap: thanks09:49
*** sheeprine has quit IRC09:49
*** sheeprine has joined #openstack-infra09:50
*** maishsk has quit IRC09:53
*** maishsk has joined #openstack-infra09:53
*** fhubik is now known as fhubik_brb09:55
openstackgerritMerged openstack-infra/irc-meetings: Remove openstack-neutron-release from acceptable channels list  https://review.openstack.org/27818009:57
*** _nadya_ has joined #openstack-infra09:58
*** vgridnev has quit IRC10:00
*** e0ne has joined #openstack-infra10:00
*** vgridnev has joined #openstack-infra10:00
markus_zZara: Are you around?10:01
*** fabio_ has quit IRC10:01
*** electrofelix has joined #openstack-infra10:02
*** exploreshaifali has joined #openstack-infra10:02
*** maishsk has quit IRC10:02
*** maishsk has joined #openstack-infra10:05
*** cznewt has quit IRC10:06
*** kushal has joined #openstack-infra10:07
jamespageyolanda, AJaeger - hey - can i get initial access to the charms-ci group please10:08
yolandajamespage, i'll add you10:09
*** kushal has quit IRC10:09
*** onovy has joined #openstack-infra10:10
yolandadone10:10
*** kushal has joined #openstack-infra10:11
*** lynn_kong is now known as Larrie10:12
*** Larrie is now known as larrie10:12
jamespageyolanda, thankyou10:12
*** peterlisak has joined #openstack-infra10:12
*** Jeffrey4l has joined #openstack-infra10:13
jamespagebeisner, uosci is now in the right group10:13
*** kzaitsev_mb has quit IRC10:14
*** vincentll has joined #openstack-infra10:14
*** sarob has joined #openstack-infra10:16
*** cznewt has joined #openstack-infra10:17
*** amotoki has joined #openstack-infra10:18
*** rguillebert has joined #openstack-infra10:18
*** sarob has quit IRC10:20
*** Daisy has quit IRC10:20
*** fhubik_brb is now known as fhubik10:21
*** Daisy has joined #openstack-infra10:21
*** dtantsur|afk is now known as dtantsur10:25
*** Daisy has quit IRC10:25
openstackgerritKashyap Chamarthy proposed openstack-infra/infra-manual: creators.rst: Fix broken internel hyperlinks  https://review.openstack.org/28708310:25
kashyapAJaeger: ^ Hope I got the syntax correct this time :-)10:26
*** larrie is now known as larrie_kong10:28
*** vgridnev has quit IRC10:31
*** vgridnev has joined #openstack-infra10:32
*** jordanP has joined #openstack-infra10:32
lucasagomeshi all, there's a devstack-gate change which will allow ironic to start testing its ipmitool drivers in the gate (it's only 3 LOC); if you have time please take a look https://review.openstack.org/#/c/284036/ thank you10:34
korzenhi anteaya, can you help merging Neutron Grenade multinode DVR experimental job? https://review.openstack.org/#/c/25021510:36
*** thorst_ has joined #openstack-infra10:38
Zaramarkus_z: I am now!10:42
markus_zZara: I'm going to lunch in a few minutes, are you free in 45 minutes? We could shortly discuss the requirements if you want.10:43
*** esikache1 has joined #openstack-infra10:43
*** esikachev has quit IRC10:43
*** _nadya__ has joined #openstack-infra10:43
*** _nadya_ has quit IRC10:43
dguitarbiteAJaeger: How much more time should it take for this patch to get merged? https://review.openstack.org/#/c/281721/10:44
Zaramarkus_z: ah, cool, works for me!10:45
markus_zZara: great, see you then10:45
*** thorst_ has quit IRC10:45
*** ihrachys has quit IRC10:47
openstackgerritValeriy Ponomaryov proposed openstack-infra/project-config: Move Manila experimental jobs to check pipeline  https://review.openstack.org/28709910:49
*** Qiming has joined #openstack-infra10:49
TheloHi all, in nodepool each time I try to build an image (via command line) I get the following logs, and then it gets stuck ( http://paste.openstack.org/show/488926/ ) does anyone know why?10:49
*** d0ugal has quit IRC10:50
*** d0ugal has joined #openstack-infra10:50
*** d0ugal has quit IRC10:50
*** d0ugal has joined #openstack-infra10:50
*** yamahata has quit IRC10:51
*** dims_ has joined #openstack-infra10:51
*** dizquierdo has joined #openstack-infra10:52
*** hichihara has joined #openstack-infra10:54
openstackgerritThomas Bechtold proposed openstack-infra/project-config: Publish renderspec docs  https://review.openstack.org/28710510:54
*** |-paul-| has joined #openstack-infra10:55
*** fhubik is now known as fhubik_brb10:57
*** binbincong has quit IRC10:57
*** esikache1 has quit IRC11:00
*** sdague has joined #openstack-infra11:02
*** esikache1 has joined #openstack-infra11:04
*** fhubik_brb is now known as fhubik11:05
*** sorantis has quit IRC11:05
*** gildub has joined #openstack-infra11:07
*** Daisy has joined #openstack-infra11:08
openstackgerritIvan Kolodyazhny proposed openstack-infra/project-config: Add gate job for Cinder Functional tests  https://review.openstack.org/28711511:09
*** _amrith_ is now known as amrith11:10
*** _nadya__ has quit IRC11:10
*** binbincong has joined #openstack-infra11:10
*** daemontool_ has joined #openstack-infra11:15
*** sorantis has joined #openstack-infra11:17
*** daemontool has quit IRC11:17
openstackgerritDavid Caro proposed openstack-infra/jenkins-job-builder: Adding the build-discarder property  https://review.openstack.org/27395211:18
AJaegerdguitarbite: I hope I remember to discuss it later today - feel free to do this yourself as well.11:19
*** hashar has quit IRC11:19
*** salv-orlando has joined #openstack-infra11:19
*** kzaitsev_mb has joined #openstack-infra11:19
*** Daisy has quit IRC11:21
*** Daisy has joined #openstack-infra11:21
*** salv-orl_ has quit IRC11:22
*** Daisy has quit IRC11:26
*** sambetts|afk is now known as sambetts11:27
*** ldnunes has joined #openstack-infra11:32
*** vikram_ has quit IRC11:37
markus_zZara: ready for takeoff?11:38
Zaramarkus_z: sure :)11:38
*** mkj_wk_ is now known as mkj11:38
*** daemontool_ has quit IRC11:39
ZaraI've avoided commenting on the etherpad directly in case that was distracting, but we can do that if useful11:39
markus_zcool cool, so, you wanna ask questions or should I just go through the list and explain it?11:39
*** daemontool_ has joined #openstack-infra11:39
markus_zZara: Let's just add it directly to the etherpad11:39
Zaraokay :)11:39
*** ociuhandu has joined #openstack-infra11:39
*** _nadya_ has joined #openstack-infra11:40
*** fhubik is now known as fhubik_brb11:41
*** yamamoto has quit IRC11:41
*** kzaitsev_mb has quit IRC11:42
Zarahm, I'm not sure whether our syntax highlighting is sufficient. you can put a markdown code block in, but I'm not sure how sophisticated it is11:42
markus_zZara: Do you have a storyboard item which shows all its glory (comments, ...)?11:42
*** cznewt has quit IRC11:43
*** thorst_ has joined #openstack-infra11:43
*** zul has quit IRC11:44
*** resker has joined #openstack-infra11:44
Zaramarkus_z: this one might be useful: https://storyboard.openstack.org/#!/story/200040211:45
*** _nadya_ has quit IRC11:45
*** zul has joined #openstack-infra11:45
markus_zZara: Events and Comments go in the same "queue"?11:46
*** esker has quit IRC11:47
*** gildub has quit IRC11:47
Zaramarkus_z: yeah, I think the idea was to keep all the history together, but it's messy. we definitely want to be able to better filter what the user sees (atm they have to change their global events preferences to filter by just comments, which is annoying)11:49
*** cznewt has joined #openstack-infra11:50
Zaraalso, for action history, you can see the list of timeline events, but we don't have a way to go 'show story x as it was on y date'11:50
*** thorst_ has quit IRC11:51
* SotK does some vague testing and finds that storyboard seems to wrap code blocks to fit on the screen, but there is no hard limit11:51
Zaraso that's why I've gone 'I *think* we have this', because people can mean different things by 'history' :)11:51
SotKI'm happy to make it not break and enforce horizontal scrolling, if that is preferred11:51
*** ihrachys has joined #openstack-infra11:52
markus_zZara: The mix of human comments and emitted events is confusing to me. A discussion in this story is hard this way.11:54
*** _nadya_ has joined #openstack-infra11:55
AJaegersdague, could you review two project-config changes of mine later, please? 286527 and 28535611:56
*** doug-fish has joined #openstack-infra11:56
Zarayeah, I want a user to be able to click 'show only comments' (and maybe default to that, while storing the whole list of events somewhere). I think the idea was to put events in the list so the history was clear  (eg: if the story description updated between comments, people could be referring to different things)11:58
sdagueAJaeger: can you provide full urls, it's faster to open from my irc client?11:59
Zarawe have plans to go back to timeline events and improve them, though right now I don't know exactly when that will be. I suspect early in the Newton cycle, though that might be pessimistic of me.12:00
markus_zZara: I really like the boards and worklists12:00
Zara\o/ SotK^12:00
SotK:D12:01
Zara(I'm also planning to look at the *layout* of timeline events, hopefully next week, but there won't be any substantial changes. may make things easier to follow, though)12:02
*** kzaitsev_mb has joined #openstack-infra12:02
Zarabut I make no promises! :D12:02
Zaraanyway, thank you, that's good to hear :)12:02
markus_zZara: No hurry.12:03
*** sdake has joined #openstack-infra12:03
Zara:) it's useful to see what a big project cares about; we haven't paid much attention to comments (recently, anyway) because we're small, but I can see the need for a project the size of Nova.12:04
markus_zZara: As soon as a discussion (forum-like) is possible within a story, I would like to use sb instead of lp. But that's the view from a developer, I'm not involved in release mgmt. and things.12:04
openstackgerritJames Slagle proposed openstack-infra/tripleo-ci: Use swapfile environment in CI  https://review.openstack.org/28679312:04
*** gokrokve has joined #openstack-infra12:04
markus_zZara: The tasks can be set to multiple different projects, right? We have sometimes bugs which affect multiple projects at once.12:05
*** aysyd has joined #openstack-infra12:05
Zaramarkus_z: understood. :) and yes, tasks can be set to multiple projects.12:06
markus_zI like the response times. very fast12:07
openstackgerritMerged openstack-infra/project-config: Adjust acls for fuel-noop-fixtures  https://review.openstack.org/28610912:08
markus_zZara: I haven't checked the REST API sb provides, so no feedback from me there12:08
*** gokrokve has quit IRC12:09
markus_zZara: well, cool, do you have any open questions for me?12:09
Zaramarkus_z: I think SotK knows more about that than I do, tbh, so he might have something to add on that. (or might not!)12:10
Zaraer, the REST API that is12:10
* SotK goes to write things in the API section of the etherpad12:11
Zara\o/12:11
Zaramarkus_z: I don't think so, but I'm pretty sleepy atm and might think of some later12:11
*** sdake has quit IRC12:13
*** kashyap has left #openstack-infra12:15
*** amrith is now known as _amrith_12:16
markus_zZara: SotK: ok, cool, maybe we can talk at the summit, see you12:17
SotKmarkus_z: thanks for the comments :)12:17
Zaracool, thanks again, those requirements are really helpful :)12:17
AJaegersdague: https://review.openstack.org/286527 and https://review.openstack.org/285356 are full URLs12:21
*** asettle has quit IRC12:24
*** yamamoto has joined #openstack-infra12:24
*** yamamoto_ has joined #openstack-infra12:25
*** erlon has joined #openstack-infra12:25
*** fhubik_brb is now known as fhubik12:26
*** Liuqing has joined #openstack-infra12:27
openstackgerritVincent proposed openstack-infra/project-config: add Synergy project  https://review.openstack.org/28315712:28
*** yamamoto has quit IRC12:29
*** kiran-r has joined #openstack-infra12:29
*** |-paul-| has quit IRC12:36
dguitarbiteAJaeger: Ok, I'll try to bring it up :)12:37
openstackgerritMerged openstack-infra/project-config: Revert "Remove check requirements from Kingbird"  https://review.openstack.org/28535612:37
*** jaosorior has quit IRC12:37
openstackgerritMerged openstack-infra/project-config: Sort projects in big skip list  https://review.openstack.org/28652712:37
*** jaosorior has joined #openstack-infra12:38
*** asettle has joined #openstack-infra12:39
*** hichihara has quit IRC12:39
*** jaosorior has quit IRC12:39
*** jaosorior has joined #openstack-infra12:40
*** claudiub has joined #openstack-infra12:40
*** esker has joined #openstack-infra12:42
openstackgerrityolanda.robla proposed openstack/diskimage-builder: Generate fedora-atomic images using dib  https://review.openstack.org/28716712:42
*** Daisy has joined #openstack-infra12:43
*** yamamoto_ has quit IRC12:43
*** resker has quit IRC12:44
*** rfolco has joined #openstack-infra12:44
esikache1hey, folks! can you merge? https://review.openstack.org/#/c/284693/12:44
*** edmondsw has joined #openstack-infra12:46
openstackgerritMerged openstack-infra/project-config: Move Manila experimental jobs to check pipeline  https://review.openstack.org/28709912:47
*** thorst_ has joined #openstack-infra12:48
*** markvoelker has quit IRC12:48
*** markvoelker has joined #openstack-infra12:49
*** lucasagomes is now known as lucas-hungry12:51
lucas-hungryhi all, there's a devstack-gate change which will allow ironic to start testing its ipmitool drivers in the gate (it's only 3 LOC); if you have time please take a look https://review.openstack.org/#/c/284036/ thank you12:52
*** fhubik is now known as fhubik_brb12:52
*** rossella_s has quit IRC12:53
*** rossella_s has joined #openstack-infra12:54
*** fawadkhaliq has joined #openstack-infra12:54
AJaegersdague: could you review lucas-hungry's change, please ? ^12:54
*** yamamoto has joined #openstack-infra12:55
*** baoli has joined #openstack-infra12:56
*** Qiming has quit IRC12:59
*** links has quit IRC12:59
*** dprince has joined #openstack-infra13:00
*** _nadya_ has quit IRC13:01
sdaguelucas-hungry: is there a thing above it that demonstrates it working13:03
*** yaume_ has joined #openstack-infra13:03
*** yaume has quit IRC13:04
openstackgerritFausto Marzi proposed openstack-infra/project-config: Dedicated specs repo for Freezer  https://review.openstack.org/28030713:05
*** rhallisey has joined #openstack-infra13:06
*** hashar has joined #openstack-infra13:07
*** Daisy has quit IRC13:08
*** Daisy has joined #openstack-infra13:08
*** Jeffrey4l has quit IRC13:12
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Add mirror server  https://review.openstack.org/23841413:12
*** baoli has quit IRC13:12
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Used the mirror server for the centos image and git repos  https://review.openstack.org/28525713:12
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Use instack-qcow2 from cache if available and appropriate  https://review.openstack.org/28525913:12
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: Upload the instack qcow image after periodic jobs  https://review.openstack.org/28525813:12
openstackgerritDerek Higgins proposed openstack-infra/tripleo-ci: [NO MERGY] Test a fake periodic job  https://review.openstack.org/22978913:12
*** yaume_ has quit IRC13:12
*** Daisy has quit IRC13:13
*** yaume has joined #openstack-infra13:13
*** Jeffrey4l has joined #openstack-infra13:15
*** dkranz has joined #openstack-infra13:15
*** baoli has joined #openstack-infra13:19
*** fawadkhaliq has quit IRC13:21
*** fhubik_brb is now known as fhubik13:22
*** vgridnev has quit IRC13:23
*** vgridnev has joined #openstack-infra13:23
*** kgiusti has joined #openstack-infra13:24
openstackgerritIhar Hrachyshka proposed openstack-infra/project-config: Make gate-grenade-dsvm-neutron-multinode voting  https://review.openstack.org/28718413:24
*** baoli has quit IRC13:25
*** yaume has quit IRC13:27
*** baoli has joined #openstack-infra13:27
*** yaume has joined #openstack-infra13:27
*** yamamoto has quit IRC13:28
*** baoli has quit IRC13:29
*** zhurong has joined #openstack-infra13:30
*** annegentl_ has joined #openstack-infra13:30
*** pradk has quit IRC13:32
*** pleia2 has quit IRC13:33
*** yaume has quit IRC13:33
*** yaume has joined #openstack-infra13:34
openstackgerritOleg Gelbukh proposed openstack-infra/project-config: Add project 'fuel-tuning-box'  https://review.openstack.org/28613713:35
*** vgridnev has quit IRC13:35
*** baoli has joined #openstack-infra13:37
*** vivekd has joined #openstack-infra13:37
*** piet has joined #openstack-infra13:38
*** yamamoto has joined #openstack-infra13:39
*** yamamoto has quit IRC13:39
*** yaume has quit IRC13:40
*** yaume has joined #openstack-infra13:41
openstackgerritAntoine Musso proposed openstack-infra/jenkins-job-builder: Complete support of Yaml Axis Plugin  https://review.openstack.org/28669013:41
*** esker has quit IRC13:43
*** esker has joined #openstack-infra13:44
openstackgerritMerged openstack-infra/shade: Add ability to pass just filename to create_image  https://review.openstack.org/28290713:44
*** jed56 has joined #openstack-infra13:46
*** vgridnev has joined #openstack-infra13:48
*** yaume has quit IRC13:50
*** yaume has joined #openstack-infra13:51
*** _nadya_ has joined #openstack-infra13:51
mordredShrews: yah. I agree on getting shade released13:51
Shrewsmordred: if we're finally doing that, there are some fixes i have up that should go in13:52
Shrewsand some others from infra folks we should review13:53
*** yamamoto has joined #openstack-infra13:54
*** rlandy has joined #openstack-infra13:54
*** doug-fish has quit IRC13:55
*** yaume has quit IRC13:55
*** yaume has joined #openstack-infra13:56
*** amitgandhinz has joined #openstack-infra13:57
* mordred goes lookie13:57
Shrewsi just approved clarkb's cache fix13:57
*** dizquierdo has quit IRC13:58
Shrewswe should check for that cache bug in other areas :(13:58
*** daemontool_ is now known as daemontool13:59
mordredShrews: ++13:59
mordredShrews: also, https://review.openstack.org/#/c/286964 looks like one we should land14:00
*** yaume has quit IRC14:00
mordredShrews: and makes me think we should also pass username/password/etc down to SwiftService like we do for swift connection now14:00
mordredlemme check their source and see if that works14:00
dmsimardAny chance I could get a final +2+A on https://review.openstack.org/#/c/286579/ ? It's to wrap up the packstack integration jobs. We can make them voting shortly afterwards.14:00
*** yaume has joined #openstack-infra14:01
*** sorantis_ has joined #openstack-infra14:01
*** yamamoto has quit IRC14:02
*** lucas-hungry is now known as lucasagomes14:03
*** sorantis has quit IRC14:04
*** sorantis_ is now known as sorantis14:04
lucasagomessdague, hi there, sorry I was having lunch... Anything that demonstrates ipmitool drivers running with devstack?14:04
*** bapalm has quit IRC14:05
*** yaume has quit IRC14:05
lucasagomeshmm so, the support is there, the problem is that the devstack-gate script is hardcoding the drivers (pxe_ssh or agent_ssh)14:05
sdaguelucasagomes: right, something that depends-on the d-g change to show it all running14:05
*** yaume has joined #openstack-infra14:06
sdaguebecause it's not something in the set of tests14:06
*** jbernard_ has joined #openstack-infra14:06
*** bapalm has joined #openstack-infra14:06
lucasagomessdague, the dependency was a devstack patch ( https://review.openstack.org/#/c/280267/ ) which is already merged14:06
lucasagomessdague, oh gotcha what you mean14:07
*** weshay has joined #openstack-infra14:07
lucasagomeslemme think how I can do that...14:07
*** yamamoto has joined #openstack-infra14:07
lucasagomessdague, we have the jobs already enabled in ironic as non-voting, if I submit a patch for ironic depending on the d-g changes you think it will get pulled?14:08
* lucasagomes tries14:08
openstackgerritPaul Belanger proposed openstack-infra/project-config: Add nodepool vexxhost dashboard for grafana  https://review.openstack.org/28721114:08
sdaguelucasagomes: yeh, it should14:08
sdaguedevstack-gate changes can be tested that way14:09
pabelangerclarkb: fungi: jeblair: grafana dashboard for vexxhost ^14:09
lucasagomessdague, ok, gimme 1 min I will do it and I ping you back with the results14:09
sdaguelucasagomes: cool14:09
openstackgerritMerged openstack-infra/groups: Add a standalone map page  https://review.openstack.org/28594314:09
yolandahi mordred, can i get your help on merging some patches that already have a +2? they have been there long14:09
yolandai can send you the list if you have some minutes14:09
*** yaume has quit IRC14:12
*** yaume has joined #openstack-infra14:12
*** doug-fish has joined #openstack-infra14:13
Shrewsmordred: i wonder if there is a cleaner way to handle the bad cloud issue in https://review.openstack.org/28588214:13
Shrewsthat solution seems like a fix for only one thing, but maybe we could make it broader across the api somehow14:14
Shrewsno broader solution immediately comes to mind though14:17
*** exploreshaifali has quit IRC14:18
*** oanson has quit IRC14:18
*** tiswanso has joined #openstack-infra14:20
*** jbernard has left #openstack-infra14:20
*** alivigni has joined #openstack-infra14:21
*** esker has quit IRC14:21
*** mrmartin has quit IRC14:21
mordredShrews: yah, I agree on both points14:22
*** placko has joined #openstack-infra14:22
*** ildikov has quit IRC14:23
Shrewsmordred: -1'd it so we can take some time discussing/thinking about it14:23
*** annegentl_ has quit IRC14:23
*** yamamoto has quit IRC14:23
mordredkk14:23
*** annegentl_ has joined #openstack-infra14:23
openstackgerritMonty Taylor proposed openstack-infra/shade: Pass username/password to SwiftService  https://review.openstack.org/28722414:24
mordredShrews: there's a follow up to that patch from ianw14:24
*** baoli has quit IRC14:24
*** yaume has quit IRC14:25
openstackgerritSean McGinnis proposed openstack/requirements: Raise upper-constraint on python-cinderclient to 1.6.0  https://review.openstack.org/28722614:25
*** yaume has joined #openstack-infra14:25
openstackgerritMerged openstack-infra/shade: Invalidate volume cache when waiting for attach  https://review.openstack.org/28697714:26
mordredShrews: nod on the clarkb patch14:27
*** hashar has quit IRC14:27
Shrewsmordred: 'key' is the dict key for password in swift?14:29
Shrewsline 742 in your patch14:29
*** mriedem has joined #openstack-infra14:31
*** tiswanso has quit IRC14:31
*** baoli has joined #openstack-infra14:31
mordredShrews: yah14:32
*** mrmartin has joined #openstack-infra14:32
mordredone sec- lemme doublecheck something14:32
*** tiswanso has joined #openstack-infra14:33
mordredugh. the url one has a different name14:33
mordredoh! they've got a compat layer. one sec,14:33
openstackgerritMonty Taylor proposed openstack-infra/shade: Pass username/password to SwiftService  https://review.openstack.org/28722414:34
mordredShrews: there ya go14:34
mordredShrews: http://git.openstack.org/cgit/openstack/python-swiftclient/tree/swiftclient/service.py#n9514:34
*** baoli has quit IRC14:34
Shrewsah, that's what i was looking for14:35
openstackgerritMonty Taylor proposed openstack-infra/shade: Pass username/password to SwiftService  https://review.openstack.org/28722414:35
mordredShrews: one more update I just saw from reading that14:35
openstackgerritPaul Belanger proposed openstack-infra/puppet-mediawiki: Add SmiteSpam extension  https://review.openstack.org/28723214:35
mordredShrews: it is apparently important to their logic to pass in auth_version14:35
*** jsavak has joined #openstack-infra14:36
Shrewsmordred: k. will approve when jenkins says it's ok14:37
Shrewsand you stop patching  :-P14:37
*** baoli has joined #openstack-infra14:37
* Shrews needs moar coffeeeee14:37
*** Swami has joined #openstack-infra14:38
*** ayoung has joined #openstack-infra14:39
mordredShrews: I'm done patching :)14:40
mordredShrews: I think that's all the current outstanding that need landing other than tristan's heat patches14:40
mattymoCan anyone comment on when Summit talks decisions will be finalized?14:40
mattymoin Austin14:40
Shrewsmordred: agreed14:40
pabelangerSo, wiki.o.o looks good. Day 2, no spamming14:40
openstackgerritMerged openstack-infra/shade: Also reset swift service object at upload time  https://review.openstack.org/28696414:41
pabelangernow we need to decide how to enable account creation again and remove existing spam14:41
pabelanger287232 will help with spam14:41
mordredShrews: and I have a -1/question on the first of those14:41
*** hashar has joined #openstack-infra14:41
openstackgerritMerged openstack-infra/elastic-recheck: Add query for volume-backed live migration abort bug 1524898  https://review.openstack.org/28672514:42
openstackbug 1524898 in OpenStack Compute (nova) "Volume based live migration aborted unexpectedly" [High,Confirmed] https://launchpad.net/bugs/152489814:42
mordredpabelanger: awesome job!14:42
*** annegentl_ has quit IRC14:42
*** annegentl_ has joined #openstack-infra14:43
*** baoli has quit IRC14:44
*** Liuqing has quit IRC14:44
*** yamamoto has joined #openstack-infra14:44
*** flepied has quit IRC14:45
*** yaume_ has joined #openstack-infra14:45
mriedemcodesearch.o.o isn't available?!14:46
*** Swami_ has joined #openstack-infra14:46
openstackgerritMerged openstack-infra/subunit2sql: Use instance variable in ByteStreamToStreamResult init  https://review.openstack.org/28692514:46
mriedemhow will i live?14:46
openstackgerritMerged openstack-infra/subunit2sql: Remove unused flush from read_subunit()  https://review.openstack.org/28692814:47
pabelangermriedem: ya, looks down14:47
*** gokrokve has joined #openstack-infra14:47
* mordred looking14:47
pabelangerchecking puppetdb to see what is going on14:47
*** annegentl_ has quit IRC14:47
pabelangerpuppet seems okay14:47
mordredI have started hound14:48
*** baoli has joined #openstack-infra14:48
*** baoli has quit IRC14:48
*** baoli has joined #openstack-infra14:48
mordredit'll take a few minutes for it to get started up - it does a bunch of git operations14:48
*** jsavak has quit IRC14:48
mordrednm. back up. you should be able to live again mriedem14:48
*** rbrndt_ has joined #openstack-infra14:49
mriedemawesome14:49
*** yaume has quit IRC14:49
mriedemthose were dark times14:49
mriedemdark times indeed14:49
openstackgerritMerged openstack-infra/subunit2sql: Add API methods for getting tests by prefix  https://review.openstack.org/28333414:49
openstackgerritabregman proposed openstack-infra/project-config: Add new expermintal job - centos with neutron  https://review.openstack.org/28723914:50
*** Swami has quit IRC14:50
*** jsavak has joined #openstack-infra14:50
*** baoli has quit IRC14:51
*** sdake has joined #openstack-infra14:51
*** gokrokve has quit IRC14:52
*** baoli has joined #openstack-infra14:52
*** doug-fish has quit IRC14:52
pabelangerfungi: if you have time this morning, do you mind checking out 286785 ? It adds experimental bindep jobs for ansible-jobs, which I think is the last group of testing needed for bindep14:52
*** doug-fish has joined #openstack-infra14:54
*** baoli has quit IRC14:54
*** pradk has joined #openstack-infra14:55
fungipabelanger: why also a -nv suffixed version?14:55
*** sshnaidm_ has joined #openstack-infra14:56
jamespageyolanda, hey - is there any extra setup required to do automatic bug commenting and closure based on gerrit reviews and landing?14:56
*** sshnaidm has quit IRC14:56
*** xyang1 has joined #openstack-infra14:56
*** doug-fish has quit IRC14:56
*** sshnaidm_ has quit IRC14:56
*** ildikov has joined #openstack-infra14:56
pabelangerfungi: mostly forward looking, there are some functional tests that are -nv currently.  I figure once this patch is good, I can simply remove the original jobs without having to modify the new bindep ones14:56
*** doug-fish has joined #openstack-infra14:57
yolandajamespage, you mean, getting -1 from jenkins on tests?14:57
fungipabelanger: aha, got it. so when this goes live, some projects will be using the -nv version14:57
pabelangerfungi: yup14:57
fungisensible. thanks!14:57
*** zhurong has quit IRC14:58
openstackgerritMerged openstack-infra/shade: Use isinstance() for result type checking  https://review.openstack.org/28681114:58
*** dmsimard is now known as dmsimard|afk14:58
pabelangerWe'll let it run experimental for a few runs, then setup a time with odyssey4me to replace existing jobs14:58
*** gongysh has joined #openstack-infra14:58
fungiat this point i'm waiting a couple more days for the feature freeze rush to die off before i un-wip the first patch to cut over the python-jobs group to using ubuntu-trusty14:58
odyssey4mefungi ah, the first switchover to ubuntu-trusty... this is going to be fun14:59
AJaegerfungi, for project-config we removed already dvipng and pandoc since there are no users - what do you think of https://review.openstack.org/#/c/286242/ and https://review.openstack.org/#/c/284371/ ?14:59
fungithat'll be the biggest chunk of the transition, and i expect to find most of our corner cases there in lesser-known projects14:59
AJaegerfungi: we could move project-config or manuals to ubuntu-trusty if you want...14:59
odyssey4mewe ran it for a while, and lately got a lot more failures than successes - eventually we figured that we'd hold back and stop wasting nodepool resources14:59
odyssey4mepabelanger explain the bindep thing to me?15:00
odyssey4meif you have a moment15:00
fungipabelanger: yep, getting those out before a transition makes sense, good research15:01
fungier, AJaeger ^15:01
funginot pabelanger15:01
pabelangerodyssey4me: sure, we are moving to -bindep to replace our bare-precise images, which are pinned only to rax. Once on bindep jobs, we'll be using the ubuntu-trusty DIB, across all clouds15:02
pabelangerodyssey4me: TL;DR bare-precise old and busted, ubuntu-trusty new and hotness15:02
fungiodyssey4me: bindep is a tool and mechanism for declaring system-provided dependencies across multiple platforms15:02
odyssey4meok cool - down with that15:02
sgordonAJaeger, got a second to talk about https://review.openstack.org/#/c/277892/ ?15:02
AJaegerfungi, those were the only obvious extras. I didn't check the rest.15:02
AJaegersgordon: sure15:02
odyssey4mefungi  pabelanger yeah, that's why I was not making the correlation15:02
pabelangerodyssey4me: also allows you to create other-requirements.txt to control which binaries get installed by default15:03
jeblairfungi: it looks like vexxhost is in progress, waiting on image uploads now; is that correct?15:03
sgordonAJaeger, if i understand correctly you are saying some of those skips dont take effect because e.g.   - name: gate-functional-dsvm-magnum-api15:03
sgordon    branch: ^(?!stable/kilo).*$15:03
sgordon is later in the file15:03
sgordon?15:03
AJaegersgordon: correct15:03
sgordonblergh15:03
AJaegeryou might need to reorder these and check the layout output to see whether it does what you want...15:03
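A quick way to sanity-check a branch pattern like the one sgordon quotes above is to run it against a few branch names. This is only a minimal sketch using Python's `re` module: the branch list and the `job_runs_on` helper are illustrative assumptions, and it checks the regex alone, not how Zuul combines several entries for the same job in layout.yaml.

```python
import re

# Pattern quoted in the discussion: match every branch except stable/kilo.
BRANCH_PATTERN = re.compile(r"^(?!stable/kilo).*$")


def job_runs_on(branch):
    """Return True if the branch name matches the pattern (hypothetical helper)."""
    return BRANCH_PATTERN.match(branch) is not None


# Illustrative branch names, not taken from any real layout.
branches = ["master", "stable/liberty", "stable/kilo"]
results = {b: job_runs_on(b) for b in branches}

for branch, runs in results.items():
    print(branch, "->", "runs" if runs else "skipped")
```

The negative lookahead `(?!stable/kilo)` rejects any name that starts with `stable/kilo`, so the pattern alone behaves as intended; the ordering problem AJaeger describes has to be checked in the layout output itself.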
sgordonAJaeger, is there anything that checks that isnt the case for others as well?15:03
*** sshnaidm has joined #openstack-infra15:03
AJaegersgordon: not really ;( This is manually checking15:04
sgordonAJaeger, just because i see after a lot of people glanced over it nobody else noticed this...15:04
odyssey4mepabelanger other-requirements.txt is for the bindep stuff I expect, and we could possibly use that when deploying a service from source to reduce our need to carry apt package lists15:04
fungiodyssey4me: the end goal is that we use basically blank slate workers to run all jobs, devstack and tox alike, so this is providing us with a solution to install distro package dependencies for jobs during job runtime, allows individual projects to express their own dependency lists instead of using our fallback default, and allows us to figure out what packages to pre-cache on our workers when we15:04
fungibuild images for them15:04
sgordonAJaeger, ok well i will try and fix this one then15:04
clarkbjeblair: basically yes, we ran into some upload fails last night we need to sort out15:04
jeblairclarkb: are we expecting them to fail again?15:04
fungijeblair: right, what clarkb just said15:04
AJaegerfungi: we could move project-config or manuals to ubuntu-trusty if you want. moving project-config allows us to test it first but if it fails, we really have a problem ;)15:04
clarkbjeblair it seemed consistent last night, was going to ask mnaser about it today15:05
mnaserhi o/15:05
jeblairclarkb: vexxhost does not use floating ip, correct?15:05
clarkbjeblair: correct15:05
fungiclarkb: your last theory was that they might be limiting concurrent uploads to glance, right?15:05
jeblairpabelanger: dashboard approved!15:05
clarkbfungi: ya because the first upload seemed fine15:05
AJaegerhttps://review.openstack.org/285722 (project-config) and https://review.openstack.org/283445 (manuals) are the switches to ubuntu-trusty, fungi15:06
pabelangerodyssey4me: right. So one of the things we could do is add an ansible module for bindep. Then leverage other-requirements.txt to handle package dependencies15:06
clarkband earlier in the day I did manual upload with no problems15:06
openstackgerritGiulio Fidente proposed openstack-infra/tripleo-ci: WIP: Enable network isolation in all CI jobs  https://review.openstack.org/27342415:06
pabelangerI plan on creating ansible-role-bindep to play with it15:06
mnaserglance issues at our side?15:06
mnaser(sorry i dont have too much context)15:06
clarkbmnaser: maybe, I can dig out our logs15:06
pabelangerjeblair: \o/15:06
mnasersmall silly question15:06
mnaserhow big is the image being uploaded / how long is it taking15:06
pabelangerjeblair: since you are looking at grafana, do you mind looking at 264994 again?  Adds some templating support into grafana15:07
*** sfinucan has joined #openstack-infra15:07
mnaserwe've seen big images fail to upload because the glance token expires by the time the upload is done so the registry poops out15:07
odyssey4mepabelanger nice idea15:07
openstackgerritMerged openstack-infra/project-config: Create ansible functional using bindep  https://review.openstack.org/28678515:08
*** zz_dimtruck is now known as dimtruck15:08
jeblairpabelanger: the main thing i'm concerned about is whether it's useful to have the different regions on one graph... tell you what... how about we copy that dashboard to a new 'experimental' one so we can all look at it and see which we like better?15:08
*** sigmavirus24_awa is now known as sigmavirus2415:08
pabelangerjeblair: sure I can do that15:08
mnaserclarkb: i think it might be at our side, im checking15:09
*** mtanino has joined #openstack-infra15:09
mnaseri see a bunch of "Caught error: The client disconnected while sending the POST/PUT body (4286474992 more bytes were expected)" and the request ip is nodepool.openstack.org15:09
openstackgerritMerged openstack-infra/system-config: Add instructions on how to disable a third party ci account  https://review.openstack.org/26535915:09
mnaseroddly enough they all seem to hover around that amount so im verifying, we have a nginx frontend in front of glance for ssl stuff, checking there15:09
*** jsavak has quit IRC15:10
clarkbmnaser: ok I am just starting my morning so may be slow to respond but can help where I can15:11
fungiit's possible that was our nodepool retrying an upload of the same image, so having it fail at the same bytecount each time is not too strange15:11
mnaserclarkb: np, but i think it might be at your side15:11
mnaserI see it logging the errors as 499 in the nginx logs15:11
mnaserhttp://stackoverflow.com/a/1841093215:11
jamespageyolanda, no - so we will reference bugs in the commit entries; nice if we got automatic status setting on the linked bugs... does that make sense?15:12
mnaserand 499 in error logs implies some timeout at your side possibly15:12
openstackgerritPaul Belanger proposed openstack-infra/project-config: Create experimental nodepool rackspace dashboard  https://review.openstack.org/26499415:12
AJaegerclarkb, jeblair, fungi: I have a question about one review. training-labs likes to publish tarballs, could you check https://review.openstack.org/#/c/281721/ and tell whether that's the right approach? The change looks fine on technical level but I'd like confirmation on the approach15:12
clarkbmnaser: http://paste.openstack.org/show/488987/ is what shade is logging15:12
openstackgerritPaul Belanger proposed openstack-infra/project-config: Create experimental nodepool rackspace dashboard  https://review.openstack.org/26499415:12
pabelangerjeblair: experimental ^15:12
AJaegerjamespage: do you mean http://docs.openstack.org/infra/manual/developers.html#working-on-bugs ?15:13
clarkbmnaser: which is also complaining about connection stuff, so could be15:13
* jamespage looks15:13
AJaegerjamespage: and http://docs.openstack.org/infra/manual/creators.html#create-bug-tracker ?15:13
clarkbmnaser: I think this may be our first glance v2 cloud? so entirely possible this is a bug in our client stuff15:14
clarkbmordred: ^ you may have ideas too15:14
jamespageAJaeger, that's the one - we already have a bug tracker but its not under https://bugs.launchpad.net/<projectname>15:14
mnaserclarkb: v1 is available if you want to fallback to it, but it seems every single upload from nodepool has failed15:14
jamespageits https://bugs.launchpad.net/charms/+source/<projectname [sans charm-]>.15:14
mnaser(from what the logs show)15:14
*** _amrith_ is now known as amrith15:14
clarkbmnaser: ya I don't see any successful ones on my side15:14
mordredclarkb: shouldn't be our first glance v2 cloud15:15
jamespageAJaeger, for example - https://bugs.launchpad.net/charms/+source/nova-compute15:15
mordredclarkb: s/shouldn't/isn't/15:15
mordredclarkb: but looking15:15
fungiAJaeger: on 281721 i need to check whether $ZUUL_BRANCH is actually set in post. for some reason i thought it wasn't15:15
AJaegerjamespage: no idea whether we support this, hope others can answer...15:15
jamespageok15:15
*** esikache1 has quit IRC15:15
*** austin81 has joined #openstack-infra15:16
mnaseractually, i see a few images that successfully uploaded15:16
*** jaosorior has quit IRC15:16
mnaser3x 401s and 4x 204s (which is an a-ok for post-upload)15:16
clarkbmnaser: oh I misread my output15:16
clarkbyup we have 3/4 images "ready"15:16
mnaseralso our tokens are set to 6 hours15:17
*** yamamoto has quit IRC15:17
*** jaypipes has joined #openstack-infra15:17
AJaegerfungi: oh, we might need $ZUUL_REFNAME, don't we?15:17
*** annegentl_ has joined #openstack-infra15:17
clarkbhttp://paste.openstack.org/show/488988/15:17
clarkbso it does eventually work?15:17
*** infra-red has joined #openstack-infra15:17
*** yamamoto has joined #openstack-infra15:18
*** annegentl_ is now known as annegentle15:18
mnaserhm im not sure if those are successful uploads15:18
yolandajamespage, sorry but i cannot help with that, other infra-root can help?15:18
mnaserconsidering that after the 401, the 204 for the same id executed in a few seconds15:18
clarkbmnaser: it isn't attempting to boot them yet so possible that it isn't15:18
*** ajmiller has joined #openstack-infra15:18
sdagueclarkb: is there a fix up for openstack health?15:18
clarkbsdague: ?15:18
*** yamamoto has quit IRC15:18
sdaguebecause we've got an 8 hour gate delay, and openstack health hasn't indexed anything in 2 weeks15:19
sdagueso it's hard to understand15:19
mnaserclarkb: logs for file uploads at our side - http://paste.openstack.org/show/488990/15:19
clarkbsdague: I think mtreinish had subunit parsing bugs that he and pleia2 got sorted out yesterday15:19
clarkbsdague: but I haven't been able to keep up with that15:19
sdagueok15:19
sdaguemtreinish is there a thing ready to go?15:20
clarkbmnaser: I am going to manually boot off of those images which should tell us if they worked15:20
mnaserclarkb: okay, and if you can see if a manual glance image-upload works or not, would be helpful in figuring out if there is any network issues15:20
mtreinishsdague: I'm still grasping at straws trying to figure out why the subunit worker isn't working15:20
jeblairjamespage, yolanda, AJaeger: see the 'groups' setting described here: http://docs.openstack.org/infra/manual/creators.html#add-the-project-to-the-master-projects-list15:21
*** yamamoto has joined #openstack-infra15:21
jeblairmtreinish: is there a thing you need me to do?15:21
clarkbmnaser: I do know a manual `openstack image create` worked yesterday but with a smaller image15:21
mtreinishjeblair: I'm not sure, I've got a full log from the worker. We're consistently hitting: http://paste.openstack.org/show/488878/15:21
mnaseryeah maybe it's a size thing or a timeout thing, but removing nodepool from the equation might be helpful to see whats going on15:22
mtreinishbut I can't figure out where the file is being closed15:22
*** kushal has quit IRC15:22
jamespagejeblair, looking15:22
AJaegerjeblair: you're right - jamespage you use https://bugs.launchpad.net/charms/, set groups to charm and it should work fine, shouldn't it?15:22
clarkbmnaser: yup15:23
lucasagomessdague, here it goes https://review.openstack.org/#/c/287216/15:23
openstackgerritRick Salevsky proposed openstack-infra/jenkins-job-builder: Fix disabled condition in case disabled is a string  https://review.openstack.org/28725715:23
*** baoli has joined #openstack-infra15:23
lucasagomessdague, if you look at the pxe_ipmitool-nv job it's now using ipmitool (w00t) http://logs.openstack.org/16/287216/1/check/gate-tempest-dsvm-ironic-pxe_ipmitool-nv/442affc/logs/screen-ir-cond.txt.gz#_2016-03-02_14_48_47_85815:24
*** jcooley_ has quit IRC15:24
sdaguelucasagomes: great, thanks for that15:24
sdague+215:24
lucasagomessdague, thank YOU!15:25
openstackgerritMerged openstack-infra/system-config: Add troubleshooting section to github  https://review.openstack.org/28172615:25
clarkbmnaser: 949d5f5e-82e2-4cb0-bad6-91a8883e48a5 doesn't ping or ssh and booted off of one of those images so I think maybe the image isn't good15:25
*** ccrouch has joined #openstack-infra15:25
lucasagomessorry for bothering, it's because we really want to enable it in Ironic... the ipmitool is our reference driver so being able to test it in gate is a good thing15:25
clarkb(security groups should be open I have been able to ssh and ping other hosts in that project with the same security groups)15:25
lucasagomes"test what you ship, ship what you test" kinda thing :-)15:25
*** jsavak has joined #openstack-infra15:25
pabelangerAJaeger: if you have a moment, 26499415:25
clarkbmnaser: mordred the error we log on our side is interesting "Error finding address for https://...: Unable to establish connection to https://...15:27
jamespageAJaeger, jeblair: I'll raise a change to do that - might work although the individual project name might still mismatch due to the leading charm- on the project15:27
clarkbmnaser: mordred almost implies name lookup problems, I did confirm that I could establish tcp connections to that host from nodepool.o.o last night fwiw15:27
*** woodster_ has joined #openstack-infra15:27
*** jamesmcarthur has joined #openstack-infra15:27
openstackgerritMerged openstack-infra/system-config: Add infracloud runbook to regenerate images  https://review.openstack.org/28385615:28
openstackgerritMerged openstack-infra/system-config: Remove obsolete pandoc install  https://review.openstack.org/28437115:28
openstackgerritMerged openstack-infra/system-config: Remove obsolete dvipng install  https://review.openstack.org/28624215:28
openstackgerritMerged openstack-infra/storyboard-webclient: Make header 'quick navigate' and 'search' clearer  https://review.openstack.org/27898715:28
*** baoli has quit IRC15:28
openstackgerritGrant Murphy proposed openstack-infra/project-config: Add bandit gate job for Octavia  https://review.openstack.org/28691215:29
jeblairjamespage, AJaeger: i thought in the past we had used it to make a mapping -- it's the most badly named parameter ever, it doesn't actually have anything to do with groups.15:29
jeblairjamespage, AJaeger: it may be worth a look at the jeepyb source to see what it actually does15:29
jeblairjamespage, AJaeger: and other examples in projects.yaml15:29
mordredclarkb: yah - I agree - it seems to be normal requests things rather than anything really related to openstack services15:29
*** jsavak has quit IRC15:29
*** jsavak has joined #openstack-infra15:30
AJaegerjamespage: can't help you with that due to time, do you need more pointers?15:30
*** roxanaghe has joined #openstack-infra15:30
*** gordc has joined #openstack-infra15:31
*** baoli has joined #openstack-infra15:31
clarkbmordred: do you think it is worth setting image_api_version: "1" ?15:31
clarkbto see if there is a difference in behavior between api versions15:32
jamespageAJaeger, I think I have enough to poke around and figure things out15:32
mordredclarkb: I do not15:32
mordredclarkb: I have used shade to upload images to vexxhost using version 2 - also we use version 2 at ovh15:33
AJaegerI have one more question for the experts: https://review.openstack.org/#/c/286956/ enhances the global skip regex and I wonder whether the job isn't just named wrongly. Do we have a naming convention that I'm not aware of? Should it be gate-networking-midonet-v2-dsvm-rally or gate-rally-dsvm-networking-midonet-v2 ?15:33
AJaegerjamespage: Great!15:33
mordredclarkb: which is not helpful in that it does not explain what the heck is going on15:34
mordredclarkb: but it sure does seem like it's getting name resolution errors or something else15:34
clarkbbut v2 should work with large images across oceans15:34
mordredyes15:34
Zaraemail patches are still waiting in review, so I'm here to bug everyone! :D https://review.openstack.org/#/c/270329/ and https://review.openstack.org/#/c/270331/15:34
clarkbok, I need to do breakfast and feed kids and stuff will pick this back up after, but please don't stop investigating if you are able :)15:34
mordredif it broke, I would expect it to break with the server dying in the middle of transit15:34
*** baoli has quit IRC15:35
jeblairmtreinish: can you reproduce locally?  could there have been a testtools change that alters how it works with stream files?15:36
*** dizquierdo has joined #openstack-infra15:36
clarkbmordred: mnaser also we will want to clean up our image list (there are a bunch that are "queued")15:36
clarkbI will use openstack client to upload after breakfast to at least see if that works with our images15:37
mtreinishjeblair: I asked lifeless about that and he said subunit doesn't close the file15:37
mtreinishjeblair: I'm working on spinning things up locally right now to try and reproduce it15:37
*** baoli has joined #openstack-infra15:38
*** piet has quit IRC15:39
*** harlowja_at_home has joined #openstack-infra15:40
mordredclarkb: well, actually, instead of doing that15:40
*** yamahata has joined #openstack-infra15:40
mordredclarkb: when you get back, can we upload it using shade by hand with debug logging turned on?15:40
clarkbmordred: sure, you can go ahead and start that without me15:40
mordredclarkb: because if that succeeds but nodepool fails, then we know there is something extra bonghits going on, but if it fails, we can get debug logs15:40
mordredclarkb: cool. which image are we uploading?15:40
*** korzen has quit IRC15:40
*** abregman has quit IRC15:40
AJaegerjeblair, fungi: coming back to fungi's comment - after grepping around, I think we need ZUUL_REFNAME instead of ZUUL_BRANCH for the post job of 281721. Do you agree that's better?15:40
*** sorantis has quit IRC15:41
fungiAJaeger: i've just gotten to where i can dig deeper on that one. a moment15:41
clarkbmordred any of fedora-23, devstack-centos7, ubuntu-trusty, or devstack-trusty qcow215:41
mriedemAJaeger: it looks like this change only applies to master branches, is that correct? https://review.openstack.org/#/c/286180/1/zuul/layout.yaml15:41
mriedemor by design?15:42
*** infra-red has quit IRC15:42
AJaegermriedem: it shouldn't15:42
clarkbmordred: the /var/log/nodepool/builder-debug.log should give you exact image filepaths15:42
fungiAJaeger: an example https://jenkins.openstack.org/job/cinder-upstream-translation-update/1588/parameters/15:42
mriedemAJaeger: oh nvm15:42
jeblairAJaeger: oh yeah probably so15:42
mriedemthe change i was watching in liberty wasn't a test change, ignore me15:42
*** yamamoto has quit IRC15:42
AJaegerfungi: so, you're right. thanks.15:42
*** gokrokve has joined #openstack-infra15:42
fungiAJaeger: so, yes, there is no ZUUL_BRANCH provided in post, just ZUUL_REFNAME and ZUUL_REF but i think we've decided previously that the first of those is what we should match on15:42
*** yamamoto has joined #openstack-infra15:43
openstackgerritAndreas Jaeger proposed openstack-infra/project-config: Adds training-labs post jobs to publish tarballs  https://review.openstack.org/28172115:43
AJaegerfungi, jeblair: I just pushed using REFNAME ^15:43
*** baoli has quit IRC15:43
*** piet has joined #openstack-infra15:43
*** baoli has joined #openstack-infra15:43
AJaegerfungi, we currently use both REF and REFNAME - time to cleanup?15:43
*** yamamoto has quit IRC15:43
*** dizquierdo has quit IRC15:43
fungiAJaeger: where is ZUUL_REF being used as a branch name?15:44
*** yamamoto has joined #openstack-infra15:44
fungiAJaeger: there are legitimate reasons to use one or the other, but in post they happen to have the same value15:45
AJaegerjenkins/jobs/requirements.yaml:          export ZUUL_REF={branch-override}15:45
openstackgerritDiana Whitten proposed openstack/requirements: Bump django-compressor to 2.0  https://review.openstack.org/28666315:45
fungiAJaeger: i think that's because something's using ZUUL_REF so the script is overriding its content, but i'll have to look closer at that job definition15:45
AJaegerjenkins/scripts/propose_update.sh uses also ZUUL_REF15:46
fungiyeah, that seems to be the script that job is calling15:46
*** gokrokve has quit IRC15:47
AJaegerfungi: indeed ;(15:47
fungiAJaeger: i agree, seems to be using it to determine the branch name. we only run that script in post?15:47
fungii think that probably should switch (gracefully) to ZUUL_REFNAME at some point, though doing so will require a minimum of two patches15:48
fungitemporarily either the job needs to set both or the script needs to use either, then a second patch after image updates can clean out the old variable name15:49
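The "script needs to use either" option fungi mentions can be sketched as a simple fallback lookup. The real script in question is a shell script (propose_update.sh), so this Python version is only an illustration of the transitional logic; the `branch_from_env` name and the optional `env` argument are assumptions made for the example.

```python
import os


def branch_from_env(env=None):
    """Prefer ZUUL_REFNAME, fall back to ZUUL_REF during the transition.

    `env` defaults to the process environment; passing a plain dict makes
    the helper easy to exercise without touching real variables.
    """
    env = os.environ if env is None else env
    # An unset or empty ZUUL_REFNAME falls through to ZUUL_REF.
    return env.get("ZUUL_REFNAME") or env.get("ZUUL_REF")


if __name__ == "__main__":
    print(branch_from_env({"ZUUL_REFNAME": "master", "ZUUL_REF": "old-value"}))
    print(branch_from_env({"ZUUL_REF": "stable/liberty"}))
```

Once every job sets ZUUL_REFNAME, the fallback to ZUUL_REF could be dropped in the second patch fungi describes.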
*** placko has quit IRC15:49
*** keedya has joined #openstack-infra15:49
clarkbmordred any luck finding an image file?15:49
AJaegeryes, only in post. I see - will do later...15:49
*** fhubik is now known as fhubik_brb15:49
mordredclarkb: there is a shade upload running in a screen15:49
*** esikache1 has joined #openstack-infra15:49
fungiAJaeger: though also it seems relatively low priority for now since it's working15:50
clarkbmordred: awesome15:50
mordredclarkb: (maybe I should put what I just did in the repl into a little script in case people need to do something similar for debugging)15:50
*** Sukhdev has joined #openstack-infra15:51
AJaegerthanks, fungi.15:51
*** jistr has quit IRC15:51
jeblairmtreinish: if i'm following subunit2sql correctly, it gets jobs as fast as it can, decompresses the subunit file, and puts the entire file contents onto an internal queue.  then a second thread pops items off of that queue and processes them, right?15:51
*** sdake_ has joined #openstack-infra15:51
openstackgerritMerged openstack-infra/shade: os_router playbook cleanup  https://review.openstack.org/28677015:52
*** sdake has quit IRC15:52
mtreinishjeblair: right15:52
fungijeblair: just to confirm, we had previously determined that for jobs in ref-updated pipelines, if we need to identify the name of the branch we should prefer ZUUL_REFNAME instead of ZUUL_REF for that, right? making sure i haven't gotten them backwards15:52
*** jpr has joined #openstack-infra15:53
*** gokrokve has joined #openstack-infra15:53
*** gokrokve has quit IRC15:53
*** gokrokve has joined #openstack-infra15:53
jeblairmtreinish: that will mean scaling this (if it ever becomes necessary) may be difficult because the system may consume jobs faster than it can process them (but those jobs won't be available for other workers)... but perhaps more relevant to the current situation....15:53
jeblairmtreinish: i don't think we can tell which subunit stream it bombed on15:54
jeblairmtreinish: (i'm also slightly worried about the memory consumption of a whole lot of cstringio objects waiting in memory to be processed)15:54
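A minimal sketch of the two-thread pattern jeblair describes above, assuming nothing about the actual subunit2sql worker code: one thread fetches work and pushes it onto an internal queue, and a second thread pops items and processes them. Bounding the queue with `maxsize` is an assumption made here for illustration (not something the worker is known to do); it is one way to keep the fetcher from running far ahead of the processor and holding many buffers in memory. All names are hypothetical.

```python
import io
import queue
import threading


def run_pipeline(payloads, maxsize=2):
    """Fetch payloads on one thread and process them on another."""
    work = queue.Queue(maxsize=maxsize)  # put() blocks while the queue is full
    results = []
    done = object()  # sentinel telling the consumer to stop

    def fetch():
        for name, data in payloads:
            # Stand-in for "decompress the subunit file into a buffer".
            work.put((name, io.BytesIO(data)))
        work.put(done)

    def process():
        while True:
            item = work.get()
            if item is done:
                break
            name, buf = item
            # Carrying the name with the buffer lets a failure be
            # traced back to a specific input.
            results.append((name, len(buf.read())))

    threads = [threading.Thread(target=fetch),
               threading.Thread(target=process)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results
```

With a bounded queue the fetcher simply blocks until the processor catches up, so at most `maxsize` buffers wait in memory at once.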
mtreinishjeblair: well, I stole the approach from the logstash processor :)15:54
mtreinishjeblair: and speaking of memory consumption it crashed because of oomkiller 2 weeks ago15:55
*** sdake has joined #openstack-infra15:55
jeblairmtreinish: i think the logstash processor may wait until its internal queue item is consumed before reporting the job as complete... let me check that15:55
*** gongysh has quit IRC15:56
mtreinishjeblair: I added the str(stringio) (which has the memory addr for the object) to the log so we can see when something gets pushed on the queue and when the processor pulls it off15:56
*** fhubik_brb is now known as fhubik15:57
*** eric_lopez has quit IRC15:57
*** piet has quit IRC15:57
openstackgerritMonty Taylor proposed openstack-infra/shade: Add debug message about file hash calculation  https://review.openstack.org/28728115:57
mordredclarkb: ^^ just added that so that it's clear when shade is spinning on hash calculation15:58
*** sdake_ has quit IRC15:58
AJaegerI have one more question for the experts: https://review.openstack.org/#/c/286956/ enhances the large skip-if regex and I wonder whether the job isn't just named wrongly. Do we have a naming convention that I'm not aware of? Should it be gate-networking-midonet-v2-dsvm-rally or gate-rally-dsvm-networking-midonet-v2 ? fungi, jeblair, clarkb, can either of you answer on the naming here, please?15:58
jeblairmtreinish: you are right about the log processor, it also has two disconnected processing threads, though it seems that most of the processing is in the gearman-synced thread, so the other shouldn't get too far behind.  nonetheless, i bet we can make subunit simpler...15:58
*** flepied has joined #openstack-infra15:59
*** baoli has quit IRC15:59
fungiAJaeger: yeah, saw your question earlier--was going to ponder that one next15:59
*** esikache1 has quit IRC15:59
jeblairmtreinish: ah, so we should be able to map an error to a specific subunit file?15:59
AJaegerfungi, sorry for my impatience ;(15:59
fungiAJaeger: no, it's fine, thanks for the reminder15:59
*** baoli has joined #openstack-infra16:00
*** maishsk has quit IRC16:00
mtreinishjeblair: yeah, that was my intent, I put the full log here: http://blog.kortar.org/wp-content/uploads/2016/02/subunit-worker-A-debug.log216:01
fungiAJaeger: so basically if "rally" appears SOMEWHERE before "-dsvm" in the job name, then they don't need to adjust the regex16:01
jeblairmtreinish: so if i'm following you and the code correctly, the example in http://paste.openstack.org/show/488878/ really did bomb on the file listed at the top because the mem addresses of the 2 stringio log lines match?16:01
*** rhallisey has quit IRC16:01
openstackgerritThomas Herve proposed openstack-infra/project-config: Speed up tox test count  https://review.openstack.org/28728516:01
mnaserclarkb: i'm pretty confused about that error to be honest16:02
*** piet has joined #openstack-infra16:02
mtreinishjeblair: yep, that's what it looks like16:02
mtreinishand that's where I get very confused :)16:02
mnaserit seems to imply that the connection is being dropped or it's unable to communicate16:02
*** rhallisey has joined #openstack-infra16:02
AJaegerfungi, it needs to be rally-dsvm16:02
jeblairmtreinish: then hrm!  i can't reproduce locally, even after a local revert of your removal of the flush call.16:02
pabelangerAJaeger: looks like gate failure on 28721116:02
jeblairmtreinish: this is weird16:02
pabelangerAJaeger: git clone issue16:02
jeblairmtreinish: i'll see if breakfast helps.16:03
*** matrohon has quit IRC16:03
fungiAJaeger: oh, yep you're correct. there's no wildcarding between those16:03
AJaegerpabelanger: please recheck it16:03
mtreinishjeblair: heh, I slept on it and I'm still not any closer :)16:03
pabelangerAJaeger: ah, I wasn't sure that would help.  Doing so now16:04
AJaegerfungi: So, my question is: Is there any kind of naming that we should try to suggest/enforce - or is it free for all16:04
fungiAJaeger: so it could be gate-networking-midonet-v2-rally-dsvm-nv if they wanted16:04
mnaserclarkb: if you can manually upload a nodepool image, that'd help in narrowing it down (unless you're still getting that error when image-create too)16:04
*** rhallisey has quit IRC16:04
AJaegerfungi: yes16:04
mtreinishjeblair: it really looks like it's reusing that first subunit stringio object. Looking at the full log the first 1 works and then all the others after fail16:04
clarkbmnaser: mordred has manual upload with all the debug logging running now16:04
*** tiswanso has quit IRC16:04
*** pcaruana has quit IRC16:05
mnasercool16:05
clarkbmnaser hopefully that clears it up16:05
mtreinishjeblair: but, I just don't know how that could be happening16:05
*** rhallisey has joined #openstack-infra16:05
fungiAJaeger: i would like to see us shrink that regex if possible, but also the proliferation of skip-if rules for each individual project is getting insane16:05
mnaserclarkb: i secretly hope it doesnt work, because that'll be easier to troubleshoot than something involving nodepool :-p16:05
clarkbya16:05
fungiAJaeger: so pushing back on additions like this may just be wasted effort16:05
*** whoops has quit IRC16:06
AJaegerfungi, yeah - seems each project is adding one now16:06
*** fawadkhaliq has joined #openstack-infra16:06
eharneycould someone help me figure out why https://review.openstack.org/#/c/264982/ doesn't seem to be getting into the gate queue after check passes?16:06
fungiAJaeger: it may be easier to not object to additions which are technically correct, and look forward to this all ending up in the individual projects' repos come zuul v316:06
*** fhubik is now known as fhubik_brb16:07
fungiAJaeger: so providing them with more attractive solutions is fine, but we've got other battles where our time would be better spent16:07
mordredmnaser, clarkb: manual upload using shade worked fine16:07
* mnaser wonders if concurrency is an issue16:07
AJaegerfungi: yes, that outlook sounds fine. Ok, will review the change later and +2 (didn't object to it so far, just commented)16:08
mnaseri can't imagine it being one though16:08
AJaegerfungi: Thanks!16:08
*** smarcet has joined #openstack-infra16:08
*** esker has joined #openstack-infra16:08
*** piet has quit IRC16:08
*** jaypipes has quit IRC16:08
*** pots79 has joined #openstack-infra16:08
fungieharney: why would that go back into the gate again?16:09
*** tiswanso has joined #openstack-infra16:09
*** esker has quit IRC16:09
nibalizergood morning16:09
*** whoops has joined #openstack-infra16:09
mordredclarkb: maybe we should figure out a way to turn on the http_debug logging for just vexxhost while running in nodepool and log just that to a different file16:09
fungieharney: i'm confused by what you expect from retesting changes that have been merged for days16:09
mnasermordred, clarkb: could you kick off a reupload of images via nodepool to see if the issue repeats itself?  maybe there was some network issues this morning (as far as i know, there shouldn't be :X)16:09
eharneyfungi: whoops, i pasted the wrong link, i meant  https://review.openstack.org/#/c/264983/16:09
*** esker has joined #openstack-infra16:09
*** _nadya_ has quit IRC16:09
*** BigWillie has joined #openstack-infra16:10
mordred(I'm going to delete the image I just uploaded)16:10
*** armax has joined #openstack-infra16:11
*** sdake_ has joined #openstack-infra16:12
*** daemontool_ has joined #openstack-infra16:12
*** daemontool has quit IRC16:13
*** exploreshaifali has joined #openstack-infra16:13
*** rockyg has joined #openstack-infra16:13
*** ikalnitsky_ has joined #openstack-infra16:13
*** fawadkhaliq has quit IRC16:14
*** mwhahaha_ has joined #openstack-infra16:14
*** bauzas_ has joined #openstack-infra16:14
fungieharney: it depends-on a cinder change which didn't merge until after this one was approved, and apparently cinder and cinderclient no longer share any jobs so are not gated in a shared change queue16:14
fungieharney: you'll need to add or replace the workflow +1 on 264983 to get zuul to notice it's ready for gating now16:15
*** odyssey4me_ has joined #openstack-infra16:15
*** bryan_att has quit IRC16:15
*** mwhahaha has quit IRC16:15
*** larrie_kong has quit IRC16:15
*** niska has quit IRC16:15
*** mgkwill has quit IRC16:15
*** ikalnitsky has quit IRC16:15
*** bauzas has quit IRC16:15
*** larrie_kong has joined #openstack-infra16:15
*** thorst_ has quit IRC16:15
*** maximov_ has quit IRC16:15
*** gnuoy has quit IRC16:15
*** odyssey4me has quit IRC16:15
*** briancurtin has quit IRC16:15
*** gnuoy has joined #openstack-infra16:15
*** johnthetubaguy has quit IRC16:15
*** smarcet has quit IRC16:15
*** sdake has quit IRC16:15
*** markmcclain has quit IRC16:15
*** msuriar has quit IRC16:15
*** sulo has quit IRC16:15
*** alaski has quit IRC16:15
*** bauzas_ is now known as bauzas16:15
*** alaski_ has joined #openstack-infra16:15
*** kencjohnston has joined #openstack-infra16:15
*** johnthetubaguy has joined #openstack-infra16:15
*** larrie_kong has quit IRC16:15
*** larrie_kong has joined #openstack-infra16:15
*** markmcclain has joined #openstack-infra16:15
*** mwhahaha_ is now known as mwhahaha16:15
*** msuriar has joined #openstack-infra16:15
*** ikalnitsky_ is now known as ikalnitsky16:16
eharneyfungi: ah ok, thanks16:16
*** thorst_ has joined #openstack-infra16:16
*** smarcet has joined #openstack-infra16:16
*** briancurtin has joined #openstack-infra16:16
*** niska has joined #openstack-infra16:17
*** sulo has joined #openstack-infra16:17
*** alaski_ is now known as alaski16:17
fungieharney: basically that's a failsafe, since the lack of any shared integration testing between cinder and cinderclient means that if the cinder change broke this cinderclient change the only way you'll have to know is after the cinder change has merged16:17
*** maximov_ has joined #openstack-infra16:17
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add simple script to help debug shade image uploads  https://review.openstack.org/28729616:17
mordredclarkb: ^^ there is a script version of what I did in the repl earlier16:17
*** mattt has left #openstack-infra16:17
eharneyfungi: is this a new failsafe, or it just happened due to reconfiguration of some jobs?  (just curious)16:18
AJaegereharney: http://docs.openstack.org/infra/manual/developers.html#limitations-and-caveats16:18
fungieharney: it's how the feature has always worked. basically depends-on has two different uses: for projects sharing a change queue it means test the change with the dependency applied. for projects without a shared change queue it means don't land this change until after the declared dependency merges16:19
* AJaeger will be back later...16:19
*** bryan_att has joined #openstack-infra16:19
eharneyfungi: i guess i always was thinking of the latter case16:20
fungieharney: if cinder and/or cinderclient recently removed some jobs such that they ceased having any in common, then depends-on behavior will also have changed between them accordingly16:20
eharneyright16:20
*** infra-red has joined #openstack-infra16:20
mtreinishjeblair: just for a sanity check is there anything alarming in dmesg on subunit-worker01.o.o?16:20
jeblairmtreinish: i think i see the problem; patch in progress16:21
*** sdake_ is now known as sdake16:21
*** mgkwill has joined #openstack-infra16:21
fungimtreinish: oom killer16:22
fungi[Tue Mar  1 01:00:38 2016] Killed process 16382 (subunit-gearman) total-vm:17108324kB, anon-rss:7615884kB, file-rss:0kB16:22
*** jaypipes has joined #openstack-infra16:22
*** Swami__ has joined #openstack-infra16:22
mtreinishfungi: that's surprising, because it shouldn't have been running16:23
mrmartinfungi, pabelanger: I successfully deployed a wiki.o.o clone into a dev machine, but it was not trivial, lots of things are missing from the puppet and .gitmodules are not up to date.16:24
mtreinishjeblair: cool16:24
fungimrmartin: yes, i think the volunteer who originally set up that server did a lot of it manually with the intent of getting more of it into puppet, but never found the time. it's not a great situation16:25
pabelangermrmartin: Ya, I expected that.  I haven't gotten around to testing wiki-dev.o.o on trusty yet16:25
fungiwe've gotten a lot stricter in years since about making sure deployment starts in puppet rather than being automated after the fact16:25
mrmartinit is on precise actually, I'll share the deployment memo somewhere16:26
openstackgerritMerged openstack-infra/shade: Pass username/password to SwiftService  https://review.openstack.org/28722416:26
mrmartini made a round with master but it has a different deployment model, and uses composer for modules16:26
openstackgerritJames E. Blair proposed openstack-infra/subunit2sql: Don't extend the passed in targets  https://review.openstack.org/28729916:26
jeblairmtreinish: ^16:26
*** Swami_ has quit IRC16:26
jeblairmtreinish: your observation about the first one succeeding but subsequent ones failing, and also eating breakfast, led me to that ^16:27
jeblairmtreinish: i was able to reproduce it locally and verify the fix16:27
mrmartinthe 1.25 version of wiki.o.o was released in 2014, so the core and extensions were not updated since then.16:27
mtreinishjeblair: oh, that's a good catch16:27
pabelangermrmartin: 285405 was my entrypoint, but like I said, haven't tested it locally.16:27
jeblairmtreinish: here's my test program: http://paste.openstack.org/show/489002/16:27
* mtreinish curses SpamapS and his targets api :)16:28
jeblairmtreinish: and the commented out line in the middle toggles fixed/broken behavior16:28
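The title of change 287299 ("Don't extend the passed in targets") points at a common Python pitfall: extending a list that the caller owns. The following is a hypothetical illustration of that class of bug, not the actual subunit2sql code or patch; the function names and the `("collector", stream)` tuple are invented for the example.

```python
def process_broken(stream, targets):
    # Bug: extend() mutates the caller's list, so every call leaves
    # one more stale per-stream target behind for the next call.
    targets.extend([("collector", stream)])
    return len(targets)


def process_fixed(stream, targets):
    # Build a new list instead; the caller's list is left untouched.
    local_targets = list(targets) + [("collector", stream)]
    return len(local_targets)


shared = []
broken_counts = [process_broken(s, shared) for s in ("s1", "s2", "s3")]

shared_fixed = []
fixed_counts = [process_fixed(s, shared_fixed) for s in ("s1", "s2", "s3")]
```

In the broken variant the first call behaves correctly and every later call still carries a target bound to the first stream, which would be consistent with the "first one succeeds, subsequent ones fail" observation.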
mrmartinok, as a next step I want to check the new user registration / captcha story, if we can solve that, I guess we can turn back the user registration.16:28
mrmartinthe openid one16:28
SpamapSI have a targets API?16:28
*** gongysh has joined #openstack-infra16:28
mtreinishSpamapS: https://review.openstack.org/#/c/287299/116:28
mtreinishjeblair: I really want to fast track that fix, but do you think we could add a unit test for this case16:29
jeblairSpamapS: congratulations!  it's a targets API!16:29
fungimrmartin: yeah, i've occasionally popped in and updated mediawiki on that server for patches to critical vulnerabilities when they're in features we're exposing, but it's not been frequent and i've no doubt i've probably missed some less obvious issues16:30
mtreinishSpamapS: heh, I forgot what you called it. But it's the thing you added to add the hooks for appending the counters data to the end of the subunit16:30
clarkbmnaser: ya I can trigger some additional uploads though at this time of the day we will be competing with all of the other uploads so may not be quick16:31
jeblairmtreinish: let me see what i can do about a test16:31
mtreinishjeblair: cool, thanks16:32
mtreinishI just want to make sure we don't regress here again16:32
*** esikache1 has joined #openstack-infra16:32
clarkbmnaser: actually it is still trying to upload that one image over and over and still hitting this so I don't think it is transient network issues16:33
*** kushal has joined #openstack-infra16:33
*** odyssey4me_ is now known as odyssey4me16:33
*** jsavak has quit IRC16:33
*** electrofelix has quit IRC16:34
*** tiswanso has quit IRC16:34
*** keedya has quit IRC16:34
*** tiswanso has joined #openstack-infra16:34
*** sdake has quit IRC16:35
SpamapSmtreinish: yeah. I hope to get back to that soon.... We need it16:35
*** david-lyle has quit IRC16:36
*** infra-red has quit IRC16:38
*** david-lyle has joined #openstack-infra16:38
*** eric_lopez has joined #openstack-infra16:38
*** rhedlind has left #openstack-infra16:39
*** yamahata has quit IRC16:39
*** eric_lopez has quit IRC16:40
*** rcarrillocruz has left #openstack-infra16:40
*** apoorvad has joined #openstack-infra16:40
*** sdake has joined #openstack-infra16:40
*** eric_lopez has joined #openstack-infra16:40
*** regXboi has joined #openstack-infra16:41
*** krtaylor has quit IRC16:41
clarkbmnaser: mordred reading up on the glance v2 api the first step is to create the image which returns the file path, then you PUT to the file path16:41
clarkbmnaser: mordred the error message that is being logged implies the first POST to create the image is succeeding and returning a file path16:42
clarkbmnaser: mordred then for some reason we fail to connect during the PUT16:42
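A rough sketch of the two-step Glance v2 upload flow clarkb outlines: create the image record, then PUT the bytes to the file path returned in the response. The `send` callable is a hypothetical transport stand-in (not part of glanceclient or shade), used so the flow can be shown without a real endpoint; the status codes assume the documented v2 behaviour of 201 for create and 204 for the data upload.

```python
import json


def upload_image(send, name, data, disk_format="qcow2"):
    """Create an image record, then PUT the bytes to its file path.

    `send(method, path, headers, body)` is a hypothetical transport
    callable returning (status, body_bytes).
    """
    meta = {"name": name, "disk_format": disk_format,
            "container_format": "bare"}
    status, body = send("POST", "/v2/images",
                        {"Content-Type": "application/json"},
                        json.dumps(meta).encode())
    if status != 201:
        raise RuntimeError("image create failed: %s" % status)
    file_path = json.loads(body)["file"]

    # Step 2 is a separate request; it can fail even though step 1
    # succeeded, leaving an image record with no data behind it.
    status, _ = send("PUT", file_path,
                     {"Content-Type": "application/octet-stream"}, data)
    if status != 204:
        raise RuntimeError("image data upload failed: %s" % status)
    return file_path
```

Because the two steps are independent requests, a failure during the PUT still leaves the image record from the POST in place, so failed attempts can accumulate as empty images that need cleanup.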
*** yamamoto has quit IRC16:42
openstackgerritJeremy Stanley proposed openstack-infra/system-config: Add vexxhost mirror to documentation and cacti  https://review.openstack.org/28731016:43
*** infra-red has joined #openstack-infra16:43
fungiclarkb: ^ i assume you didn't have that proposed yet16:43
clarkb(also the images that "succeeded" are not working in a way that I can debug so I don't think we actually succeeded there)16:43
clarkbfungi: I hadn't16:43
mordredclarkb: I agree with that reading16:44
*** BigWillie has quit IRC16:45
openstackgerrityolanda.robla proposed openstack/diskimage-builder: Generate fedora-atomic images using dib  https://review.openstack.org/28716716:45
*** jsavak has joined #openstack-infra16:45
openstackgerritdevdatta-kulkarni proposed openstack/requirements: Adding python-solumclient  https://review.openstack.org/28731316:45
*** ildikov has quit IRC16:46
*** sc68cal has joined #openstack-infra16:46
openstackgerritJames E. Blair proposed openstack-infra/subunit2sql: Don't extend the passed in targets  https://review.openstack.org/28729916:47
*** esker has quit IRC16:47
jeblairmtreinish, SpamapS: ^16:47
*** resker has joined #openstack-infra16:47
jeblairmordred: ^16:47
*** ashleighfarnham has joined #openstack-infra16:47
openstackgerritJames E. Blair proposed openstack-infra/subunit2sql: Don't extend the passed in targets  https://review.openstack.org/28729916:47
jeblairdrat; typo in comment^16:48
lucasagomesclarkb, jeblair if you guys have a second, mind taking a look at https://review.openstack.org/#/c/284036/ ? Thanks a lot16:48
clarkbmordred: mnaser a tcpdump against that dest host shows lots of bits going by which implies a portion of the upload is working16:49
*** infra-red has quit IRC16:49
jeblairclarkb: osic sitrep?16:51
*** rlandy has quit IRC16:51
clarkbjeblair: everything ready to go (now that i have working volume attach launch node stuff) except that their ssl situation is still :/16:51
jeblairclarkb: that's waiting on them; or should we look into a workaround?16:51
clarkbjeblair: we have to go fully unverified ssl to use the cloud in its current situation due to them not having working dns that maps to cert CNs16:51
clarkbjeblair: the only viable workaround for us is to not verify ssl at all to them16:52
fungiyeah, cert cn there is for a hostname which is not in dns and which they're not going to put in dns. instead they're waiting for intel to add a dns record and they'll replace the cert with one for that new cn16:52
jeblairdo we have an idea if that's a days/weeks sort of thing?16:53
clarkbfungi: the dns record is actually there fwiw, they could use a self signed cert for that name in the interim but aren't for some reason16:53
mtreinishjeblair: cool, thanks. Once that lands I'll push a release and we can restart the worker after it gets pulled in by puppet16:53
clarkbjeblair: I asked and haven't heard back16:53
jeblairmtreinish: ++16:53
fungiwhoever thought it was a good idea to have one organization stand up these environments yet have to go through a different organization for the cert and dns record of the api endpoint was not thinking clearly16:53
*** rossella_s has quit IRC16:53
fungiclarkb: oh, the new dns name exists and points to that ip address now?16:54
clarkbfungi: yes cloud1.osic.org has an A record16:54
*** rossella_s has joined #openstack-infra16:54
clarkbaiui they are waiting for the soa to ack the cert request16:54
fungistill, yeah the certificate mismatch is a no-go. i don't care if it's self-signed or not, it's unusable if it doesn't map to the hostname we're connecting to16:54
*** esikache1 has quit IRC16:54
*** ttx has quit IRC16:54
*** vgridnev has quit IRC16:55
nibalizerwell yay for getting dns16:55
nibalizerI am running do-release-upgrade today so this might be the last time you hear from me for a while :P16:55
jeblaircool, so maybe just a matter of a little more waiting?16:55
clarkbjeblair: ya I think its mostly up to them getting the soa to ack the cert, putting cert in place then updating keystone catalog16:56
*** ttx has joined #openstack-infra16:56
yolandanibalizer, care reviewing some stuff before going into the dark?16:56
clarkbwe can use that cloud now if we want to not verify ssl but fungi seemed hesitant to do that16:56
jeblairnibalizer: i did that recently on my laptop -- suddenly in wily the full disk encryption boot prompt *works*!  that's so much better than on vivid where it did not!16:56
clarkbwhich is understandable given there are reasonable workarounds if dns is working16:56
fungiclarkb: i thought you said latest neutronclient didn't support that?16:56
fungi--insecure became a no-op in a recent release16:57
clarkbfungi: only via cli, openstackclient and shade seem fine16:57
fungii'm mainly leery of saying we're in production on a cloud where we have to downgrade to older client software to interface with it16:57
jeblairfungi, clarkb: i guess we explored the idea of putting something in /etc/hosts but decided not to because that's just bonkers?16:57
clarkbjeblair: I actually did explore that :) it doesn't work because the catalog doesn't use the host name it uses the ip addr16:58
clarkbjeblair: so that works for your first request to get a token and find the next service but fails when you start talking to not keystone16:58
fungiright, they need to also update the keystone catalog, i forgot about that bit16:58
jeblairclarkb: i'm kind of glad that doesn't work.  i'm not entirely sure i'm glad for the reason.  :)16:58
jeblairalso, i'd really like to get 8 more clouds from them.  do you think that's possible?16:59
jeblair(cloud9.osic.org? !)17:00
*** _nadya_ has joined #openstack-infra17:00
fungihah17:00
fungiyou can ask to go back into the waiting list for slot #9 i guess17:00
*** vincentll has quit IRC17:01
clarkbI think in an ideal world they would've used a self signed cert against a name in dns that they control to start17:02
clarkbthen we could do like we did with infra cloud17:02
jeblairi'm going to restart statsd so we lose the infra-cloud gauges17:02
clarkband when they get dns sorted for long term switch over17:02
*** rcarrillocruz has joined #openstack-infra17:02
openstackgerritSteven Dake proposed openstack-infra/project-config: Change kolla gate timeout to 90 minutes  https://review.openstack.org/28732117:03
*** gyee has joined #openstack-infra17:03
clarkbmnaser: mordred also I confirmed that vexxhost appears to use dhcp for eth0 configuration which means my servers booted off the image uploads that "succeeded" probably aren't failing due to network setup problems17:04
*** rlandy has joined #openstack-infra17:04
*** rhallisey has quit IRC17:05
clarkblack of console log seems to confirm17:05
*** rhallisey has joined #openstack-infra17:06
mnaserin a conference call i'll jump in soon :-p17:06
*** scheuran has quit IRC17:07
zaromorning17:07
*** abregman has joined #openstack-infra17:09
*** fhubik_brb is now known as fhubik17:09
*** baoli has quit IRC17:09
*** yamahata has joined #openstack-infra17:09
*** kdas_ has joined #openstack-infra17:10
openstackgerritMerged openstack-infra/project-config: Create experimental nodepool rackspace dashboard  https://review.openstack.org/26499417:10
*** electrofelix has joined #openstack-infra17:10
clarkbmordred: when you did your upload with shade you did it on nodepool.o.o using system libs?17:11
fungijenkins01 and 07 are looking potentially unhappy. i'll see what i can do to cheer them up17:11
*** baoli has joined #openstack-infra17:11
openstackgerritMerged openstack-infra/project-config: Add irc channel for the i18n french team  https://review.openstack.org/28704817:11
*** e0ne has quit IRC17:11
anteayafungi: thank you17:11
anteayamorning zaro17:11
*** jpr has quit IRC17:12
*** pleia2 has joined #openstack-infra17:12
*** Swami__ has quit IRC17:12
*** ildikov has joined #openstack-infra17:12
zarojeblair: did you get my msg about trying to fix the gerrit issue where some changes are not searchable?17:12
*** achanda has joined #openstack-infra17:12
*** kushal has quit IRC17:12
*** kdas_ is now known as kushal17:13
*** degorenko is now known as _degorenko|afk17:13
*** kushal has quit IRC17:13
*** kushal has joined #openstack-infra17:13
*** sdake has quit IRC17:13
jeblairzaro: sorry i missed that17:13
clarkbok reading the glanceclient code the "Error finding address" is misleading and appears to be a copy pasta thing17:14
clarkbit is right next to where name resolution errors are actually handled17:14
*** korzen has joined #openstack-infra17:14
mordredclarkb: yes17:15
korzenHi, is there anyone able to get my patch +2? https://review.openstack.org/#/c/250215 I've got +1 from armax, ihrachys and sc68cal, it is the DVR multinode grenade job for Neutron upgrade tests - sdague anteaya17:15
mordredclarkb: used system libs17:15
zarojeblair: maybe give fix-change a try: http://eavesdrop.openstack.org/irclogs/%23openstack-infra/%23openstack-infra.2016-02-18.log.html#t2016-02-18T00:23:1117:16
* armax looks17:16
*** dtantsur is now known as dtantsur|afk17:17
*** rfolco has quit IRC17:17
*** sdake has joined #openstack-infra17:17
sdaguekorzen: +217:17
openstackgerritMerged openstack-infra/shade: Fix create_server() with a named network  https://review.openstack.org/28678717:17
*** xarses has joined #openstack-infra17:17
korzenthanks sdague :)17:17
clarkbmordred: your idea for running the builder with verbose shade logging makes sense to me17:18
*** jlanoux has quit IRC17:18
*** jpr has joined #openstack-infra17:19
clarkbmordred: as a next step I think it will provide the most info, not sure how to limit that to a specific provider though17:19
*** salv-orl_ has joined #openstack-infra17:19
clarkbmaybe we wait for the current builds to other clouds to finish then restart with that in place17:19
mordredclarkb: ++17:19
openstackgerritBen Swartzlander proposed openstack-infra/project-config: Add experimental Manila LVM job with minimal services  https://review.openstack.org/28582817:20
clarkbmordred: can that be toggled in the log config file easily?17:20
*** resker has quit IRC17:20
mordredclarkb: looking now17:21
*** fawadkhaliq has joined #openstack-infra17:21
zaroclarkb: so we are still having the same jvm gc issue with gerrit?17:21
*** salv-orlando has quit IRC17:22
*** marg7175_ has joined #openstack-infra17:22
clarkbzaro: yes I think so. We had to restart last week to handle it17:22
*** dizquierdo has joined #openstack-infra17:22
*** jaypipes has quit IRC17:22
mordredclarkb: tl;dr - we want to enable debug logging for keystoneauth17:22
*** nmagnezi has quit IRC17:22
zaroclarkb: did you confirm it's the same issue?17:22
korzenany second +2 for my patch?17:23
mordredclarkb: so I think that is "yes"17:23
clarkbzaro: we confirmed that GC was high and we were using lots of memory I did not dig in logs to see if it was the same tracebacks17:23
clarkbmordred: just keystoneauth? I think we are failing when glanceclient talks to the glance server17:23
zaroclarkb: were people reporting 500 errors that prompted the restart?17:23
clarkbzaro: yes and server slowness17:23
*** mriedem has left #openstack-infra17:24
*** jcooley_ has joined #openstack-infra17:24
*** mriedem has joined #openstack-infra17:24
*** arxcruz has quit IRC17:24
zaroclarkb: so last we discussed the plan was to up the memory, we still gonna do that?17:24
*** krtaylor has joined #openstack-infra17:24
*** electrofelix has quit IRC17:25
*** xiangxinyong has quit IRC17:25
clarkbzaro: someone needs to make a plan to do it17:25
clarkbbut I think we should17:25
clarkbthe biggest problem there is our IP address will change so we need to let people know what the new address is ahead of time17:25
clarkband we need to figure out how much bigger we should grow17:25
mnaserclarkb: yup, we have dhcp17:26
mordredclarkb: all http requests use keystoneauth17:26
mnaserso if you don't have a console log, the image is probably dun goof'd17:26
mordredclarkb: I'm doing a patch right now17:26
clarkbmordred: ok17:26
zaroclarkb: does something like this need a spec or can we just discuss/plan in a meeting?17:26
clarkbmnaser: yup17:26
mnaseri'd check the size of the image17:26
*** x00350071 has joined #openstack-infra17:26
clarkbzaro: I don't know that it needs a formal spec, but someone needs to do napkin math to figure out how big we need to go, get an infra root to boot the new server so that IP is assigned, then email that out with a cutover date17:27
mnaserclarkb: if the server goes to ACTIVE state as well, retrieving vnc session might be useful to see what/where it's stuck on17:27
*** achanda has quit IRC17:27
zaroclarkb: ok. gerrit-tuning is on the meeting agenda.  we can probably discuss with crew and see where it goes.17:27
openstackgerritMonty Taylor proposed openstack-infra/system-config: Add log file for http requests for nodepool  https://review.openstack.org/28733417:28
mordredclarkb: I think that should do it17:28
*** kiran-r has quit IRC17:28
clarkbmnaser: the file size seems to match. qcow2 is ok right? (it is what I uploaded manually yesterday and it booted so I think qcow2 is fine)17:29
mnaseryeah qcow2 is fine, this is plain kvm17:29
mnaserid just boot and check get-vnc-console and see what's there17:29
*** sfinucan has quit IRC17:30
pabelangerAJaeger: fungi: bindep for ansible-jobs look good17:30
fungiexcellent17:30
pabelangerI even uploaded an empty other-requirements.txt and the job still passed17:31
pabelangerwhich is good, since ansible took care of the dependencies for said job17:31
fungipabelanger: AJaeger: i'm mostly hesitant to do much in the way of cutover to ubuntu-trusty workers this week for fear of further disrupting the already slammed feature-freeze-week gate load17:31
*** dprince has quit IRC17:31
pabelangerYa, I didn't even think of that17:31
fungiif things start to die off friday that would be great timing, or else monday/tuesday next week17:32
clarkbmnaser: oh! I was looking at the wrong images, ya size is 0 on one I was trying to boot17:32
pabelangerfungi: AJaeger: sure WFM17:32
mnaseryep, probably a failed upload in that case17:32
fungibut i have hopes the gate will quiet down a little once the current impetus for the rush is over. trying not to introduce too many new fires to fight while we're still stretched thin keeping the existing situation stitched together17:32
*** Swami has joined #openstack-infra17:33
*** vgridnev has joined #openstack-infra17:34
mnaserhow big are the images, i just tried to do an upload of a ~300M image from the office, just got uploaded with no problems, mordred earlier tried with manual shade upload and it went by fine?17:34
anteayasanity speaks17:34
clarkbmordred: +2 on the logging change lets see if we can get that in17:34
clarkbmnaser: 5-6GB17:34
clarkbmnaser: they cache all of our git repos and the packages we use17:34
mnaserdid mordred try to upload a 5-6gb image too?17:35
*** mrmartin has quit IRC17:35
clarkbmnaser: yes I think so17:35
*** boris-42 has joined #openstack-infra17:36
*** jaypipes has joined #openstack-infra17:36
*** bpokorny has joined #openstack-infra17:36
mordredmnaser: yes17:36
mordredI uploaded one of the nodepool ubuntu-trusty images from nodepool.o.o17:36
mnaserdo uploads from nodepool happen concurrently17:37
clarkbmnaser: we do up to 4 at a time, but saw that it was failing even with a single upload happening17:37
mnaseri think i know what is going on.17:37
*** hashar has quit IRC17:37
*** vivekd has quit IRC17:37
clarkbmnaser: but yes when it started we attempted to do 4 at once17:37
mnasernginx is buffering the request back to the slower glance backend17:37
mnaserso the file is uploaded really fast to nginx and glanceclient sits and waits for an answer17:37
clarkbaha17:37
*** dmsimard|afk is now known as dmsimard17:38
mnaserthat takes a while, glanceclient gives up because nginx is still slowly feeding the data into glance17:38
mnaserif there was no buffering, this wouldn't happen because when the upload completes, it'll get a response right away17:38
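A minimal sketch of the timeline mnaser describes above, written as a toy model rather than anything nginx or glanceclient actually runs. The backend rate and the client timeout are made-up illustrative assumptions; only the roughly 6 GB image size comes from the discussion. It shows why a buffering proxy leaves the client waiting in silence after its last byte, while a streaming proxy does not.

```python
# Toy model of an image upload through a proxy. The rate and the timeout
# are made-up illustrative values; only the ~6 GB size is from the discussion.

def silent_wait_seconds(image_bytes, backend_rate, buffered):
    """Seconds the client waits for a response after sending its last byte."""
    if buffered:
        # The proxy takes the whole body first, then feeds the backend,
        # so the client idles for the full proxy-to-backend transfer.
        return image_bytes / backend_rate
    # Streaming: the client is throttled to the backend's pace, so the
    # response arrives shortly after its last byte is sent.
    return 0.0


def client_gives_up(wait_seconds, client_timeout):
    """True when the silent wait exceeds the client's patience."""
    return wait_seconds > client_timeout


IMAGE_BYTES = 6_000_000_000   # ~6 GB image
BACKEND_RATE = 20_000_000     # assumed 20 MB/s from proxy to backend
CLIENT_TIMEOUT = 60           # assumed client read timeout, in seconds

for buffered in (True, False):
    wait = silent_wait_seconds(IMAGE_BYTES, BACKEND_RATE, buffered)
    print(buffered, wait, client_gives_up(wait, CLIENT_TIMEOUT))
```

Under these assumed numbers the buffered case waits far longer than the timeout and the unbuffered case does not wait at all, which matches the failure mode described in the channel.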
mnaserbased off this -- CentOS-6-x86_64-GenericCloud.qcow2c17:38
mnaseroops -- https://forum.nginx.org/read.php?2,213789,213793#msg-21379317:38
*** [1]Thelo has joined #openstack-infra17:38
mnaser#justopenstackthings - i'll have a look and report back17:39
clarkbmnaser: do we need some sort of ping pong keepalive between nginx and glanceclient?17:39
mnaserclarkb: i think ideally nginx shouldn't buffer and if the communication between nginx and glance is slow, then glanceclient should suffer17:39
clarkbmnaser: that would work too :) thank you for looking into this17:39
mnaserthe problem is the idea that the image was uploaded and shipped, but lack of response once the upload is done17:39
*** sambetts is now known as sambetts|afk17:40
mnaseri'd classify this as a problem at our side, probably doesnt happen when taking snapshots because that doesnt go through nginx17:40
*** ccrouch1 has joined #openstack-infra17:40
clarkbmnaser: do you think it is helpful to have nodepool keep trying or should we turn it off completely for now while you work on your end?17:40
mnaserclarkb: http://nginx.org/en/docs/http/ngx_http_proxy_module.html#proxy_buffering17:41
clarkbwe have ~152 images we need to cleanup at this point17:41
mnaseri just have to flick this switch17:41
*** Thelo has quit IRC17:41
*** [1]Thelo is now known as Thelo17:41
clarkbwoot17:41
mnaseractually17:41
mnaserhttp://nginx.org/en/docs/http/ngx_http_proxy_module.html#proxy_request_bufferingt17:41
mnaserhttp://nginx.org/en/docs/http/ngx_http_proxy_module.html#proxy_request_buffering17:41
mnaserthat17:41
*** claudiub has quit IRC17:41
*** jordanP has quit IRC17:42
*** ccrouch has quit IRC17:42
*** yamamoto has joined #openstack-infra17:43
*** fhubik has quit IRC17:43
*** sdake has quit IRC17:43
*** vgridnev has quit IRC17:44
clarkbI will work on cleaning up the images that we pushed that are all wonky17:44
mnaserclarkb: done, give it a shot, im off to go grab some nomnomz17:44
clarkbmnaser: thanks again!17:44
mnasernp, thank you!17:44
*** vgridnev has joined #openstack-infra17:44
clarkb(it may actually be a little while before nodepool gets to it as the other uploads to other clouds are happening too)17:45
*** achanda has joined #openstack-infra17:45
*** jaypipes has quit IRC17:47
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Add undercloud idempotency test  https://review.openstack.org/27921817:48
*** dizquierdo has quit IRC17:49
*** dizquierdo has joined #openstack-infra17:50
*** yamamoto has quit IRC17:50
*** rockyg has quit IRC17:51
*** IlyaG has joined #openstack-infra17:52
*** rfolco has joined #openstack-infra17:52
*** electrofelix has joined #openstack-infra17:53
*** electrofelix has quit IRC17:53
*** _nadya_ has quit IRC17:53
*** asettle has quit IRC17:53
clarkbok image cleanup is in progress17:54
clarkb(to clean out the images that ended up in a funny state)17:54
*** gongysh has quit IRC17:55
openstackgerritSergey Kraynev proposed openstack/requirements: Bump upper-constraint for heatclient to 1.0.0  https://review.openstack.org/28735117:55
*** sputnik13 has joined #openstack-infra17:56
openstackgerritMerged openstack-infra/ansible-puppet: Safely fail when logs cannot be found  https://review.openstack.org/28499917:56
anteayaAJaeger: in the next day or so I was hoping you might have a few moments to have a discussion with me about how to calculate how many of the jobs we are running (or have available to run) are non-voting as a percentage of total jobs we have available to run17:56
*** Jeffrey4l has quit IRC17:56
anteayaAJaeger: this came out of the discussion at the tc meeting yesterday17:57
anteayaAJaeger: I was thinking if we could grep the jjb xml somehow that would be a start17:57
anteayaAJaeger: I was wondering if you had any other or better thoughts17:57
*** kencjohnston has quit IRC17:58
clarkband image cleanup is all done17:58
*** gokrokve has quit IRC17:58
clarkbthe next uploads should be starting from a clean slate and if this was the root cause end up working17:58
*** kzaitsev_mb has quit IRC17:59
*** pcrews__ has joined #openstack-infra17:59
*** pcrews_ has quit IRC17:59
*** dprince has joined #openstack-infra18:00
*** derekh has quit IRC18:00
*** Sukhdev has quit IRC18:00
openstackgerritJohn Garbutt proposed openstack/requirements: Bump upper constraint on python-novaclient to 3.3.0  https://review.openstack.org/28735518:00
*** lucasagomes is now known as lucas-dinner18:01
*** jsavak has quit IRC18:02
openstackgerritPaul Belanger proposed openstack-infra/project-config: Add missing gate-{name}-ansible-functional-ubuntu-trusty-nv  https://review.openstack.org/28735818:02
*** jsavak has joined #openstack-infra18:03
*** maishsk has joined #openstack-infra18:03
jeblairmordred, SpamapS: naturally, on my workstation, i need to set ANSIBLE_NOCOWS=118:03
*** e0ne has joined #openstack-infra18:04
openstackgerritMerged openstack-infra/tripleo-ci: Roll up static Heat envs into CI directory  https://review.openstack.org/28043118:04
mordredjeblair: yes. it's an important env var to set18:04
mordredjeblair: or, you can just learn to love the cows18:04
*** austin81 has quit IRC18:04
jeblairmordred: i do love cows!18:04
*** jsavak has quit IRC18:04
*** alivigni has quit IRC18:04
jeblairwhich is why i have cowsay installed, unlike most servers, which is why i haven't run into this before :)18:04
mordred:)18:04
*** jsavak has joined #openstack-infra18:04
*** shardy has quit IRC18:06
pabelangerhttp://grafana.openstack.org/dashboard/db/nodepool-rackspace-experimental now exists with hot templating action18:08
pabelangerhowever the 'ready node' and 'error node' graphs look off18:08
*** roxanaghe has quit IRC18:09
anteayasorry I missed the discussion about a rax experimental cloud/region18:10
clarkbjeblair: what do you think of spinning up more nodepool builder processes like we did in ft collins in order to get to the vexxhost uploads quicker18:10
anteayawhat is the purpose of this region and why is it so named?18:10
clarkbjeblair: is there command history I can lookup for that?18:10
fungianteaya: actually, the output from the zuul layout job lists every job it knows to run for all projects, and indicates which have a non-voting parameter too18:11
fungianteaya: so it could probably be determined purely on analysis of that output18:11
anteayaoh okay grand18:11
anteayathat would be easy enough18:12
*** keedya has joined #openstack-infra18:12
fungianteaya: based on my interpretation of your problem statement, i don't think you'd be able to figure out what you wanted from the jjb configs, since it's zuul which actually knows what votes and what doesn't18:12
*** david-lyle has quit IRC18:12
mordredyah - I also hadn't heard of rax experimental and am curious as to what it is18:12
*** sigmavirus24 is now known as sigmavirus24_awa18:12
anteayafungi: ah okay thank you18:12
*** david-lyle has joined #openstack-infra18:13
jeblairmordred, anteaya: see the discussion in here a while back.  it's the *dashboard* that is experimental.18:13
mordredAH18:13
anteayaoh okay thank you18:13
mordredthat makes much more sense to me18:13
* anteaya goes to read backscroll18:13
*** fernnest has quit IRC18:13
jeblairit's trying out the templating stuff18:13
anteayaah18:13
jeblairclarkb: "nodepool-builder -d -c /etc/nodepool/nodepool.yaml"18:14
jeblairclarkb: but what we learned from that is ---18:14
jeblairclarkb: either use the existing logging config file or make a new one18:14
clarkbjeblair: oh right18:14
jeblairclarkb: (assuming you care about the output)18:14
*** ihrachys has quit IRC18:14
clarkbI can use tee to tee the stdout to files18:14
clarkbso that is easy enough18:14
jeblairthat'll work too (though it may be stderr?)18:14
clarkbcan catch both18:15
*** sripriya_ has quit IRC18:15
*** dprince has quit IRC18:16
openstackgerritMerged openstack-infra/puppet-infracloud: Set the neutron parameters mandatory for controller  https://review.openstack.org/28483518:16
*** dprince has joined #openstack-infra18:16
*** mikelk has quit IRC18:17
*** sridhar_ram has joined #openstack-infra18:17
*** mriedem has quit IRC18:18
clarkbjeblair: and you ran that command as the nodepool user?18:18
*** yaume_ has quit IRC18:18
openstackgerritMerged openstack-infra/project-config: Add nodepool vexxhost dashboard for grafana  https://review.openstack.org/28721118:18
jeblairclarkb: yes18:18
clarkbcool vexxhost uploads are in progress now18:21
*** angdraug has joined #openstack-infra18:21
*** tphummel has joined #openstack-infra18:22
cjhey folks18:22
cjhttp://logs.openstack.org/56/287056/1/check/gate-gerritbot-pep8/d0c9194/18:22
cjcould someone help me find a pythontidy I can run my code through?18:22
anteayawhat is a pythontidy?18:24
*** annegent_ has joined #openstack-infra18:24
clarkbanteaya: something to format the code in a way that is pep8 compliant18:24
clarkbI just use whatever python vim plugin ubuntu ships with18:24
cjI'm not sure if it exists.  perltidy applies formatting rules to code, so I was hoping there was one for python18:25
anteayaah18:25
cjclarkb: danke18:25
anteayahttps://pypi.python.org/pypi/PythonTidy/18:25
*** abregman is now known as abregman|mtg18:25
anteayaapparently there is18:25
*** jsavak has quit IRC18:25
cjokay, that's the one I found, but I can't seem to hook it in to emacs.18:25
cjclarkb: I'll open it with vim.  is there a command I need to run to have it do its thing?18:26
clarkbcj: I am not sure how to get vim to apply the rules after the code is written18:26
clarkbI am sure there is a way but I always have it apply as I go18:27
*** sarob has joined #openstack-infra18:27
cjokay.18:27
clarkboh if you emacs then jeblair or sdague may have a good answer for you18:27
*** annegentle has quit IRC18:27
*** annegent_ has quit IRC18:28
*** austin81 has joined #openstack-infra18:28
*** dizquierdo has quit IRC18:29
lifelessjeblair: mtreinish: the call is a flush call on a readonly file - I'd just remove the flush call as noise :)18:29
*** mriedem has joined #openstack-infra18:29
*** e0ne has quit IRC18:29
*** ybathia has joined #openstack-infra18:29
mtreinishlifeless: it already has been18:29
*** dizquierdo has joined #openstack-infra18:29
*** e0ne has joined #openstack-infra18:30
mtreinishlifeless: https://review.openstack.org/28692818:30
lifelessmtreinish: ok cool; is there another on-closed error that that uncovered?18:30
*** doug-fish has quit IRC18:30
*** jsavak has joined #openstack-infra18:30
*** doug-fish has joined #openstack-infra18:31
*** e0ne has quit IRC18:31
*** ddieterly has joined #openstack-infra18:31
mtreinishlifeless: jeblair found the bug and fixed it with: https://review.openstack.org/28729918:31
*** e0ne has joined #openstack-infra18:32
clarkbmnaser: OpenStackCloudException: Image creation failed: Error finding address for https://image-ca-ymq-1.vexxhost.net/v2/images/652eb40f-bde5-4e2b-adc2-8eb74b6d507f/file: Unable to establish connection to https://image-ca-ymq-1.vexxhost.net/v2/images/652eb40f-bde5-4e2b-adc2-8eb74b6d507f/file18:33
openstackgerritC.J. Collier proposed openstack-infra/gerritbot: Improve gerritbot error handling  https://review.openstack.org/28705618:33
clarkbmnaser: that happened at 2016-03-02 18:30:32,944 UTC18:33
clarkbso about 3 minutes ago18:33
pabelangerjeblair: will have to dive into experimental grafana. If you select individual zones the graphs match up, when you select all for experimental, 2 don't sync. Maybe the template logic is not correct18:34
pabelangereither way, grafyaml did its job18:34
anteayacurrently jeblair patch https://review.openstack.org/#/c/282099/ add timeout/retries to zuul cloner is in a cannot merge state18:34
clarkbmnaser: is it possible your change hasn't taken effect yet?18:34
fungiclarkb: "error finding address" seems specious. i wonder if that's a misleading error message18:34
anteayajeblair: did you want to rebase?18:34
*** doug-fis_ has joined #openstack-infra18:34
clarkbfungi: it is I looked at glance client code and it looks like they copied the error message from the actual name lookup failure handling code18:34
fungihah18:35
fungiclassic18:35
openstackgerritAustin Clark proposed openstack-infra/infra-specs: Deploy-Stackviz spec  https://review.openstack.org/28737318:35
clarkbfungi: the second part of that is the important bit "unable to establish connection to" is the actual python requests exception18:35
*** doug-fish has quit IRC18:35
fungiyeah, that seemed far more likely18:35
clarkbmnasers theory was caching in nginx confusing glanceclient while nginx finished writing to glance backend18:35
AJaegeranteaya: let me read backscroll, will have some time soonish18:36
mnaserclarkb: the changes should be in effect, unless http://nginx.org/en/docs/http/ngx_http_proxy_module.html#proxy_request_buffering -- "When HTTP/1.1 chunked transfer encoding is used to send the original request body, the request body will be buffered regardless of the directive value unless HTTP/1.1 is enabled for proxying."18:36
mnaserwe could be falling into that18:36
austin81AJaeger: https://review.openstack.org/287373 :)18:37
openstackgerritMark Vanderwiel proposed openstack-infra/project-config: Switch to haproxy driver for heat lbaas v2 integration tests  https://review.openstack.org/28737718:37
anteayaAJaeger: thanks I think fungi suggested a scrape of the layout job output18:37
*** abregman|mtg is now known as abregman|afk18:37
mnaserok so looks like glance does use chunked transfer so that directive is ignored18:37
mnaseri'll force the proxy to use http 1.1, which will make it effective18:38
anteayaAJaeger: I'm still reading backscroll from last night so haven't gotten to that yet18:38
*** fernnest has joined #openstack-infra18:38
*** doug-fis_ has quit IRC18:39
mnaserclarkb: can you try again, i think the directive was ignored because it was a http/1.1 request18:39
clarkbmnaser: yup there are a couple other uploads queued18:39
*** kushal has quit IRC18:40
mnaseryep its proxying in http/1.1 -- lets see how this goes18:40
*** piet has joined #openstack-infra18:40
*** infra-red has joined #openstack-infra18:41
*** sdake has joined #openstack-infra18:41
*** amitgandhinz has quit IRC18:42
openstackgerritMerged openstack-infra/project-config: Adds training-labs post jobs to publish tarballs  https://review.openstack.org/28172118:43
*** amitgandhinz has joined #openstack-infra18:45
*** ajmiller_ has joined #openstack-infra18:45
openstackgerritMerged openstack-infra/shade: Add debug message about file hash calculation  https://review.openstack.org/28728118:45
*** ajmiller has quit IRC18:48
openstackgerritMerged openstack/requirements: Cap osprofiler  https://review.openstack.org/27898418:48
AJaegerfungi, understood for ubuntu-trusty cutover. What do you think of reviewing 285722  and 283445 now - so that we can make changes if needed before? 285722 looks straightforward to me but on 283445 I'd like you to check the tox change18:49
openstackgerritMerged openstack/requirements: Bump python-ironic-inspector-client to 1.5.0  https://review.openstack.org/28597218:49
*** austin81 has left #openstack-infra18:50
*** jcoufal has quit IRC18:50
*** esikache1 has joined #openstack-infra18:51
*** pvaneck has joined #openstack-infra18:53
AJaegeranteaya: yeah, his suggestion is what I would have suggested as well. Run "tox -e zuul" and take it from there.18:54
*** maishsk has quit IRC18:54
*** sripriya has joined #openstack-infra18:54
sdaguecj: https://dague.net/2015/03/24/openstack-emacs-tools/18:54
*** vgridnev has quit IRC18:55
anteayaAJaeger: sounds good, thanks18:55
anteayaglad it is so simple18:55
*** esikache1 has quit IRC18:56
clarkbmnaser: woot I think we just had an image go ready in vexxhost18:56
*** jcoufal has joined #openstack-infra18:56
clarkbmnaser: 7ca18474-3b2e-4cff-91a0-0ea34e722e8b is the image uuid18:57
mnaserwewt18:57
fungiAJaeger: seems fine to me. ideally the database-using tox jobs would become the exception, but i don't know of an easy way to work out which projects/jobs are relying on mysql or postgres databases set up in advance18:57
clarkbmnaser: file size looks good too18:57
clarkbmay be time to revert the max-servers: -1 change18:57
mnaseryeah i think its resolved, as explained by the much slower time it showed in the logs :-P18:57
clarkbI will try booting off that image real quick now18:58
anteayafungi: AJaeger I think dhellmann was looking for a ballpark percentage18:58
*** sripriya_ has joined #openstack-infra18:58
*** kushal has joined #openstack-infra18:58
anteayaso hopefully grepping the output of tox -e zuul will get us a starting point18:58
fungianteaya: my ballpark percentage is "no clue"18:58
anteayayup, that is where I started too18:58
fungiso, yeah, need to find actual numbers18:58
*** maishsk has joined #openstack-infra19:00
*** thorst_ has quit IRC19:00
*** thorst has joined #openstack-infra19:01
*** thorst has quit IRC19:01
*** thorst has joined #openstack-infra19:01
AJaegerfungi, if database-using-tox should be the exception, should we use different naming? Create also a tox 'gate-{name}-tox-db-{envlist}' and change my 'gate-{name}-tox-nodb-{envlist}' to something else? Or create both versions?19:01
*** sripriya has quit IRC19:01
fungiAJaeger: i don't really have a strong opinion on it. what you have now looks fine honestly19:02
AJaegerglad to hear ;)19:03
clarkbmnaser: I successfully booted off of that image so looks good19:03
clarkbssh'd in and everything19:03
anteayayay19:04
*** jsavak has quit IRC19:04
mnaser\o/19:04
mnaserthanks for helping us find out a problem already i guess :-P19:04
fungione of the fringe benefits of donating ;)19:05
*** annegentle has joined #openstack-infra19:05
AJaegerfungi, https://etherpad.openstack.org/p/bindep contains all info about the ubuntu-trusty change...19:05
*** jsavak has joined #openstack-infra19:06
clarkbhrm19:07
clarkbour ubuntu-trusty images are more than twice the size of our devstack-trusty images19:07
clarkbor thereabouts twice as big19:08
fungithat seems odd19:08
clarkbso much for minimal :)19:08
fungipre-caching more packages?19:08
clarkb5.7GB vs 11GB19:08
clarkbfungi: maybe? I haven't dug further than looking at ls output funny19:08
fungistill, 4.3gb of packages is... a lot19:08
fungiso i doubt that's it19:08
pleia2that is a lot, are they shipping a desktop? :)19:09
clarkbthey should have almost nothing in them because they are "minimal"19:09
pleia2I just installed 14.04.4 desktop yesterday and even that only takes up 8.6G19:10
mordredclarkb: we should definitely figure that out - they should certainly have less19:10
mnaserwhat does df -h say on these images19:10
bkeroThat's a crazy amount of space for an os image19:10
mnaseronce they boot19:10
*** daemontool_ has quit IRC19:10
fungiperhaps we aren't properly squashing them?19:10
fungiyeah, i'm thinking what mnaser is probably also getting at19:11
mnaser^ my thoughts, virt-sparsify19:11
pleia2and that's 8.6G on the filesystem, while it's running19:11
openstackgerritClark Boylan proposed openstack-infra/project-config: Revert "Disable image uplaods to vexxhost"  https://review.openstack.org/28738919:11
clarkbbkero: well we cache stuff19:11
AJaegerarmax, dougwig, mestery : https://review.openstack.org/#/c/282572/ needs one neutron liaison review, could either of you look at it, please?19:11
clarkbbkero: but ~5GB is about what we expect19:11
bkeroclarkb: Yeah. I'm hearing 4.3GB of packages, so 700mb for an OS makes sense19:11
mesteryAJaeger: Looking19:11
bkero6GB doesn't make sense19:11
clarkbfungi: mordred pleia2 https://review.openstack.org/287389 will keep us moving forward on vexxhost19:12
pleia2clarkb: yay19:12
*** doug-fish has joined #openstack-infra19:12
fungibkero: a lot of it is the full mirror of all our git repos we keep locally on images, but that should be on both so doesn't account for the discrepancy19:12
*** kencjohnston has joined #openstack-infra19:12
dougwigAJaeger: there you go, all three.19:13
clarkb/dev/disk/by-label/cloudimg-rootfs  188G   10G  170G   6% /19:13
clarkbso we are using roughly that much disk19:13
clarkbI think19:13
fungicurious to know where the bulk of that is residing in the fs19:13
clarkbya I am running the very expensive du -hs * in /19:14
bkeroclarkb: pastebin?19:14
clarkbbkero: its not done yet19:14
bkeroI mean when it is :)19:14
clarkbsure19:14
bkerobaobab is a really cool tool for doing that, but would probably be hellish over ssh19:15
AJaegerdougwig, mestery, armax : Thanks!19:15
*** ajmiller_ is now known as ajmiller19:15
openstackgerritOpenStack Proposal Bot proposed openstack/requirements: Updated from global requirements  https://review.openstack.org/24611419:16
clarkbhttp://paste.openstack.org/show/489033/ that implies it is our cache, will dig in /opt more19:16
fungiyeah, i think the only thing we cache in /opt is git repos for the moment19:16
clarkbya 6.8GB is just /opt/git19:17
fungiso this may be because we19:17
pleia2wow :\19:17
fungigrr19:17
clarkbfungi: hrm?19:17
fungistray finger on the enter key19:17
fungiwe're trying to cache all branches of all repos rather than just the master branch state (which was our previous naive use of the dib source element)19:18
mnasertime to run reverse cache proxies on mirror nodes and use that to pull deps?19:18
clarkbfungi: but devstack-trusty uses the same caching thing19:18
clarkbmnaser: well if we can get it back down to 5.7GB I might be happy with that since we essentially have the same images19:18
fungidouble-checking how we're calling it from the ubuntu-trusty config19:19
sdaguehey, remember that 180 minute backstop idea?19:19
openstackgerritMerged openstack-infra/subunit2sql: Don't extend the passed in targets  https://review.openstack.org/28729919:19
*** kushal has quit IRC19:19
*** korzen has quit IRC19:19
openstackgerritdevdatta-kulkarni proposed openstack/requirements: Adding python-solumclient  https://review.openstack.org/28731319:20
sdaguewe should consider that, because especially on environments like internap we can become our own noisy neighbors when at full utilization19:20
fungiclarkb: looking at the diskimages list, the only differences there shouldn't impact git caching, i agree19:20
sdagueand drive everything past 2 hours19:20
sdaguehttps://review.openstack.org/#/c/286276/19:20
*** abitha has joined #openstack-infra19:20
mnaserour glance images are currently stored in a single az (triple replicated on ceph).  we're looking at deploying glance-api with cache on the other az to make things faster19:20
sdague2 fails in the gate, on internap, because the job was still going at 2 hours19:20
sdaguebut it was passing19:21
mnaserso if that becomes an annoyance, we might do that at our side19:21
*** Sukhdev has joined #openstack-infra19:21
bkeroclarkb: are the repos copied out of /opt/git when they're used?19:21
*** sigmavirus24_awa is now known as sigmavirus2419:22
bkeroor are the 'git checkout $THING' in place?19:22
bkerothey*19:22
fungiclarkb: i see we're not doing QEMU_IMG_OPTIONS: compat=0.10 on ubuntu-trusty... should we be?19:22
*** Sukhdev has quit IRC19:22
clarkbfungi: I think hpcloud was the only cloud that required it but it could be a difference in compression between versions so maybe we should19:22
clarkbbkero: they use gits hardlinking cloning stuff iirc19:23
*** infra-re_ has joined #openstack-infra19:23
bkeroSquashfs the repos? :319:23
openstackgerritMerged openstack-infra/project-config: networking-odl: enable test with ODL release of boron-snapshot  https://review.openstack.org/28257219:23
*** alivigni has joined #openstack-infra19:23
*** jed56 has quit IRC19:23
*** abitha has left #openstack-infra19:23
*** infra-red has quit IRC19:25
*** infra-red has joined #openstack-infra19:26
fungisdague: i wonder if that means we're overutilizing/oversubscribed too far and need a lower quota to stop adversely impacting their systems, or if there might be something else going on in internap19:26
mgagnefungi: currently running at 2:1 (instead of 3:1 a couple of months ago)19:27
sdaguefungi: right, I don't know, but moving that backstop up to 180minutes would at least not fail good jobs19:27
fungisdague: the jobs just ran slower, no indication they spent a lot of time updating git repos or anything where they might have been hung on a git remote action?19:27
*** maishsk has quit IRC19:27
sdaguefungi: yeh, no specific hot spot19:27
sdaguejust everything is going slow19:27
*** ociuhandu has quit IRC19:27
sdaguepip_install is like 2x what I'd expect19:28
sdaguemy guess is disk io contention19:28
clarkbfungi: when I can get out of my chair and boot my test machine I will run some local dib build tests to see if the compat option is making things saner19:28
fungii suppose increasing the timeout is an option. i don't know for sure whether that removes some of our only backpressure on people merging things which will make the jobs run longer and soon start creeping up to the new timeout19:28
sdagueI don't think that timeout is really back pressure on that19:28
sdaguewe've got social pressure well before that19:29
*** infra-re_ has quit IRC19:29
sdagueI actually think a high percentage of our current gate reset is basically a combo of 2 timeouts right now19:30
sdaguethat we are triggering because of load on providers19:30
sdaguewhich... means rechecks, generating more load19:30
sdagueso vicious cycle19:30
fungii'm not particularly opposed, though i would love to get a better handle on whether we're seeing new performance regressions in particular providers so either we or they can address that19:30
anteayadhellmann AJaeger fungi: http://paste.openstack.org/show/489038/ 5.87% of total jobs that zuul knows about are non-voting19:30
*** caphrim007 has joined #openstack-infra19:30
sdaguesure, once openstack health is back that has a set of data19:30
anteayaaccording to my grepping + math19:30
sdaguebut we're blind there atm19:30
fungiright19:30
AJaegeranteaya: only? Wow19:31
sdaguehonestly, the timeout reset is super hard to spot like this unless you have a patch impacted by it19:31
anteayaAJaeger: yeah I thought it would be higher too19:31
sdaguebecause there is nothing dead on the zuul queue to look at19:32
AJaegeranteaya: can you calculate also jobs run in check queue only19:32
anteayaI'll see19:32
sdagueI happened to have a patch in gate that got hit by it twice19:32
clarkbwhen jobs timeout they should report failure19:32
clarkbso you would see fail by timeout19:32
anteayaAJaeger: not sure how since they are all called gate19:32
fungianteaya: the output zuul produces is per pipeline19:33
fungianteaya: so you could trim it down to just the output for one pipeline19:33
anteayafungi: ah okay I will continue to browse the file and see if I can get a number19:34
*** doug-fis_ has joined #openstack-infra19:35
caphrim007hey folks, i'm having trouble getting shade to list my servers. http://pastebin.com/HPrHSA0d is the error i'm receiving. its a packstack install19:35
*** Sukhdev has joined #openstack-infra19:35
AJaegeranteaya: I just manually edit the file and copied the check pipelines out.19:35
AJaegeranteaya: I have a different tree than you have, so numbers are "ball-park" figures only19:35
AJaeger586 non-voting out of 4101 check jobs19:36
*** doug-fish has quit IRC19:36
fungithough also, this number is likely only useful for limited purposes. the frequency with which these jobs are run is unrelated to the number of them which exist19:37
anteayaAJaeger: fungi I've updated the paste look at line 1019:37
anteayaI might not be grepping properly19:37
*** rockyg has joined #openstack-infra19:37
*** exploreshaifali has quit IRC19:38
fungiyou could probably get that out of graphite or logstash, but then even frequency with which they're run is not a good indicator of resources consumed without taking into account how long each runs19:38
anteayaAJaeger: I don't understand how you are getting those numbers, but I'm not having confidence in my method to get accurate numbers19:38
AJaegeranteaya: your grep looks fine, just wanted to say that I'm not on head19:38
anteayaAJaeger: oh19:38
fungiif the goal is figuring out how much of our worker pool gets spent running non-voting jobs, the figure is probably going to not be easy to arrive at19:39
anteayafungi: very true, these are just the jobs that exist19:39
AJaegeranteaya: I had 12422 and 689 jobs overall...19:39
anteayafungi: that seems to be the conclusion I am reaching as well19:39
*** ybathia has quit IRC19:39
anteayaAJaeger: 12422 what?19:39
*** davideagnello is now known as davideagnello__19:40
AJaeger12422 total jobs - and 689 non-voting. So same ballpark figure as yours...19:40
anteayaoh okay thanks19:40
anteayaso a very rough ballpark19:41
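A minimal sketch of the arithmetic behind these ballpark figures, using only the counts AJaeger quoted from his own tree (586 of 4101 check jobs, 689 of 12422 jobs overall). The helper name is hypothetical, and the counts are not from head, so the results are rough by construction.

```python
# Percentage of non-voting jobs, from the counts quoted in the channel.
# The helper name is illustrative; the counts come from AJaeger's local tree.

def percent(part, total):
    """Return part as a percentage of total."""
    return 100.0 * part / total


overall = percent(689, 12422)   # non-voting jobs across all pipelines
check = percent(586, 4101)      # non-voting jobs in the check pipelines only

print(round(overall, 2))
print(round(check, 2))
```

The overall share lands in the same ballpark as anteaya's 5.87%, while the check-only share is noticeably higher, which is consistent with non-voting jobs being concentrated in check.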
*** infra-re_ has joined #openstack-infra19:41
anteayawhich likely isn't of any value in helping us decide how to best use resources19:41
anteayadhellmann: ^^19:41
*** korzen has joined #openstack-infra19:41
openstackgerritSean Dague proposed openstack-infra/project-config: mostly standardize dsvm tempest jobs on 180 minute backstop  https://review.openstack.org/28740719:43
sdaguefungi ^^^19:43
pabelangerwhen we send out ATC passes for the summit, do we check against the current development cycle or the previous?  So with Austin, you need commits in liberty or mitaka?19:44
*** infra-red has quit IRC19:44
clarkbpabelanger: its the current cycle19:44
clarkbso mitaka19:44
clarkbpabelanger: in the past it was the current and previous but that has been adjusted based on scale?19:44
fungiwe could probably, with some effort, determine aggregate runtime for each job (as a product of run count and run duration) from graphite, then map that to the information zuul has about whether each is voting or non-voting since that's determined purely on job name19:44
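A rough sketch of the calculation fungi outlines here, assuming per-job run counts and mean durations have already been pulled from graphite and the non-voting job names have already been read from zuul's layout. The job names, numbers, and function name below are all hypothetical placeholders, not real data.

```python
# Share of aggregate runtime spent on non-voting jobs.
# Aggregate runtime per job = run count * mean run duration.
# All job names and figures here are invented for illustration.

def nonvoting_runtime_share(stats, nonvoting_names):
    """stats maps job name -> (run_count, mean_minutes)."""
    total = 0.0
    nonvoting = 0.0
    for name, (runs, mean_minutes) in stats.items():
        cost = runs * mean_minutes
        total += cost
        if name in nonvoting_names:
            nonvoting += cost
    return nonvoting / total if total else 0.0


stats = {
    "gate-example-python27": (1000, 10.0),
    "gate-example-dsvm-nv": (50, 60.0),
    "gate-example-pep8": (2000, 2.0),
}
nonvoting_names = {"gate-example-dsvm-nv"}

share = nonvoting_runtime_share(stats, nonvoting_names)
print(round(100 * share, 1))
```

In this invented data set one job in three is non-voting, yet it accounts for a different fraction of the aggregate runtime, which is why the count of jobs alone says little about worker pool usage.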
pabelangerclarkb: right, so how does commits after the ATC email work?19:44
anteayafungi: interesting19:45
*** davideagnello__ is now known as davide85_19:45
*** Kaiyan has joined #openstack-infra19:45
clarkbpabelanger: there are several discount pass waves sent out and they stop before the expiration on the early reg iirc19:46
clarkbpabelanger: so you have to get in before that19:46
*** doug-fish has joined #openstack-infra19:46
fungipabelanger: criteria are decided by the event organizers, but this time around it's basically anyone who owns a change merged to a tc-recognized project deliverable between the last release and the integrated feature freeze19:46
*** Sukhdev has quit IRC19:46
pabelangerfungi: clarkb: perfect19:46
*** korzen has quit IRC19:46
fungiso october 15 through probably march 3 this time19:46
fungias my plan is to try and fire off the final batch tomorrow19:47
*** doug-fis_ has quit IRC19:47
*** rguillebert has quit IRC19:47
openstackgerritMerged openstack-infra/shade: Add test for os_server Ansible module  https://review.openstack.org/28542419:48
pabelangerfungi: do we have that documented some place on the wiki?19:49
fungipabelanger: no, since it varies from cycle to cycle it's generally not been well tracked19:49
pabelangerokay, thanks19:49
*** infra-red has joined #openstack-infra19:50
fungialso i feel like publishing explicit qualifying requirements increases the number of people who will abuse it for free conference admission, as opposed to a more nebulous "we give free passes to active contributors for the cycle"19:50
*** infra-re_ has quit IRC19:52
fungifor the most part, people actually working on upstream development in official projects should never need to ask. most of the questions i get about it are people who are trying to get one commit in at the last minute so they don't have to pay $600, but aren't actually planning to participate in the design summit at all and just want to go to the conference for free19:53
*** tongli has joined #openstack-infra19:53
*** jamesmcarthur has quit IRC19:54
anteayathere is also the extra-atcs category for anyone who makes a contribution that gerrit doesn't know about19:55
fungii get tons of questions like "here's a one-line patch changing a couple words in a code comment which i and several of my colleagues are listed as co-authors on as our only contribution, where's our free pass?" (how does a change like that even get approved by core reviewers on a project?), or "i had a patch merged in january of last year but haven't done anything since then, can i still get a free19:56
fungiconference pass?"19:56
mordredI should not respond to mailing lists19:56
fungioh, and several this time were "i got a free pass for tokyo but couldn't attend, can i use it to get into the conference in austin?"19:56
*** kzaitsev_mb has joined #openstack-infra19:57
Kaiyanhttps://review.openstack.org/#/c/286179/ Hello everyone! Can anyone look at this change and approve it please? Thanks!19:57
anteayafungi: yeah it is the addition of a whitespace or newline with 3 co-authors that boggles my mind19:58
Clintyou should answer that you hid the golden tickets in a chocolate bar19:58
*** sdake_ has joined #openstack-infra19:59
fungiClint: oh, i've got a golden ticket!19:59
fungiopenstack conference wonkypass19:59
*** BobBall is now known as BobBall_AWOL20:00
*** BobBall_AWOL has quit IRC20:00
*** sdake has quit IRC20:00
*** jamesmcarthur has joined #openstack-infra20:00
fungisdague: at what point did we start being able to infer the devstack-gate inner timeout from the jenkins timeout wrapper value?20:00
*** tqtran has joined #openstack-infra20:00
*** thorst has quit IRC20:01
AJaegerKaiyan: not everybody here reviews everything. Please tell us a bit more about your change so that the right people can look at it - if they have time...20:01
*** maishsk has joined #openstack-infra20:01
*** thorst has joined #openstack-infra20:01
AJaegerfungi: 3rd of February jeblair created a patch - see Iff5e96bd80b8f165ccc7d6be09b53208613571eb20:01
fungisdague: oh, i bet it defaults to 12020:01
fungiAJaeger: ooh!20:02
openstackgerritMerged openstack-infra/project-config: Adding integration test support for lbaas v2 dashboard  https://review.openstack.org/27539420:02
morganmordred: lol20:02
* mordred goes back to hiding20:02
fungiAJaeger: sdague: that's some serious rainbow unicorn magic. i'm thrilled20:02
*** sdake has joined #openstack-infra20:03
fungijeblair: ^ sorry i missed 275908, that's awesome20:03
*** piet has quit IRC20:03
mtreinishjeblair: https://pypi.python.org/pypi/subunit2sql/1.4.0 now to wait for puppet on subunit-worker01.o.o20:03
AJaegerfungi: and then we removed the TIMEOUT variable everywhere in project-config...20:04
fungiAJaeger: yep, i see that20:05
fungii simply missed introduction of the magic20:05
AJaeger;)20:05
*** thorst has quit IRC20:05
*** sdake_ has quit IRC20:06
*** hashar has joined #openstack-infra20:06
*** thorst has joined #openstack-infra20:08
jeblairmordred: that post is far more sensible than i was expecting based on your earlier warning :)20:10
mordredjeblair: it's all about setting expectations20:10
jeblairfungi: i think zaro did the hard work adding the feature to jenkins20:10
Kaiyanhttps://review.openstack.org/#/c/286179/ This review is to set monasca persister to only run monasca tempest tests in check and gate jobs instead of running all the 1000+ tests. Please review it when you get a chance! Thank you!20:10
mordredI successfully set your expectations low enough that the post seemed sensible20:10
mordredcompared to the horror-show you know it COULD have been :)20:11
fungijeblair: oh, yes i do remember him working on the jenkins side of that--thanks zaro! it's a massive simplification for our job configs and has been a long time coming20:11
ianwmordred: ahh, i did think it seemed like we should pass username/password in when i was going through the swift upload (https://review.openstack.org/#/c/287224/).  so do we think it's fixed?20:11
mordredianw: yah - we did the same thing in occ for normal swift Connection20:12
openstackgerritAndreas Jaeger proposed openstack-infra/project-config: Use ZUUL_REFNAME (1/2) for branch checking  https://review.openstack.org/28741420:12
openstackgerritAndreas Jaeger proposed openstack-infra/project-config: Use ZUUL_REFNAME (2/2) for branch checking  https://review.openstack.org/28741520:12
AJaegerfungi, here's the REFNAME cleanup we discussed earlier ^20:12
mordredianw: so I think between your patch and that patch we should be much better now20:12
mordredI need to cut a new shade release, but there was enough going on that I thought maybe today was a bad time?20:13
*** infra-red has quit IRC20:13
openstackgerritgordon chung proposed openstack/requirements: bump ceilometermiddleware upper constraints  https://review.openstack.org/28741620:13
mordredinfra-root: ^^ any thoughts on fear/joy related to a new shade release?20:13
ianwmordred: i haven't had my first coffee yet, let alone read scrollback :)20:13
mordredianw: haha20:14
mordredianw: DRINK MOAR COFFEEEE20:14
ianwprobably the world is on fire20:14
openstackgerritDouglas Mendizábal proposed openstack/requirements: Update upper-constraints for barbianclient  https://review.openstack.org/28741720:14
anteayamordred: I have no feelings on the new shade release, so I guess I don't have any fear20:15
anteayajoy in as much as it has the potential to make others happy20:15
*** claudiub has joined #openstack-infra20:15
* anteaya goes to find food20:15
mgagnefound a cool online dstat graphing tool: http://lamada.eu/dstat-graph/20:16
*** jsavak has quit IRC20:16
mgagnecan also be downloaded from https://github.com/Dabz/dstat_graph20:16
*** bhunter_ has joined #openstack-infra20:16
openstackgerritMerged openstack-infra/project-config: Add a third scenario for packstack integration testing  https://review.openstack.org/28657920:16
*** piet has joined #openstack-infra20:17
*** doug-fis_ has joined #openstack-infra20:17
*** claudiub has quit IRC20:17
cjI've fixed my gerrit commit.  Is there a way to tell jenkins to kick off another test run?20:17
*** sripriya__ has joined #openstack-infra20:17
cjhttps://review.openstack.org/#/c/287056/20:17
*** claudiub has joined #openstack-infra20:17
*** doug-fish has quit IRC20:18
*** keedya has quit IRC20:19
cjah.  To re-run check or gate jobs, leave a comment on the review with the form “recheck”.20:19
*** maishsk_ has joined #openstack-infra20:20
*** jsavak has joined #openstack-infra20:20
*** maishsk has quit IRC20:20
*** maishsk_ is now known as maishsk20:20
AJaegercj: we automatically run a new check when you upload a new change20:20
*** sripriya_ has quit IRC20:20
*** reed_ has joined #openstack-infra20:21
anteayacj: go to status.openstack.org/zuul and put 287056 in the Filters: text box20:22
anteayacj: your patch is in the check queue awaiting a node20:22
anteayaand has been for an hour and 49 minutes20:23
fungimordred: holding off non-urgent behavior changes to our primary ci toolchain makes sense. i think the devs currently bum-rushing feature freeze appreciate our change slushiness this week20:23
*** annegentle has quit IRC20:23
cjcool, thanks.  I'd like to merge in changes from 279169 - is there an easy command to do that?20:23
*** apoorvad has quit IRC20:23
mordredfungi: yah. that's kinda what I was thinking20:24
mordredfungi: wait - maybe we've been taking the wrong approach this entire time20:24
*** annegentle has joined #openstack-infra20:24
anteayacj: yes in your workspace go to the master branch with no changes going with you, then git review -d 27916920:24
mordredfungi: maybe we should ONLY make changes to the infrastructure during feature freeze rush20:24
anteayawhich creates a new branch with Gabor's change on it20:24
mordredfungi: to possibly block some percentage of the last-minute bad ideas20:25
rcarrillocruzhaha20:25
anteayathen put your change as a child of 27916920:25
fungimordred: you can never block enough of the bad ideas to make that worthwhile20:25
fungithere are simply too many20:25
mordredfungi: good point20:25
* mordred glares at floating ips20:25
*** ccrouch1 has quit IRC20:25
cjanteaya: and if I wanted to merge these changes in to my branch?20:26
mordredfungi: hey, wanna buy a server that's only available through a NAT connection? I'll sell you one for $5 ...20:26
mordredcj: you do not want to do that20:26
cjsame but in my branch instead of master?20:26
anteayacj: we don't merge locally20:26
cjmordred: oh, alright.  How about if I want to put my branch in a state that it can merge gracefully?20:26
fungimordred: i can give you a really cheap rate on servers you'll never be able to reach20:26
anteayawe create individual patches and zuul merges20:26
mordredcj: yes, that is a thing you can enjoy doing20:26
cjmordred: heh, okay.  I meant how do I find the merge conflicts?20:26
mordredcj: so - git review -d 279169 will make a local branch for you and download the change in question into that branch20:26
mordredcj: if you then cherry-pick your change on top of it20:27
cjokay, thanks.20:27
mordredcj: you can resolve the merge conflicts20:27
* anteaya wonders what the difference is between what mordred and what she had said20:27
mordredcj: and when you run git review again, it will upload your change, except now it will be parented on the 279169 change20:27
mordredanteaya: you said the same words - but you said less of them20:27
ianwfungi / jeblair : when you have a moment -> https://review.openstack.org/#/c/285876/ ; fixing up fight between pip/packages on fedora-minimal images.  this feels like the right place to do it, because puppet shouldn't know about it, and without we leave the system in a really precarious state20:28
anteayaah more words were needed20:28
AJaegersdague: https://review.openstack.org/#/c/286085 is a nova vif change for project-config. Do you want to review that?20:28
*** andreykurilin__ has joined #openstack-infra20:28
mordredanteaya: I'm assuming that cj is used to a different way of doing git and that all of our things are strange and frightening, so trying to step back and remember which things of how things work here he might not assume20:29
anteayaI'm not often accused of not using enough words20:29
cjfurrfu.  looks like PythonTidy does not emit code that passes pep820:29
mordredanteaya: I'm NEVER accused of that20:29
anteayayup20:29
mordred:)20:29
anteayahe he20:29
*** matrohon has joined #openstack-infra20:29
cjthanks to both of you, mordred and anteaya20:29
mordredcj: I have thus far found no programs that are good at reformatting python code20:29
mordredcj: I've looked a few times20:29
*** rguillebert has joined #openstack-infra20:29
cjmordred: that's too bad.  I like perltidy.  I wonder if I can alter it to work with python, ruby, javascript, etc.20:30
mordredcj: well - you might be better off trying to morph https://pypi.python.org/pypi/pyfmt/ into shape20:30
*** keedya has joined #openstack-infra20:31
*** annegentle has quit IRC20:31
fungimordred: closest i've found is autopep8. it's not perfect but it's not terrible for trying to neaten up old projects which didn't follow any consistent style20:31
mordredthe biggest problem I have with pyfmt is that it disagrees with me on how to deal with line continuations after parentheses20:31
*** doug-fish has joined #openstack-infra20:31
mordred_I_ prefer to line break after opening parens a-la lisp20:31
*** ccrouch has joined #openstack-infra20:31
mordredand try to avoid visual indentation blocks at all costs20:31
fungihttps://pypi.python.org/pypi/autopep820:31
mordredpyfmt thinks the opposite is what should happen20:31
fungii still end up cleaning up behind it and turning off some rules, but it's a start20:32
*** jsavak has quit IRC20:32
fungiand faster than hand-formatting the entire project in many cases20:32
AJaegerany project-config core around to +2 a change to fix the bug association for training-guides, please? https://review.openstack.org/#/c/283493/20:32
mordredfungi: I should see if I can get autopep8 to line-break the way I like20:32
*** jsavak has joined #openstack-infra20:33
*** doug-fis_ has quit IRC20:33
AJaegerjeblair: do you want to merge the wheel mutex change now? Or wait until M3 is over? https://review.openstack.org/27597520:33
fungimordred: if memory serves (been months since the last time i used it) i turned off line breaking in it and used a regex to find long lines so i could fix them myself20:33
mordredthat's probably the best idea20:33
fungior to find lines i'd long ago used backslash continuations on and have since thought better of20:33
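A minimal sketch of the kind of regex pass fungi mentions, assuming a 79-character limit and plain string input. The pattern names and the helper are hypothetical and are not part of autopep8 or any other tool.

```python
import re

# Lines of 80 or more characters exceed the assumed 79-character limit.
LONG_LINE = re.compile(r"^.{80,}$")
# A trailing backslash marks an explicit line continuation.
BACKSLASH_CONTINUATION = re.compile(r"\\$")


def find_candidates(source):
    """Return (line_number, reason) pairs for lines worth fixing by hand."""
    hits = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        if LONG_LINE.match(line):
            hits.append((lineno, "long"))
        if BACKSLASH_CONTINUATION.search(line):
            hits.append((lineno, "backslash"))
    return hits
```

Running it over each file and fixing the reported lines manually keeps the wrapping decisions with the author rather than the formatter.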
AJaegerOne more change for wrong bug association: https://review.openstack.org/#/c/283453/20:34
mordredmy current rubric is "if I can't figure out how to paren-break a line, I probably need to make a helper method or function" :)20:34
AJaegerthanks, anteaya20:34
*** IlyaG has quit IRC20:34
anteayawelcome20:34
* AJaeger waves good-bye20:35
anteayabye20:35
anteayathanks AJaeger20:35
anteayasee you tomorrow20:35
mtreinishjeblair: puppet just updated subunit2sql: http://puppetdb.openstack.org/report/subunit-worker01.openstack.org/faa5a2130bde54713e44a2c97704affe256b344d20:35
AJaegertalk to you tomorrow, anteaya !20:35
mtreinishso we can restart the worker and hopefully it'll all work correctly20:35
*** annegentle has joined #openstack-infra20:35
mordredfungi: wow. I just ran autopep8 on shade and it only made 3 changes20:35
mordredfungi: http://paste.openstack.org/show/48904620:35
anteayanice20:36
*** rcernin has quit IRC20:36
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Test overcloud SSL  https://review.openstack.org/28198820:37
mtreinishinfra-root: ^^^ any of you can help with that part :)20:37
*** ybathia has joined #openstack-infra20:38
pleia2mtreinish: yeah, I can take care of this again20:38
mtreinishpleia2: cool, thanks20:38
pleia2mtreinish: same as yesterday?20:38
mtreinishpleia2: yep, although this time hopefully we won't have to stop it after a couple min20:39
*** claudiub has quit IRC20:39
fungi(and then run around with a fire extinguisher)20:39
*** annegentle has quit IRC20:40
pleia2mtreinish: running20:40
mtreinishpleia2: well we've got multiple runs already, so that's a good sign :)20:40
cjmordred: hmm... one of the nice things about perltidy is ~/.perltidyrc, which can alter the minutia about how formatting is implemented20:40
pleia2mtreinish: as you saw yesterday, I don't think the filter is working though, since it's still looking at everything20:40
*** maishsk has quit IRC20:41
pleia2mtreinish: so it tries to grab testrepository.subunit and gets a 404 a lot20:41
*** sripriya__ has quit IRC20:41
pleia2but I'm not seeing the IO failures, so yay :)20:42
*** rockyg has quit IRC20:42
openstackgerritMerged openstack-infra/project-config: Add groups for Training labs  https://review.openstack.org/28349320:42
*** maishsk has joined #openstack-infra20:42
mtreinishpleia2: well it probably will still get 404s on runs that don't have subunit20:42
mtreinisheither because they don't generate it, or because of an early failure20:42
mtreinishand also the gz vs non-gz thing will generate 404s too I think20:42
mtreinishwe probably shouldn't log a stack trace and just do a single log message20:43
*** ybathia has quit IRC20:43
sdagueAJaeger: thanks20:43
pleia2mtreinish: ah, ok20:43
mordredcj: ok. I was wrong - pyfmt is horrible. autopep8, as fungi mentions, is pretty good though20:43
*** annegentle has joined #openstack-infra20:43
fungii like that autopep8 mostly doesn't overreach20:43
fungiit reformats things that are safe to reformat, and leaves the hard decisions to you20:44
openstackgerritMerged openstack-infra/project-config: Add proper groups for Training guides  https://review.openstack.org/28345320:44
mtreinishpleia2: you can see the little blue on the bottom right of the first graph: http://status.openstack.org/openstack-health/#/20:44
mtreinishthat matches up to how many runs we've worked through...20:44
fungithat said, i wouldn't use it in automation personally. i've really only found it useful for cleaning up some of my older and larger codebases which needed modernizing20:44
mordredyup20:45
*** david-lyle has quit IRC20:45
fungiand from that point on, i style-check all my commits20:45
*** maishsk has quit IRC20:45
pleia2mtreinish: oh good :)20:46
mtreinishpleia2: is it still going ok? I don't see any new ones being added20:46
pleia2mtreinish: yep20:46
sdaguecj: well, with python there aren't really different standards, because whitespace counts20:47
mtreinishhmm, ok20:47
sdaguemy recommendation, get flake8 running in your emacs buffer under flymake, and it will just tell you when things are wrong in buffer20:47
*** david-lyle has joined #openstack-infra20:47
openstackgerritRussell Bryant proposed openstack-infra/project-config: Move networking-odl rally job into check.  https://review.openstack.org/28259120:48
mtreinishpleia2: ah, it's just sluggish (or I'm impatient)20:48
*** roxanaghe has joined #openstack-infra20:49
*** |-paul-| has joined #openstack-infra20:50
lifelessmtreinish: oh great20:50
mordredsdague: line wrapping and continuation style are still left - but I agree with you - it's mostly just the-one-style20:51
*** gyee has quit IRC20:52
*** rockyg has joined #openstack-infra20:52
*** rossella_s has quit IRC20:53
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Add undercloud idempotency test  https://review.openstack.org/27921820:54
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: set -o pipefail in deploy.sh  https://review.openstack.org/28743320:54
*** rossella_s has joined #openstack-infra20:54
*** kencjohnston has quit IRC20:55
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: set -o pipefail in deploy.sh  https://review.openstack.org/28743320:56
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Add undercloud idempotency test  https://review.openstack.org/27921820:56
cjthanks, mordred.  I'm feeding the source to autopep8 now and cleaning up the exceptional cases.  Can someone tell me how to make a print statement python3 compatible?  Wrap () around the arguments?20:57
*** pvaneck has quit IRC20:57
*** maishsk has joined #openstack-infra20:57
*** pvaneck has joined #openstack-infra20:58
openstackgerritMerged openstack/requirements: Updated from generate-constraints  https://review.openstack.org/28590120:59
*** apoorvad has joined #openstack-infra20:59
*** kencjohnston has joined #openstack-infra20:59
*** kgiusti has left #openstack-infra21:00
*** korzen has joined #openstack-infra21:01
*** sridhar_ram1 has joined #openstack-infra21:01
keedyaAJaeger: can I get a review on https://review.openstack.org/#/c/284827/21:02
*** _ody has quit IRC21:03
*** sridhar_ram has quit IRC21:03
*** _ody has joined #openstack-infra21:03
mordredcj: yes21:04
anteayacj: looks like yes: https://docs.python.org/3.0/whatsnew/3.0.html#print-is-a-function21:04
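A short illustration of the answer above, written for Python 3. The function and job name are made up for the example. Note that on Python 2 a multi-argument print("a", "b") prints a tuple unless the file starts with from __future__ import print_function (available on Python 2.6 and later), so that import is probably worth adding if the code must run on both.

```python
import io
import sys


def report(job, status, stream=None):
    """Print a one-line job summary using the Python 3 print() function."""
    if stream is None:
        stream = sys.stdout
    # Python 2 statement form:  print "job:", job, "status:", status
    # Python 3 function form wraps the arguments in parentheses:
    print("job:", job, "status:", status, file=stream)


# Capture the output in a buffer instead of writing to the terminal.
buf = io.StringIO()
report("gate-example-pep8", "SUCCESS", stream=buf)
```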
*** abregman|afk has quit IRC21:07
*** arxcruz has joined #openstack-infra21:08
*** aysyd has quit IRC21:09
openstackgerritMerged openstack-infra/project-config: Revert "Disable image uplaods to vexxhost"  https://review.openstack.org/28738921:09
*** doug-fish has quit IRC21:09
*** moravec has joined #openstack-infra21:09
*** abregman has joined #openstack-infra21:10
openstackgerritArtur Korzeniewski proposed openstack-infra/project-config: Create DVR multinode grenade job for Neutron upgrade tests  https://review.openstack.org/25021521:10
keedyaHi All, I know you are busy!! I have a new project that i am trying to push to openstack. Is there a chance to get reviews ?21:10
*** julim has quit IRC21:11
korzenAJaeger, I have addressed your comment in https://review.openstack.org/#/c/25021521:11
openstackgerritMerged openstack-infra/puppet-jenkins: Increase timeout for yum install of development tools  https://review.openstack.org/27408721:13
*** ldnunes has quit IRC21:14
*** doug-fish has joined #openstack-infra21:15
*** doug-fis_ has joined #openstack-infra21:16
*** korzen has quit IRC21:17
jheskethMorning21:18
*** aeng has quit IRC21:18
anteayafungi AJaeger http://paste.openstack.org/show/489054/ so I figured out why the output was saying 8500 jobs and my grep had 12000 jobs, the jobs are in both check and gate, many of them anyway21:19
anteayamorning jhesketh21:19
anteayafungi AJaeger so the percentage of non-voting jobs in check is 15%21:19
*** doug-fish has quit IRC21:20
anteayawe have 15 non-voting jobs in the gate queue and I will look at cleaning those up21:20
*** aeng has joined #openstack-infra21:20
openstackgerritTravis Tripp proposed openstack/requirements: Add python-searchlightclient>=0.2.0  https://review.openstack.org/26839421:23
*** doug-fish has joined #openstack-infra21:23
*** amrith is now known as _amrith_21:24
*** doug-fis_ has quit IRC21:25
*** smarcet has quit IRC21:25
*** dkranz has quit IRC21:27
*** dkranz has joined #openstack-infra21:30
*** alivigni has quit IRC21:30
openstackgerritgordon chung proposed openstack/requirements: bump ceilometermiddleware upper constraints  https://review.openstack.org/28741621:34
*** doug-fis_ has joined #openstack-infra21:35
*** reed_ has quit IRC21:35
*** baoli has quit IRC21:38
*** ZZelle_ has joined #openstack-infra21:38
*** doug-fish has quit IRC21:38
openstackgerritabregman proposed openstack-infra/project-config: Add new expermintal job - centos with neutron  https://review.openstack.org/28723921:39
clarkbsomething something success https://jenkins01.openstack.org/job/gate-puppet-glance-puppet-unit-3.7-dsvm-centos7/68/consoleFull21:41
clarkbmnaser: ^ you are officially providing test resources now21:41
clarkbmnaser: are you ok with us bumping the number of servers we use up to 25 now?21:41
mnaseroh sweet, lets go for it if things are going well clarkb21:42
pabelangerclarkb: mnaser http://grafana.openstack.org/dashboard/db/nodepool-vexxhost is also a thing too21:42
mnaserpabelanger: ive been eyeing it for sometime :) interesting metrics21:43
mnaserthe most interesting one for me being error node launch attempts21:43
mnaserwe track this already internally anyways21:43
*** doug-fish has joined #openstack-infra21:43
*** dizquierdo has quit IRC21:43
openstackgerritDavid Moreau Simard proposed openstack-infra/project-config: Make Packstack integration jobs voting, add to gate  https://review.openstack.org/28746121:44
pabelangermnaser: ya, so far nothing reported, which is good21:44
*** dizquierdo has joined #openstack-infra21:44
mnaseri'm hoping at 1 instance nothing breaks :-P21:44
*** angdraug has quit IRC21:45
dmsimard^ AJaeger, anteaya, pabelanger, ianw et al, thanks for your help. Packstack integration jobs are ready to gate !21:45
clarkbthe one thing I want to double check is that we have a job that ran and used our mirror with no problems21:46
*** doug-fis_ has quit IRC21:46
pabelangerdmsimard: good work! I need to spend some time looking at what you did21:46
*** erlon has quit IRC21:46
openstackgerritClark Boylan proposed openstack-infra/project-config: Use all our vexxhost quota  https://review.openstack.org/28746221:47
*** jsavak has quit IRC21:47
clarkbso will look into that while ^ gets through check testing21:47
dmsimardpabelanger: nothing special (in project-config)21:47
dmsimardpabelanger: but happy to discuss nonetheless :)21:47
*** jsavak has joined #openstack-infra21:47
dmsimardI'm writing an e-mail to rdo-list and openstack-dev right now about it21:47
*** |-paul-| has quit IRC21:47
mnaserclarkb: i setup the right quota but if you'd be kind enough to double check it (if you have not).   that would be useful21:47
clarkbhttps://jenkins01.openstack.org/job/gate-tempest-dsvm-full/36400/console used the mirror just fine so thats good21:48
*** krtaylor has quit IRC21:48
clarkbmnaser: I did look, if -1 means unlimited then I think our ram quota is ultimately what determines our instance quota and if I mathed right you got it at 25 8GB instances21:48
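The arithmetic clarkb describes can be sketched as follows, assuming (as he hedges) that an instance quota of -1 means unlimited and that each node uses an 8 GB flavor. The function name and the megabyte units are illustrative choices, not taken from nova or nodepool.

```python
def effective_instance_limit(ram_quota_mb, flavor_ram_mb, instance_quota=-1):
    """Return how many instances fit, given RAM and instance quotas.

    A negative instance_quota is treated as unlimited (an assumption),
    in which case the RAM quota alone caps the instance count.
    """
    by_ram = ram_quota_mb // flavor_ram_mb
    if instance_quota < 0:
        return by_ram
    return min(by_ram, instance_quota)


# A RAM quota sized for 25 instances of 8 GB each.
limit = effective_instance_limit(25 * 8192, 8192)
```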
openstackgerritAnita Kuno proposed openstack-infra/project-config: Remove non-voting jobs from the gate pipeline.  https://review.openstack.org/28746421:48
*** sc68cal has quit IRC21:48
mnaserexactly how i set things up so we're good21:48
*** matrohon has quit IRC21:49
*** piet has quit IRC21:49
jeblairmnaser: has someone pointed http://grafana.openstack.org/dashboard/db/nodepool-vexxhost  to you yet?21:49
*** jtomasek has quit IRC21:50
fungioh, cool, that's live now21:50
*** esikache1 has joined #openstack-infra21:50
mnaserjeblair: yes, waiting to see the number go up soon21:50
jeblairoh heh, i see pabelanger beat me to it just a few mins ago :)21:50
*** sc68cal has joined #openstack-infra21:50
anteayaEmilienM mtreinish can you take a look at https://review.openstack.org/#/c/287464/1 please? it removes non-voting jobs from the gate pipeline21:51
EmilienManteaya: sure thing21:51
anteayaEmilienM: thank you21:51
mtreinishanteaya: can you add me on the review, so I'll have an email to remind me21:52
anteayasure21:52
mtreinishI'm about to head home and I'll lose the link21:52
anteayamtreinish: yup, will do21:52
anteayathank you21:52
EmilienMnibalizer: please also look21:53
fungiwe're still somewhat frequently hitting ssh timeouts when trying to bring up nodes in bluebox-sjc121:55
*** armax has quit IRC21:55
*** esikache1 has quit IRC21:55
funginot sure what's going on there21:55
EmilienManteaya: done21:55
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Test overcloud SSL  https://review.openstack.org/28198821:55
anteayaEmilienM: thank you21:55
fungialso nodepoold is unhappy about being unable to clean up old images from infracloud-west... should those just be deleted from the database?21:56
fungior ignore?21:56
anteayaEmilienM: all the jobs I removed are nonvoting21:56
anteayaaccording to the zuul layout output21:56
openstackgerritPaul Belanger proposed openstack-infra/project-config: Add total singlestats for nodepool dashboard  https://review.openstack.org/28746921:56
jeblairfungi: i think just rm them21:56
pabelangerjeblair: ^ add total counts for build, ready, etc to top level nodepool page21:57
EmilienManteaya: look https://review.openstack.org/#/c/286845/21:57
anteayaEmilienM: if they should be voting then perhaps you need to remove some voting: false statements21:57
EmilienMonly gate-puppet-openstack-integration-scenario003-tempest-dsvm-trusty is not voting21:57
fungijeblair: oh, yeah, no chance of them ever coming back as referenced anyway, what with a planned rename21:57
anteayanot according to the zuul layout output21:57
mordredclarkb: I'm seeing a lot of node fails from ovh and bluebox21:58
anteayaEmilienM: which is what I was going by21:58
mordredclarkb: I checked bluebox floating ips and we don't seem to be leaked21:58
clarkbmordred: nova show the hosts is usually where I start if the fails aren't obvious21:58
clarkbmordred: and sometimes nova console log21:58
EmilienManteaya: https://github.com/openstack-infra/project-config/blob/master/zuul/layout.yaml#L1090-L109821:58
mordredclarkb: yah - nova shows a set of hosts sitting there happily21:59
*** armax has joined #openstack-infra21:59
*** IlyaG has joined #openstack-infra21:59
clarkbmordred: is ssh failing to them?21:59
anteayaif you run tox -e zuul and capture the output and then use the file from INFO:zuul.DependentPipelineManager:Configured Pipeline Manager gate to the post pipeline21:59
clarkbconsole-log can be helpful for diagnosing that if glean/cloud-init failed21:59
anteayathat output shows the 001 scenario for trusty as non-voting22:00
anteayaso I'm not sure what is happening here22:00
fungii guess the snapshot_image table is not very accurately named these days, since it also covers the uploaded copies of dib images22:00
EmilienManteaya: look https://review.openstack.org/#/c/286845/22:00
anteayayeah I know, I did22:00
fungijeblair: delete from snapshot_image where provider_name="infracloud-west";22:00
EmilienMthey are voting22:00
fungilgty?22:01
mordredclarkb: they mostly look like building or used state22:01
anteayaI'm saying there is a mismatch22:01
anteayabetween what zuul is reporting on output as nonvoting22:01
clarkbmordred: they are erroring after being used?22:01
anteayaand what is running as nonvoting22:01
fungijeblair: the equivalent select returns 4 rows which match the image ids it's complaining about. anything else i need to clean up after that?22:01
mordredclarkb: ah - well22:01
mordredclarkb: cloud-init-nonet[13.37]: waiting 120 seconds for network device22:01
mordredclarkb: I do see that on one in the building state22:01
*** thorst has quit IRC22:01
mnasersigns of a dead dhcp server ^22:02
mordredjlk: heya! so ^^22:02
*** thorst has joined #openstack-infra22:02
mordredjlk: we're seeing a bunch of bluebox fails22:02
jlkcrap.22:02
mnaseror ovs flows gone poof from compute nodes22:02
jlkno ovs22:02
mordredmnaser: they're using linuxbridge, so it shouldn't be the second :)22:02
*** dprince has quit IRC22:03
jlklet me ping my ops golks22:03
jlkfolks22:03
mnasergotcha22:03
mordredcool22:03
mordredjlk: also, if you do find something and it's something I could have found/done from my admin panel without bugging y'all, that's also potentially useful info22:03
anteayaEmilienM: here is the tox -e zuul output for puppet-cinder's gate: http://paste.openstack.org/show/489062/22:03
jlkmordred: probably going to have to restart services, which I don't think you can do22:04
mordredjlk: kk22:04
jlkmordred: are you seeing instance build failures, or some other failure?22:04
mordredjlk: eba2a2fb-906c-449d-8143-08646fde6ed3 is a uuid of a host in that state that I've held22:04
mordredjlk: yah I get  cloud-init-nonet[13.37]: waiting 120 seconds for network device in the console log22:04
mordredjlk: which doesn't seem happy22:04
*** sridhar_ram1 has quit IRC22:05
*** BobBall_AWOL has joined #openstack-infra22:05
mordredjlk: oh - that one finally came up22:05
jlkactually up, and pingable, or just gave up on cloud-init?22:06
*** achanda has quit IRC22:06
mordredjlk: yup. that one is now actually up. sorry - I thought I'd caught one in unhappy state22:06
*** jsavak has quit IRC22:06
mordredjlk: lemme keep trying to catch one22:06
*** thorst has quit IRC22:06
*** sridhar_ram has joined #openstack-infra22:06
jlkokay22:07
mordredjlk: hahahaha. literally 10 minutes ago the bluebox failure rate dropped off and became normal again22:07
mordredjlk: so, like, the instant I started investigating and bothering you22:07
timrcmordred: quantum debugging22:08
fungii'm still trying to catch one of the boots in ovh-gra1 that ends up in error state according to nova, but they're gone from the instance list by the time i can plug in the uuid and run nova show22:08
fungilike trying to catch fish with your bare hands22:08
*** BobBall_1WOL has quit IRC22:08
*** _joes_ has quit IRC22:09
jlklolol22:09
mordredfungi: maybe we should add some stuff to nodepool to grab a few things like that when it deletes due to error and log them22:09
fungiyeah, this is one of the few times i miss rackspace's "we'll delete the instance when we get around to it" asynchronicity22:10
jlkonly alert we got was that one of the nodes saw a heavy amount of network traffic.22:10
jlka compute node though.22:10
*** doug-fis_ has joined #openstack-infra22:10
fungihrm, we actually have a lot ending up in error in ovh-bhs1 too22:11
*** doug-fish has quit IRC22:11
mnaserstoring the failed uuids is really useful from the operator side22:11
mnaserits really easy to figure out what happened on our side22:12
mordredfungi: so, looking at nodepool logs22:12
fungiyeah, we do log the uuids. it's just easier if i can spot more detail in the nova api as well22:12
openstackgerritBen Nemec proposed openstack-infra/project-config: Enable pep8 job on tripleo-heat-templates  https://review.openstack.org/25812222:13
mordredfungi: I only see 78 ERROR to 2260 ACTIVE for bluebox22:13
mordredfungi: I see 1626 ERROR to 2834 active for ovh22:13
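To put mordred's two counts on the same scale, here is a small sketch that turns them into failure fractions. It assumes ERROR plus ACTIVE approximates the total number of launch attempts seen in the nodepool logs, which the log does not state; the helper name is hypothetical.

```python
def error_rate(error_count, active_count):
    """Fraction of launches that ended in ERROR, out of ERROR + ACTIVE."""
    total = error_count + active_count
    if total == 0:
        return 0.0
    return float(error_count) / total


# Counts quoted in the channel for each provider.
rates = {
    "bluebox": error_rate(78, 2260),   # roughly 3%
    "ovh": error_rate(1626, 2834),     # roughly 36%
}
```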
mnaseralso, is there a policy in terms of using the data of the openstack-infra graphite instance (i'd like to bring the dashboard you use into our own grafana)22:13
mordredmnaser: nope - it's all public22:13
fungimordred: yep22:13
*** achanda has joined #openstack-infra22:13
fungiit's pretty constant22:14
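[editor's note] The ERROR/ACTIVE counts mordred quotes above can be turned into rough failure rates. This is only a sketch: it assumes ERROR plus ACTIVE approximates the total number of launch attempts in the nodepool logs, which the log itself does not confirm, and `error_rate` is a hypothetical helper name.

```python
def error_rate(error_count: int, active_count: int) -> float:
    """Fraction of launches that ended in ERROR.

    Assumes ERROR + ACTIVE is the total number of launch attempts.
    """
    total = error_count + active_count
    if total == 0:
        return 0.0
    return error_count / total


# Counts quoted in the channel: (ERROR, ACTIVE) per provider.
counts = {"bluebox": (78, 2260), "ovh": (1626, 2834)}

for provider, (errors, active) in counts.items():
    print(f"{provider}: {error_rate(errors, active):.1%} of launches ended in ERROR")
```

Under that assumption bluebox sits at roughly 3% and ovh at roughly 36%, which matches the impression in the channel that the ovh error rate is the outlier.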
mnasersweet. we'll add it to our dashboards22:14
clarkbjust keep in mind we periodically change keys out or reorg the schema22:14
*** e0ne has quit IRC22:14
*** maishsk has quit IRC22:14
mordredmnaser: we mostly just would prefer you don't use any of our graphs as comparative marketing materials "we're 100X faster than XXX as shown by OpenStack Infra"22:14
clarkbbut that is fairly infrequent22:14
mnasermordred: nah, that's not what we're after, we just want to have the visibility in our own grafana instance22:14
mordred++22:15
mnaserbecause our eyes are on it more than the one you have :P22:15
fungimordred: nova list in ovh-gra1 consistently turns up at least a few instances in "ERROR ... NOSTATE" though22:15
pabelangermnaser: consider using grafyaml to build your dashboards :)22:15
mordredmnaser: have you seen the grafyaml project pabelanger put together?22:15
mordredjinx22:15
*** sdake_ has joined #openstack-infra22:15
fungidifferent instances each time since they're getting deleted almost instantly22:15
mnasermordred lol22:15
pabelangerhttps://github.com/openstack-infra/project-config/tree/master/grafana22:15
pabelangermordred: moar people on grafyaml22:15
* mnaser looks22:16
anteayahopefully using our git mirrors22:16
fungigithub.com is one of our git mirrors, just one we don't have much control over :/22:16
EmilienManteaya: I don't understand :)22:17
fungiso not one of our git servers22:17
*** e0ne has joined #openstack-infra22:17
dmsimardmordred: did you figure out that cloud-init nonet issue ?22:18
ianwdmsimard: cool ... so i think you want to modify the template in that change22:18
*** sdake has quit IRC22:18
anteayaEmilienM: yeah, I don't either, I'm confused about why zuul output is showing nonvoting on those jobs yet treating them as voting22:18
mordreddmsimard: it was just time - it has decided to resolve itself22:18
anteayafungi: sorry you are correct, our git servers22:18
dmsimardianw: thanks, feel free to comment, I'm a noob at this22:18
mordreddmsimard: so I'm going to add more debugging22:18
dmsimardmordred: ok, just in case, shameless plug https://dmsimard.com/2014/05/02/no-network-on-ubuntu-14-04-cloud-image-with-cloud-init/22:18
anteayaEmilienM: I'm hoping AJaeger may have some insight after he takes a look22:19
EmilienManteaya: cool no worries :-)22:19
*** weshay has quit IRC22:19
dmsimardmordred: I filed https://bugs.launchpad.net/cloud-init/+bug/1315501 slightly before Ubuntu Trusty came out I think.. and 16.04 is right around the corner22:19
openstackLaunchpad bug 1315501 in cloud-init (Ubuntu) "cloud-init does not use interfaces.d in trusty" [High,Confirmed]22:19
mordreddmsimard: nice22:19
mordreddmsimard: so - that'll also go away from us once we're on our new ubuntu-trusty images everywhere because we don't use cloud-init on those at all22:20
anteayaEmilienM: if the nonvoting attribute is not accurately applied via tox then our foray into finding percentage of nonvoting jobs is completely inaccurate22:20
mordred:)22:20
*** achanda has quit IRC22:20
dmsimardmordred: cool :D22:20
*** ybathia has joined #openstack-infra22:20
anteayaEmilienM: as that was what I was attempting to do when I uncovered what I believed to be nonvoting jobs in the gate pipeline22:20
mordreddmsimard: shameless plug from my end: http://git.openstack.org/cgit/openstack-infra/glean :)22:20
dmsimardmordred: that looks sexy22:21
clarkbfungi: looking at nodepool logs we are building ubuntu-trusty with the compat option set22:21
fungiclarkb: odd, maybe that's a default then and the addition in nodepool.yaml is a no-op?22:22
*** claudiub has joined #openstack-infra22:22
clarkbfungi: ya dib may be doing it for us22:22
fungii wonder if it didn't always and we added those as a workaround while waiting for the behavior change upstream22:23
fungiso there goes that theory22:23
clarkbwhich means the differences are ubuntu vs ubuntu-minimal+simple-init+growroot22:23
anteayamtreinish: thank you22:23
clarkbif I had to guess based on that growroot is a likely culprit22:23
*** thorst has joined #openstack-infra22:23
mtreinishanteaya: sure, np22:23
fungiclarkb: which doesn't seem like it should impact the contents of /opt22:23
fungiclarkb: or did /opt size turn out to be consistent between both images?22:23
clarkbfungi: I think /opt is the same in both cases (though I haven't double checked)22:23
mtreinishanteaya: tbh I was kinda surprised the forward jobs were nv on gate22:23
anteayamtreinish: me too22:24
anteayadid you see the rule in the top of zuul/layout.yaml?22:24
*** claudiub has quit IRC22:24
clarkbI am going to try some local experimentation22:24
*** claudiub has joined #openstack-infra22:24
anteayamtreinish: http://git.openstack.org/cgit/openstack-infra/project-config/tree/zuul/layout.yaml#n123222:25
openstackgerritC.J. Collier proposed openstack-infra/gerritbot: Improve gerritbot error handling  https://review.openstack.org/28705622:25
*** cbader has joined #openstack-infra22:25
mtreinishanteaya: yeah, that job only runs on stable branches, instead of doing n-1 -> n it does n -> n+122:26
mtreinishbut it's never really worked which is why it's nv22:26
openstackgerritMonty Taylor proposed openstack-infra/nodepool: Add an error log with the server fault message  https://review.openstack.org/28748122:27
*** e0ne has quit IRC22:27
*** thorst has quit IRC22:27
mordredfungi: ^^ maybe that gets us slightly more log information22:27
clarkboh cache-bindep22:28
clarkbmaybe it really is the packages22:28
*** amitgandhinz has quit IRC22:28
openstackgerritDavid Moreau Simard proposed openstack-infra/project-config: Make Packstack integration jobs voting, add to gate  https://review.openstack.org/28746122:28
anteayamtreinish: okay thanks for the +122:28
mordredclarkb: cache bindep22:28
mordredyah22:28
clarkbI take that back both cache bindep22:28
mordredoh22:28
*** dkranz has quit IRC22:28
*** cbader has quit IRC22:28
openstackgerritBen Nemec proposed openstack-infra/tripleo-ci: Test overcloud SSL  https://review.openstack.org/28198822:29
dmsimardianw: I think this might be better now: https://review.openstack.org/#/c/287461/2/ thanks for your feedback again :)22:29
clarkbI am trying a local build without growroot22:29
fungiclarkb: right. that was my initial assumption but 1. the apt cache shouldn't be anywhere near that huge and 2. turns out we were running it for both images22:30
*** jamesmcarthur has quit IRC22:30
*** jamesmcarthur has joined #openstack-infra22:31
*** edmondsw has quit IRC22:32
*** jamesmcarthur has quit IRC22:33
ianwdmsimard: so yeah, that's not going to work.  left a comment, but you just want to duplicate what was there into a "gate:" section, and remove the voting regex22:34
fungimordred: if getServerFromList() contains a 'fault' key then it always has a value which is a dict with a 'message' key?22:34
dmsimardianw: oh, why not ? the variables are there for it ? https://github.com/openstack-infra/project-config/blob/master/jenkins/jobs/projects.yaml#L5123-L513422:35
*** jamesmcarthur has joined #openstack-infra22:35
dmsimardoh, saw your comment, ok22:35
fungimordred: or do we need to guard against the possibility of resource['fault']['message'] raising an exception?22:35
mordredfungi: I have code in shade which just prints the server['fault']['message'] on error22:35
fungiokay22:35
mordredfungi: I put in the if because sometimes these are images22:35
*** _amrith_ is now known as amrith22:36
openstackgerritDavid Moreau Simard proposed openstack-infra/project-config: Make Packstack integration jobs voting, add to gate  https://review.openstack.org/28746122:36
ianwdmsimard: yeah, that {name} is just zuul substituting in the name: of that particular job setup.  jjb are separate templates22:36
fungimordred: so basically that should never raise? lgtm then22:36
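[editor's note] fungi's question about guarding `resource['fault']['message']` could be handled along these lines. This is a minimal sketch and not the code in review 287481: it assumes the resource is a plain dict, and `fault_message` is a hypothetical helper name.

```python
def fault_message(resource):
    """Return the fault message from a server-like dict, or None.

    Images and healthy servers may carry no 'fault' key, so nothing
    here indexes into the dict without checking first.
    """
    fault = resource.get("fault")
    if isinstance(fault, dict):
        return fault.get("message")
    return None


# Illustrative records only; the field values are made up.
errored = {"id": "abc", "status": "ERROR", "fault": {"message": "No valid host was found"}}
image = {"id": "def", "status": "ACTIVE"}

for resource in (errored, image):
    message = fault_message(resource)
    if message is not None:
        print(f"{resource['id']}: {message}")
```

Returning None instead of raising keeps the delete path working even when the cloud gives back a fault in an unexpected shape.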
*** esker has joined #openstack-infra22:36
ianwdmsimard: cool, LGTM, just let jenkins churn away at it22:37
*** ayoung has quit IRC22:37
dmsimardyup ok22:38
*** Kaiyan has quit IRC22:38
*** jpr has quit IRC22:38
mnaserhow often does nodepool upload images?22:38
*** Sukhdev has joined #openstack-infra22:39
openstackgerritIan Wienand proposed openstack-infra/project-config: Add new expermintal job - centos with neutron  https://review.openstack.org/28723922:39
*** doug-fis_ has quit IRC22:39
clarkbmnaser: once a day22:40
ianwclarkb: sometimes :)  speaking of ... have the rax uploads been restarted?22:40
mnaserclarkb: i see, depending on how this goes, it may be ideal for us to shuffle around glance-api to make it upload faster, and cache at each az so access/time to ready gets faster22:41
ianwi did check a recent f23 run and it didn't seem to have build timestamps22:41
openstackgerritDavid Moreau Simard proposed openstack-infra/project-config: Make Packstack integration jobs voting, add to gate  https://review.openstack.org/28746122:41
clarkbianw: they ran and failed today22:41
*** annegentle has quit IRC22:41
*** doug-fish has joined #openstack-infra22:42
ianwclarkb: failed to upload?22:42
clarkbya I think with the same issue but I can dig up tracebacks if it helps22:42
clarkb(I should state that the uploads in the extra builders I started to get to vexxhost faster did succeed because they all requested new tokens)22:43
ianwclarkb: well, i think best to retry with the two changes to shade we've got there when mordred releases ... pretty convinced that's it22:43
clarkbianw: kk22:44
*** doug-fis_ has joined #openstack-infra22:44
clarkbthat's easy enough to do once shade releases22:44
mordredclarkb, ianw: holding off on a shade release because of FF22:44
mordredbut it's ready to tag/release whenever we're ready to do that22:44
ianwmordred: do i want to know what FF is?22:44
clarkbis http://tarballs.openstack.org/trove/images/ubuntu/mysql.qcow2 still a thing we use in tests?22:45
* clarkb is just looking at general disk consumption and ^ jumps out22:45
*** xyang1 has quit IRC22:45
openstackgerritMerged openstack-infra/project-config: Use all our vexxhost quota  https://review.openstack.org/28746222:46
fungiclarkb: i take it that's in the devstack file download links list?22:46
clarkbfungi: yup22:46
ianwthat's a good question ...22:46
fungiianw: feature freeze (week)22:47
*** doug-fish has quit IRC22:47
anteayaianw: http://releases.openstack.org/mitaka/schedule.html22:48
*** mriedem has quit IRC22:48
*** doug-fis_ has quit IRC22:48
mnaserclarkb: so now that the change above has been merged, will the nodepool configs update eventually on next puppet run or something?22:49
ianwfungi: oh, doh, of course ... there's all these providers who i've never heard of, i thought it might be short-form for some new one :)22:49
clarkbmnaser: yup they will update on the next puppet run22:49
clarkbmnaser: and nodepool will figure out its config updated and start using it at that point22:49
openstackgerritAnita Kuno proposed openstack-infra/project-config: Remove non-voting jobs from the gate pipeline.  https://review.openstack.org/28746422:51
openstackgerritDeklan Dieterly proposed openstack-infra/project-config: Add freezer-api job for tempest tests  https://review.openstack.org/28748622:52
*** annegentle has joined #openstack-infra22:52
*** krtaylor has joined #openstack-infra22:52
anteayaEmilienM: the patch you showed me with those jobs as voting are check jobs22:53
anteayaEmilienM: have you any gate jobs results?22:53
anteayaEmilienM: it is the gate queue I'm looking at22:53
openstackgerritPaul Belanger proposed openstack-infra/project-config: Enable bindep / devstack cache for fedora22 nodes  https://review.openstack.org/28748822:53
EmilienManteaya: https://review.openstack.org/#/c/286788/22:53
EmilienMlook at gate jobs22:53
EmilienManteaya: maybe you found a bug :-)22:54
clarkbmnaser: looks like our usage just jumped22:54
clarkbmnaser: quite a few instances building now22:54
mnaserbuilding 2322:54
* mnaser watches22:54
anteayaEmilienM: it is possible22:55
*** sc68cal_ has joined #openstack-infra22:56
*** dims_ has quit IRC22:56
mnaserof course the az with the 10gig networking instances go up super fast, the others are still getting setup22:58
mnaseri think we'll do a glance-api cache on the second zone soon to speed it up as well as local nova image cache will help for new deploys22:58
*** sridhar_ram has quit IRC22:58
*** sc68cal has quit IRC22:58
*** annegentle has quit IRC22:59
ianwclarkb: so, after a bit of poking, it seems there's "trove-integration" jobs which actually build that vm on the fly.  i'm pretty convinced it's unused.  i'll propose a removal and get some reviews22:59
*** jamesmcarthur has quit IRC22:59
*** Jeffrey4l has joined #openstack-infra23:00
clarkbianw: thank you23:00
*** alivigni has joined #openstack-infra23:00
*** jamesmcarthur has joined #openstack-infra23:00
ianwclarkb: although, it probably will hang around in old branches ... have to make sure to backport it to really get rid of it.23:01
mnaserclarkb all 25 in-use, zero errors on launch23:01
mnasersuccess.gif23:01
clarkbianw: ya and older branches may need it so it won't die immediately but this is step one in cleaning it up23:02
*** jcoufal has quit IRC23:02
pabelangermnaser: nice23:02
mnaseri think time to ready should go down, i would def rather have instances cooked up in a faster time period23:02
ddieterlycan i get some help with https://review.openstack.org/287486?23:02
anteayamnaser: nice work23:02
ddieterlytrying to get tempest tests configured to run23:03
openstackgerritC.J. Collier proposed openstack-infra/gerritbot: Improve gerritbot error handling  https://review.openstack.org/28705623:03
ddieterlynot sure if the gate_hook function will interfere with the tempest tests23:03
ddieterlylooks like the tempest tests are driven by env vars23:03
clarkbmnaser: part of that may be our data not being entirely truthful, we process the servers serially iirc23:03
clarkbmnaser: I would have to double check in the code on that23:04
*** dingyichen has joined #openstack-infra23:04
*** _sarob has joined #openstack-infra23:05
*** Qiming has joined #openstack-infra23:05
*** _sarob has quit IRC23:05
fungiwow, i go 3 days (3!) without looking at my github spam and we've added 45 new repos to gerrit in that short time23:05
mnaserclarkb: i see23:05
*** _sarob has joined #openstack-infra23:05
*** jamesmcarthur has quit IRC23:07
openstackgerritErno Kuvaja proposed openstack/requirements: Bump of glance-store to 0.13.0  https://review.openstack.org/28749623:07
*** sarob has quit IRC23:08
*** Sukhdev has quit IRC23:08
*** hashar has quit IRC23:09
*** gordc has quit IRC23:09
*** jamesmcarthur has joined #openstack-infra23:10
openstackgerritMax V.K. proposed openstack-infra/jenkins-job-builder: upd google-cloud-storage plugin support  https://review.openstack.org/28749723:10
*** cznewt has quit IRC23:11
*** tiswanso has quit IRC23:12
*** yamamoto_ has joined #openstack-infra23:12
*** tiswanso has joined #openstack-infra23:12
*** tiswanso has quit IRC23:12
*** sobersabre has joined #openstack-infra23:13
*** thiagop has quit IRC23:13
*** tongli has quit IRC23:14
*** sigmavirus24 is now known as sigmavirus24_awa23:15
*** regXboi has quit IRC23:17
*** ajmiller has quit IRC23:17
sobersabrehi. I've updated google-cloud-storage plugin support for jenkins job builder, I've added the previous committer of that plugin as reviewer. what's the correct reviewers list?23:17
*** ajmiller has joined #openstack-infra23:18
sobersabrehttps://review.openstack.org/#/c/287497/123:18
openstackgerritSam Yaple proposed openstack-infra/shade: [WIP] Add update_service()  https://review.openstack.org/28750423:18
anteayasobersabre: https://review.openstack.org/#/admin/groups/194,members23:18
*** salv-orlando has joined #openstack-infra23:19
*** achanda has joined #openstack-infra23:20
openstackgerritYAMAMOTO Takashi proposed openstack-infra/project-config: midonet: Rename rally job so that it matches regex for doc-only changes  https://review.openstack.org/28695623:21
sobersabreanteaya: thanks.23:21
*** rockyg has quit IRC23:22
*** salv-orl_ has quit IRC23:22
sobersabrefdegir: ping23:22
pabelangerfungi: have to run for a bit, but wanted to share some bindep failures on fedora-23 with you: http://paste.openstack.org/show/489071/23:22
pabelangerfungi: seems like bindep is not properly selecting the packages for some reason23:23
NakatoAJaeger: With post constraints jobs, do you remember what the issue was?  I've been taking a look at that lately and I've not been able to figure out where things were going wrong.23:23
pabelangereg: uuid-dev is getting pulled in when only listed as platform:dpkg23:23
pabelangerhave to run, but hope to hack on it later tonight23:23
*** alivigni has quit IRC23:23
*** yamamoto_ has quit IRC23:24
fungipabelanger: the good news is that bindep is pretty easily debuggable, and has a lot of existing test framework in place so if you figure out how to fix it we should be able to pretty easily avoid regressing23:24
*** dimtruck is now known as zz_dimtruck23:25
fungilots of mocked out interactions23:26
fungibut also we have functional testing jobs for it (via the fallback jobs)23:27
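[editor's note] The behaviour pabelanger expects (a package tagged only `platform:dpkg` should not be picked on Fedora) can be illustrated with a simplified selector filter. This is a sketch under stated assumptions and not bindep's actual implementation: it only understands positive `platform:` selectors, `select_packages` is a hypothetical name, and the sample package list is illustrative.

```python
import re

# One requirement per line: "name" or "name [selector selector ...]".
LINE_RE = re.compile(r"^(?P<name>\S+)(?:\s+\[(?P<selectors>[^\]]*)\])?\s*$")


def select_packages(lines, active_platforms):
    """Return package names whose platform selectors match.

    A line with no platform selector always matches; otherwise at
    least one of its platform selectors must be active.
    """
    selected = []
    for raw in lines:
        line = raw.split("#", 1)[0].strip()  # drop comments and blanks
        if not line:
            continue
        match = LINE_RE.match(line)
        if match is None:
            raise ValueError(f"cannot parse requirement line: {raw!r}")
        selectors = (match.group("selectors") or "").split()
        platforms = {s for s in selectors if s.startswith("platform:")}
        if not platforms or platforms & active_platforms:
            selected.append(match.group("name"))
    return selected


requirements = [
    "uuid-dev [platform:dpkg]",
    "libuuid-devel [platform:rpm]",
    "gettext",
]
print(select_packages(requirements, {"platform:rpm", "platform:fedora"}))
```

With these inputs `uuid-dev` is filtered out on an rpm platform, which is the result the fedora-23 run in the paste apparently did not produce.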
*** cznewt has joined #openstack-infra23:27
*** achanda has quit IRC23:28
pabelangerfungi: actually, I have a way for us to debug it in the gate23:28
pabelanger1 sec23:28
openstackgerritPaul Belanger proposed openstack-infra/project-config: Enable bindep / devstack cache for fedora22 nodes  https://review.openstack.org/28748823:30
*** ashleighfarnham has quit IRC23:30
pabelangerfungi: ianw: enabling ^ will install bindep on fedora-22 but not cache the packages at DIB build time23:30
pabelangerwe can then, once new fedora-23 dibs land, use the experimental bindep-fallback job to debug23:31
*** pahuang has joined #openstack-infra23:31
pabelangerI first tested with that patch applied and it worked as expected23:31
*** Sukhdev has joined #openstack-infra23:31
*** jamesmcarthur has quit IRC23:32
clarkbso I ran this exact job on vexxhost earlier and don't think its related to the cloud but http://logs.openstack.org/36/259236/44/check/gate-tempest-dsvm-full/cd9593c/logs/devstacklog.txt.gz#_2016-03-02_23_22_51_961 failed23:32
clarkblooks like horizon is trying to compress things?23:32
mnaserclarkb: any idea where we can find id's of error'd servers?  i see one had an issue so id like to see what happened23:33
clarkbmnaser: we don't publish the nodepool logs but I can look them up for you23:33
mnaserif it's not too much of a hassle, im just curious23:34
*** annegentle has joined #openstack-infra23:35
clarkbmnaser: http://paste.openstack.org/show/489073/ that's one, looks like create server failed so we never got a uuid? you might be able to grep for that 8491827 number in your logs as it will be used in the hostname23:35
mnaseri suspect i know the cause (it shouldn't happen again).  i'll keep an eye out23:36
*** pahuang has quit IRC23:36
mnaserit's probably a combination that it happened with a small route blip in ospf23:36
clarkbmnaser: http://paste.openstack.org/show/489074/ is similar, no uuid because create server failed23:37
mnasernoticed a few inefficient traffic patterns so we're moving it around a bit23:37
*** ddieterly has quit IRC23:39
*** caphrim007 has quit IRC23:47
*** pahuang has joined #openstack-infra23:49
*** sarob has joined #openstack-infra23:49
*** ayoung has joined #openstack-infra23:50
*** salv-orlando has quit IRC23:50
*** annegentle has quit IRC23:51
*** esikache1 has joined #openstack-infra23:51
*** salv-orlando has joined #openstack-infra23:51
*** pradk has quit IRC23:53
*** _sarob has quit IRC23:53
*** kencjohnston has quit IRC23:53
*** sarob has quit IRC23:54
*** Jeffrey4l has quit IRC23:55
tchaypohttp://www.theregister.co.uk/2016/03/01/new_jenkins_moves_into_alpha/23:55
*** esikache1 has quit IRC23:55
*** Jeffrey4l has joined #openstack-infra23:56
clarkbsomething something hey jjb23:56
tchaypoIt sounds like jenkins2.0 will have an equivalent of JJB - but it involves writing groovy instead of yaml.23:56
mordredyah23:56
*** msuriar has quit IRC23:56
mordredbecause that's what I want23:56
pleia2hah23:56
tchaypoI'm going to go laugh at them23:56
mordredtchaypo: please do23:56
*** msuriar has joined #openstack-infra23:57
mordredtchaypo: consider me laughing in concert with you23:57
clarkbI just want the ssh slaves plugin to work23:57
clarkbthat would alleviate about 100% of our current jenkins problems23:57
*** arxcruz has quit IRC23:57
gusSomewhere not at the top of my todo list is to go rewrite the input/templating part of jjb to use jsonnet instead.23:57
gus(particularly for the multiple template inheritance, and actual sane language semantics)23:57
*** ajmiller has quit IRC23:58
