Saturday, 2014-12-06

NobodyCamdroped a localhost connection00:00
adam_gthat pasted log fails horribly in the beginning, but ive seen other failures where it eventually recovers and picks up resources. i have a local devstack up where im hitting it horribly similar to whats pasted above00:00
devanandahttp://logs.openstack.org/94/138294/7/check/check-tempest-dsvm-ironic-pxe_ssh/e3124f6/logs/screen-n-cpu.txt.gz?level=INFO00:01
devananda2014-12-05 10:12:39.294 30427 ERROR nova.compute.manager [-] No compute node record for host devstack-trusty-hpcloud-b3-333748700:01
*** Masahiro has joined #openstack-ironic00:01
devanandaa full minute before the first one of these00:01
devananda2014-12-05 10:13:39.490 30427 AUDIT nova.compute.resource_tracker [-] Auditing locally available compute resources00:01
openstackgerritOpenStack Proposal Bot proposed openstack/ironic: Updated from global requirements  https://review.openstack.org/13976200:02
jrollwhich doesn't actually find any resources00:03
jrollthis must be a networking thing00:03
jrollhttp://logs.openstack.org/94/138294/7/check/check-tempest-dsvm-ironic-pxe_ssh/e3124f6/logs/screen-ir-api.txt.gz?level=INFO00:03
jrollsome requests are getting through ok, though00:04
*** Masahiro has quit IRC00:05
devanandayup00:05
devanandalook at the timestamps00:05
adam_ghmm00:06
devanandain devstacklog.txt, it starts creating ironic at 10:10:4600:06
devanandaand finishes about 3 minutes later00:06
devanandasorry, not ironic -- dib00:06
devananda2014-12-05 10:10:46.623 | ++ init_ironic00:07
devananda2014-12-05 10:11:04.720 | ++ ramdisk-image-create ubuntu deploy-ironic -o /opt/stack/new/devstack/files/ir-deploy-pxe_ssh00:07
adam_gi believe those dropped connections are the n-cpu periodics00:08
adam_gncpu comes up configured for ironic before devstack sets up ironic/creates ramdisk00:09
devananda2014-12-05 10:13:01.834 | WARNING: urllib3.connectionpool HttpConnectionPool is full, discarding connection: 127.0.0.100:09
devanandathat's showing up in devstacklog, when it is trying to upload the images to glance00:09
adam_g2014-12-05 10:15:01.821 30427 DEBUG nova.openstack.common.periodic_task [-] Running periodic task ComputeManager.update_available_resource run_periodic_tasks00:10
adam_g2014-12-05 10:15:01.936 31500 INFO urllib3.connectionpool [-] Resetting dropped connection: 127.0.0.100:10
adam_gdevananda, interesting, im showing the same message locally:  WARNING: urllib3.connectionpool HttpConnectionPool is full, discarding connection: 127.0.0.100:11
adam_ganywhere openstack client is used00:11
devanandainteresting00:12
*** romcheg has quit IRC00:12
jrollfyi, fun virt driver bug https://review.openstack.org/#/c/139767/00:12
devanandamaybe a red herring?00:12
adam_gtodo images00:12
adam_git sounds like a red herring, yeah00:12
devananda2014-12-05 10:13:20.119 | ++ enroll_nodes00:13
devanandaback to timestamps, nova's running a full 3 minutes before Ironic has any nodes enrolled00:13
adam_gdevananda, that should be fine00:13
devanandaso if tempest is keying nova tests off of "is nova available" -- yup. they're going to fail00:13
devanandahm. why is that ok?00:14
adam_gdevananda, because devstack's enroll_nodes() polls hypervisor-stats for an expected node count before exiting00:14
adam_gthe issue im seeing in gate and locally is that00:14
devananda2014-12-05 10:17:20.310 | ++ die 515 'Nova hypervisor-stats did not register at least 3 nodes'00:14
adam_gdevananda, sorry, that log is a bad example -- it totally failed hard in the beginning00:15
adam_gone sec, i can point to a different failure00:15
devanandaoh00:15
devanandak k00:15
adam_gbut the symptoms im seeing: ironic returns the enrolled nodes to nova at during a later periodic task sync, nova counts the nodes in its hypervisor count, but ironic returns power_state=None for them so their resources are skipped by nova00:16
adam_gand looking at the conductor logs, im seeing it stuck waiting for a reservation (presumably trying to validate its power status)00:16
adam_ghttp://logs.openstack.org/29/139329/3/check/check-tempest-dsvm-ironic-parallel-nv/8811be4/logs/screen-n-cpu.txt.gz#_2014-12-05_16_57_26_34500:16
devanandaafter that i'm getting food. and probably a drink00:16
*** harlowja_ has quit IRC00:17
adam_gi can patch devstack to also poll for other hypervisor resources, but this seems like a new issue we've not seen before00:17
adam_gso to my original question, given a 'Attempting to reserve node' message, is it possible to see what task currently has that node reserved?00:18
adam_galso, does it make sense to update that message and change '... to reserve node $N'  to '... to reserve node $UUID'?00:19
jrolladam_g: "Attempting to reserve node" is logged even when the reservation is successful00:20
jrollI believe00:20
adam_gyea, i just realized that :)00:21
adam_gtho im used to seeing some output of the reserving task after00:21
devanandaadam_g: any where we're logging a node ID, we should be logging the UUID instead00:21
jrollright, I agree something is weird00:21
adam_gdevananda, cool, ill put up a patch00:22
jrolldevananda: you should review this and make sure I'm not doing anything dumb00:22
jrollhttps://review.openstack.org/#/c/13976700:22
devanandajroll: then i'll look later. right now, i'm inclined to do dumb things00:24
jrollha00:25
jrollright on, thanks00:25
*** davideagnello has quit IRC00:25
NobodyCamoh agent question do I need to set swift_endpoint_url=?00:29
devanandaadam_g: that error is 2 seconds before the local host SSH key is added00:29
devananda2014-12-05 16:57:28.043 | ++ ironic_ssh_check /opt/stack/data/ironic/ssh_keys/ironic_key 23.253.237.43 22 stack 1000:29
adam_gdevananda, interesting, nice catch00:31
devanandayea, we should probably be ensuring that the SSH power driver can access localhost before we start ironic-conductor00:31
devanandanot 3 minutes later00:31
devanandaheh00:31
devananda2014-12-05 16:57:27.980 | ++ ssh-keygen -q -t rsa -P '' -f /opt/stack/data/ironic/ssh_keys/ironic_key00:31
devanandabut00:32
devananda2014-12-05 16:54:06.119 2733 INFO ironic.common.driver_factory [-] Loaded the following drivers: ['fake', 'pxe_ssh', 'pxe_ipmitool']00:32
devanandaadam_g: that sounds like it could produce the problem you're seeing00:32
devanandathough, damn, there should be some logging about it :(00:32
*** harlowja has joined #openstack-ironic00:32
adam_gdevananda, yeah--stepping through this locally now and im not seeing any logging from the conductor tasks being output anywhere00:32
devanandahttps://github.com/openstack/ironic/blob/master/ironic/conductor/manager.py#L85400:34
devanandatry adding something there00:34
adam_gyeah, thats where i'm at00:34
devanandaoh hah!00:35
devanandathink i found it00:35
devanandadef _do_sync_power_state00:35
devanandatry: task.driver.power.validate00:35
devanandaexcept (...): return00:35
devanandathere's no logging at all if it fails the validate check00:35
devanandafor a node which as no previous power state00:35
devanandathat was intentional, under the assumption that newly added nodes might not be manageable yet (eg, if they fail a basic validate check)00:36
devanandaand operators don't need to see a WARNING for that00:36
adam_g*** MissingParameterValue: Node 1cf7e486-8882-45e8-9465-5259e43ea2f2 does not have any port associated with it.00:36
adam_gis the exception im hitting there00:36
adam_gthats my own fault (i was clearing the node inventory and recreating, but forgot about ports)00:37
devanandahttps://github.com/openstack/ironic/blob/master/ironic/drivers/modules/ssh.py#L32600:38
devanandait's actually checking to see if the keyfile is present on the host00:38
devanandawhich, for the first ~3 minutes, it's not00:38
devanandasince devstack didn't create it yet00:38
devanandaso the SSH driver is probably raising InvalidParameterValue from .validate()00:38
devanandaand that's getting ignored in the sync_power_state loop00:38
adam_gthat makes sense00:39
NobodyCamnice catch devananda :)00:39
*** david-lyle is now known as david-lyle_afk00:39
adam_gneed to refactor devstack a bit00:39
adam_gor, at least lib/ironic00:39
devanandathe part that's off here is devstack starting ironic and enrolling nodes, 3 minutes before creating the key that ironic needs00:39
devanandato manage those nodes00:40
devanandaok - time for food!00:40
devananda$2 tacos!00:41
adam_gdevananda, yeah--the ssh stuff needs to split away from things that need ironic api to be up00:42
adam_ghave a good one!00:42
NobodyCamnight devananda00:46
adam_gdevstack change simpler than expected https://review.openstack.org/#/c/139770/100:55
JayFI'll take a look00:56
JayFnice fix adam_g, ty00:58
adam_gJayF, thanks devananda for finding that, i totally overlooked that :)00:58
*** Masahiro has joined #openstack-ironic01:02
*** Masahiro has quit IRC01:06
*** jerryz has joined #openstack-ironic01:09
*** alexpilotti has quit IRC01:20
*** marcoemorais has quit IRC01:43
*** Marga_ has quit IRC01:57
*** Marga_ has joined #openstack-ironic01:57
*** ryanpetrello has joined #openstack-ironic01:58
*** Marga_ has quit IRC02:02
openstackgerritMerged openstack/ironic-python-agent: Workflow documentation is now in infra-manual  https://review.openstack.org/13933002:08
*** pensu has joined #openstack-ironic02:28
*** jerryz has quit IRC02:47
*** jerryz has joined #openstack-ironic02:47
*** Masahiro has joined #openstack-ironic02:50
*** Masahiro has quit IRC02:55
*** lazy_prince has quit IRC03:27
*** Shrews has quit IRC03:29
*** Haomeng has quit IRC03:40
*** Haomeng has joined #openstack-ironic03:42
*** Masahiro has joined #openstack-ironic03:51
*** Masahiro has quit IRC03:56
openstackgerritZhiQiang Fan proposed openstack/ironic: rename oslo.concurrency to oslo_concurrency  https://review.openstack.org/13978204:21
*** achanda has joined #openstack-ironic04:27
*** achanda has quit IRC04:43
*** achanda has joined #openstack-ironic04:44
*** Marga_ has joined #openstack-ironic04:45
*** Marga_ has quit IRC04:45
*** Marga_ has joined #openstack-ironic04:46
*** killer_prince has joined #openstack-ironic05:16
*** killer_prince is now known as lazy_prince05:16
*** achanda has quit IRC05:29
*** achanda has joined #openstack-ironic05:30
*** achanda has quit IRC05:35
*** Masahiro has joined #openstack-ironic05:40
*** Masahiro has quit IRC05:45
*** achanda has joined #openstack-ironic05:45
*** achanda has quit IRC05:47
openstackgerritShivanand Tendulker proposed openstack/ironic-specs: Ironic Management Interfaces to support UEFI Secure Boot  https://review.openstack.org/13584506:06
*** pcrews has quit IRC06:30
*** pcrews has joined #openstack-ironic06:31
*** harlowja is now known as harlowja_away06:52
*** Masahiro has joined #openstack-ironic07:29
*** rushiagr_away is now known as rushiagr07:31
*** Masahiro has quit IRC07:33
*** andreykurilin_ has joined #openstack-ironic08:41
*** pensu has quit IRC08:46
*** andreykurilin_ has quit IRC08:51
*** shakamunyi has joined #openstack-ironic08:51
*** shakamunyi has quit IRC08:51
*** Masahiro has joined #openstack-ironic09:18
*** Masahiro has quit IRC09:22
*** rushiagr is now known as rushiagr_away09:54
openstackgerritZhiQiang Fan proposed openstack/ironic: rename oslo.concurrency to oslo_concurrency  https://review.openstack.org/13978210:01
*** rushiagr_away is now known as rushiagr10:10
*** andreykurilin_ has joined #openstack-ironic10:19
*** romcheg has joined #openstack-ironic10:33
*** Masahiro has joined #openstack-ironic11:06
*** Masahiro has quit IRC11:11
*** andreykurilin_ has quit IRC11:15
*** andreykurilin_ has joined #openstack-ironic11:15
*** romcheg has quit IRC11:25
*** Marga_ has quit IRC11:32
*** andreykurilin_ has quit IRC12:14
*** lazy_prince has quit IRC12:30
*** alexpilotti has joined #openstack-ironic12:43
*** bradjones has quit IRC12:44
*** bradjones has joined #openstack-ironic12:47
*** Masahiro has joined #openstack-ironic12:55
*** Masahiro has quit IRC12:59
*** rushiagr is now known as rushiagr_away13:14
*** rushiagr_away is now known as rushiagr13:20
*** killer_prince has joined #openstack-ironic13:53
*** killer_prince is now known as lazy_prince13:53
*** lazy_prince has quit IRC14:16
*** killer_prince has joined #openstack-ironic14:26
*** killer_prince is now known as lazy_prince14:27
*** jerryz_ has joined #openstack-ironic14:39
*** jerryz has quit IRC14:43
*** Masahiro has joined #openstack-ironic14:44
*** Masahiro has quit IRC14:49
*** rushiagr is now known as rushiagr_away14:54
*** ryanpetrello has quit IRC15:04
*** lazy_prince has quit IRC15:10
-openstackstatus- NOTICE: [reminder] gerrit will be offline for 30 minutes starting at 16:00 utc for project renames15:20
openstackgerritArata Notsu proposed openstack/ironic: Correct vmware ssh power manager  https://review.openstack.org/13981015:22
*** killer_prince has joined #openstack-ironic15:44
*** killer_prince is now known as lazy_prince15:44
NobodyCammorning Ironic15:49
*** Shrews has joined #openstack-ironic15:58
-openstackstatus- NOTICE: gerrit will be offline for 30 minutes while we rename a few projects. eta 16:30 utc16:03
*** ChanServ changes topic to "gerrit will be offline for 30 minutes while we rename a few projects. eta 16:30 utc"16:03
*** jerryz_ has quit IRC16:16
*** Masahiro has joined #openstack-ironic16:20
*** achanda has joined #openstack-ironic16:23
*** Masahiro has quit IRC16:25
*** andreykurilin_ has joined #openstack-ironic16:31
*** openstackgerrit has quit IRC16:45
*** openstackgerrit has joined #openstack-ironic16:46
*** ChanServ changes topic to "Bare Metal Provisioning | Status: http://bit.ly/ironic-whiteboard | Docs: http://docs.openstack.org/developer/ironic/ | Bugs: https://bugs.launchpad.net/ironic"16:50
*** pensu has joined #openstack-ironic16:54
*** rushiagr_away is now known as rushiagr17:30
*** achanda has quit IRC17:38
openstackgerritArata Notsu proposed openstack/ironic: Correct vmware ssh power manager  https://review.openstack.org/13981017:46
*** romcheg has joined #openstack-ironic18:01
*** ryanpetrello has joined #openstack-ironic18:03
*** Masahiro has joined #openstack-ironic18:09
*** Masahiro has quit IRC18:13
*** shakamunyi has joined #openstack-ironic18:23
*** ryanpetrello has quit IRC18:49
*** igordcard has joined #openstack-ironic18:54
*** ryanpetrello has joined #openstack-ironic18:54
*** igordcard has quit IRC18:55
*** andreykurilin_ has quit IRC19:12
*** ryanpetrello has quit IRC19:15
*** romcheg has quit IRC19:17
*** pensu has quit IRC19:18
*** ryanpetrello has joined #openstack-ironic19:20
*** romcheg has joined #openstack-ironic19:20
*** ryanpetrello has quit IRC19:31
*** ryanpetrello has joined #openstack-ironic19:37
*** romcheg has quit IRC19:40
*** ryanpetrello has quit IRC19:49
*** ryanpetrello has joined #openstack-ironic19:53
*** romcheg has joined #openstack-ironic19:58
*** Masahiro has joined #openstack-ironic19:58
*** ryanpetrello has quit IRC19:58
*** Masahiro has quit IRC20:02
*** andreykurilin_ has joined #openstack-ironic20:21
*** Marga_ has joined #openstack-ironic20:31
*** igordcard has joined #openstack-ironic20:35
*** shakamunyi has quit IRC20:55
*** rushiagr is now known as rushiagr_away21:17
*** krtaylor has quit IRC21:21
*** krtaylor has joined #openstack-ironic21:27
*** Masahiro has joined #openstack-ironic21:47
*** Masahiro has quit IRC21:51
*** romcheg has quit IRC22:05
*** romcheg has joined #openstack-ironic22:24
*** Marga_ has quit IRC22:37
*** toabctl has quit IRC23:05
*** toabctl has joined #openstack-ironic23:07
*** ekarlso- has quit IRC23:07
*** ryanpetrello has joined #openstack-ironic23:12
*** vdrok has quit IRC23:21
*** andreykurilin_ has quit IRC23:22
*** vdrok has joined #openstack-ironic23:22
*** andreykurilin_ has joined #openstack-ironic23:22
*** Masahiro has joined #openstack-ironic23:35
*** Marga_ has joined #openstack-ironic23:38
*** Masahiro has quit IRC23:39
*** igordcard has quit IRC23:42
*** Marga_ has quit IRC23:43
*** ryanpetrello has quit IRC23:53
*** ryanpetrello has joined #openstack-ironic23:53

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!