*** cgoncalves has joined #openstack-nova | 00:09 | |
*** rcernin has joined #openstack-nova | 00:13 | |
*** brinzhang has joined #openstack-nova | 00:29 | |
*** trident has joined #openstack-nova | 00:33 | |
*** ircuser-1 has quit IRC | 00:35 | |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: ksa auth conf and client for cyborg access https://review.opendev.org/631242 | 00:39 |
---|---|---|
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Add Cyborg device profile groups to request spec. https://review.opendev.org/631243 | 00:39 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Create and bind Cyborg ARQs. https://review.opendev.org/631244 | 00:39 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Get resolved Cyborg ARQs and add PCI BDFs to VM's domain XML. https://review.opendev.org/631245 | 00:39 |
*** _alastor_ has joined #openstack-nova | 00:48 | |
*** _alastor_ has quit IRC | 00:52 | |
*** imacdonn has quit IRC | 01:13 | |
*** imacdonn has joined #openstack-nova | 01:13 | |
*** slaweq has joined #openstack-nova | 01:14 | |
*** slaweq has quit IRC | 01:24 | |
*** altlogbot_2 has quit IRC | 01:28 | |
*** altlogbot_0 has joined #openstack-nova | 01:29 | |
*** spatel has joined #openstack-nova | 01:37 | |
*** Dinesh_Bhor has joined #openstack-nova | 01:44 | |
*** slaweq has joined #openstack-nova | 02:16 | |
*** brinzhang_ has joined #openstack-nova | 02:17 | |
*** brinzhang has quit IRC | 02:20 | |
*** slaweq has quit IRC | 02:24 | |
*** BjoernT has joined #openstack-nova | 02:33 | |
openstackgerrit | Yongli He proposed openstack/nova master: clean up orphan instances https://review.opendev.org/627765 | 02:35 |
*** BjoernT_ has joined #openstack-nova | 03:08 | |
*** BjoernT has quit IRC | 03:09 | |
*** spatel has quit IRC | 03:14 | |
*** slaweq has joined #openstack-nova | 03:15 | |
*** slaweq has quit IRC | 03:24 | |
*** psachin has joined #openstack-nova | 03:28 | |
openstackgerrit | Boxiang Zhu proposed openstack/nova master: Support agile samples name https://review.opendev.org/669591 | 03:28 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: ksa auth conf and client for cyborg access https://review.opendev.org/631242 | 03:33 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Add Cyborg device profile groups to request spec. https://review.opendev.org/631243 | 03:33 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Create and bind Cyborg ARQs. https://review.opendev.org/631244 | 03:33 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Get resolved Cyborg ARQs and add PCI BDFs to VM's domain XML. https://review.opendev.org/631245 | 03:33 |
*** prometheanfire has joined #openstack-nova | 03:43 | |
prometheanfire | I'm guessing volumes showing up as attached in nova but not cinder (and not actually being attached) is a nova problem? | 03:43 |
prometheanfire | can't 'remove volume' it No volume with a name or ID of '3e92f9b6-1a3b-4a7e-8487-6ff253e888db' exists. | 03:43 |
prometheanfire | problem starts when attaching it, times out with a 504 :| | 03:50 |
*** udesale has joined #openstack-nova | 04:09 | |
*** slaweq has joined #openstack-nova | 04:11 | |
*** factor has quit IRC | 04:13 | |
*** factor has joined #openstack-nova | 04:14 | |
openstackgerrit | Takashi NATSUME proposed openstack/python-novaclient master: Deprecate cells v1 and extension commands and APIs https://review.opendev.org/669597 | 04:22 |
*** slaweq has quit IRC | 04:24 | |
*** slaweq has joined #openstack-nova | 05:11 | |
*** ociuhandu has joined #openstack-nova | 05:20 | |
*** slaweq has quit IRC | 05:25 | |
*** ociuhandu has quit IRC | 05:27 | |
*** ociuhandu_ has joined #openstack-nova | 05:27 | |
*** whoami-rajat has joined #openstack-nova | 05:30 | |
*** BjoernT_ has quit IRC | 05:32 | |
*** Luzi has joined #openstack-nova | 05:32 | |
*** ociuhandu_ has quit IRC | 05:33 | |
*** brinzhang_ has quit IRC | 05:36 | |
*** ratailor has joined #openstack-nova | 05:45 | |
*** ileixe has joined #openstack-nova | 05:54 | |
*** ccamacho has joined #openstack-nova | 06:00 | |
*** brinzhang has joined #openstack-nova | 06:02 | |
*** slaweq has joined #openstack-nova | 06:11 | |
*** pcaruana has joined #openstack-nova | 06:12 | |
*** yaawang has quit IRC | 06:15 | |
openstackgerrit | Arthur Dayne proposed openstack/nova-specs master: Proposal for a safer noVNC console with password authentication https://review.opendev.org/623120 | 06:16 |
*** maciejjozefczyk has joined #openstack-nova | 06:21 | |
*** etp has joined #openstack-nova | 06:24 | |
*** slaweq has quit IRC | 06:26 | |
*** damien_r has joined #openstack-nova | 06:28 | |
*** xek_ has joined #openstack-nova | 06:30 | |
*** damien_r has quit IRC | 06:32 | |
*** luksky11 has joined #openstack-nova | 06:34 | |
*** slaweq has joined #openstack-nova | 06:39 | |
*** yaawang has joined #openstack-nova | 06:43 | |
openstackgerrit | Merged openstack/nova master: Clean up test_virtapi https://review.opendev.org/667419 | 06:45 |
*** yaawang has quit IRC | 06:50 | |
*** yaawang has joined #openstack-nova | 06:55 | |
*** rcernin has quit IRC | 07:12 | |
*** ivve has joined #openstack-nova | 07:13 | |
*** helenafm has joined #openstack-nova | 07:14 | |
*** ricolin has joined #openstack-nova | 07:18 | |
*** psachin has quit IRC | 07:19 | |
openstackgerrit | wangwei1 proposed openstack/nova master: fix spelling error in nova/api/validation/__init__.py https://review.opendev.org/669244 | 07:20 |
openstackgerrit | Boxiang Zhu proposed openstack/nova master: Add host and hypervisor_hostname flag to create server https://review.opendev.org/645520 | 07:22 |
*** tssurya has joined #openstack-nova | 07:28 | |
*** jangutter has joined #openstack-nova | 07:31 | |
*** adriant has quit IRC | 07:38 | |
*** ociuhandu has joined #openstack-nova | 07:45 | |
*** damien_r has joined #openstack-nova | 07:49 | |
*** ralonsoh has joined #openstack-nova | 07:55 | |
*** ttsiouts has joined #openstack-nova | 08:01 | |
*** rpittau|afk is now known as rpittau | 08:03 | |
*** ttsiouts has quit IRC | 08:13 | |
*** ttsiouts has joined #openstack-nova | 08:13 | |
*** ttsiouts has quit IRC | 08:18 | |
*** ttsiouts has joined #openstack-nova | 08:24 | |
*** mdbooth has joined #openstack-nova | 08:33 | |
*** psachin has joined #openstack-nova | 08:34 | |
*** gokhani has joined #openstack-nova | 08:34 | |
openstackgerrit | Merged openstack/nova master: Add VirtAPI.update_compute_provider_status https://review.opendev.org/668706 | 08:41 |
*** yaawang has quit IRC | 08:41 | |
*** brinzhang_ has joined #openstack-nova | 08:47 | |
*** brinzhang has quit IRC | 08:50 | |
*** derekh has joined #openstack-nova | 08:54 | |
*** lpetrut has joined #openstack-nova | 09:00 | |
*** cdent has joined #openstack-nova | 09:05 | |
*** Zara has joined #openstack-nova | 09:08 | |
*** priteau has joined #openstack-nova | 09:08 | |
*** ociuhandu has quit IRC | 09:11 | |
*** brinzhang has joined #openstack-nova | 09:13 | |
*** brinzhang has quit IRC | 09:13 | |
*** brinzhang has joined #openstack-nova | 09:14 | |
*** brinzhang_ has quit IRC | 09:16 | |
*** brinzhang has quit IRC | 09:17 | |
*** brinzhang has joined #openstack-nova | 09:17 | |
*** brinzhang_ has joined #openstack-nova | 09:18 | |
*** mdbooth has quit IRC | 09:21 | |
*** brinzhang has quit IRC | 09:22 | |
*** davidsha has joined #openstack-nova | 09:23 | |
*** mdbooth has joined #openstack-nova | 09:38 | |
*** cdent has quit IRC | 09:45 | |
*** panda is now known as panda|bbl | 09:48 | |
*** ociuhandu has joined #openstack-nova | 09:48 | |
openstackgerrit | Brin Zhang proposed openstack/nova master: Specify availability_zone to unshelve https://review.opendev.org/663851 | 09:51 |
openstackgerrit | Miguel Ángel Herranz Trillo proposed openstack/nova stable/queens: Fix type error on call to mount device https://review.opendev.org/669629 | 09:57 |
*** luksky11 has quit IRC | 10:12 | |
*** moshele has joined #openstack-nova | 10:15 | |
moshele | @sean-k-mooney: hi | 10:16 |
*** ttsiouts has quit IRC | 10:32 | |
*** ttsiouts has joined #openstack-nova | 10:33 | |
*** panda|bbl has quit IRC | 10:33 | |
*** cdent has joined #openstack-nova | 10:33 | |
*** awalende has joined #openstack-nova | 10:36 | |
gokhani | hi team, because of power outage, Most of our compute nodes unexpectedly shut down and now I can not start our instances. Instances Power status is No State. Error log is http://paste.openstack.org/show/754107/. My environment is OpenStack Pike and Instances are on a nfs shared storage. Nova version is 16.1.6.dev2. There are more important instances on my environment. How can I rescue my istances? What would you suggest ? | 10:37 |
*** ttsiouts has quit IRC | 10:37 | |
*** panda has joined #openstack-nova | 10:37 | |
*** priteau has quit IRC | 10:42 | |
*** udesale has quit IRC | 11:00 | |
*** ttsiouts has joined #openstack-nova | 11:01 | |
*** priteau has joined #openstack-nova | 11:09 | |
*** luksky11 has joined #openstack-nova | 11:11 | |
*** cdent has quit IRC | 11:12 | |
*** priteau has quit IRC | 11:16 | |
*** tesseract has joined #openstack-nova | 11:17 | |
*** cdent has joined #openstack-nova | 11:18 | |
openstackgerrit | Ivaylo Mitev proposed openstack/nova master: VMware VMDK detach: get adapter type from instance VM https://review.opendev.org/653738 | 11:19 |
*** moshele has quit IRC | 11:20 | |
*** tesseract has quit IRC | 11:20 | |
*** tesseract has joined #openstack-nova | 11:21 | |
*** mdbooth has quit IRC | 11:24 | |
*** ratailor has quit IRC | 11:25 | |
*** ricolin has quit IRC | 11:26 | |
*** mdbooth has joined #openstack-nova | 11:29 | |
openstackgerrit | Merged openstack/nova master: Correct the comment of RequestSpec's network_metadata https://review.opendev.org/667061 | 11:56 |
*** _alastor_ has joined #openstack-nova | 12:02 | |
*** etp has quit IRC | 12:02 | |
*** etp has joined #openstack-nova | 12:02 | |
*** sean-k-mooney has quit IRC | 12:03 | |
*** _alastor_ has quit IRC | 12:06 | |
*** etp has quit IRC | 12:08 | |
*** sean-k-mooney has joined #openstack-nova | 12:16 | |
*** cdent has quit IRC | 12:26 | |
openstackgerrit | Brin Zhang proposed openstack/nova master: Specify availability_zone to unshelve https://review.opendev.org/663851 | 12:33 |
*** artom has joined #openstack-nova | 12:33 | |
openstackgerrit | Shilpa Devharakar proposed openstack/nova master: Support filtering of hosts by forbidden aggregates https://review.opendev.org/667952 | 12:35 |
*** edleafe has joined #openstack-nova | 12:42 | |
*** shilpasd has joined #openstack-nova | 12:49 | |
openstackgerrit | Merged openstack/os-resource-classes master: Add Python 3 Train unit tests https://review.opendev.org/669479 | 12:57 |
openstackgerrit | Merged openstack/os-traits master: Add Python 3 Train unit tests https://review.opendev.org/669480 | 13:00 |
*** takashin has left #openstack-nova | 13:02 | |
*** lbragstad has joined #openstack-nova | 13:03 | |
*** damien_r has quit IRC | 13:04 | |
*** tesseract has quit IRC | 13:14 | |
*** tesseract has joined #openstack-nova | 13:16 | |
*** mriedem has joined #openstack-nova | 13:23 | |
openstackgerrit | Matt Riedemann proposed openstack/nova stable/rocky: Fix type error on call to mount device https://review.opendev.org/669664 | 13:25 |
*** spatel has joined #openstack-nova | 13:28 | |
*** brinzhang_ has quit IRC | 13:29 | |
*** spatel has quit IRC | 13:29 | |
*** francoisp has quit IRC | 13:30 | |
*** spatel has joined #openstack-nova | 13:31 | |
*** francoisp has joined #openstack-nova | 13:32 | |
*** cdent has joined #openstack-nova | 13:34 | |
bauzas | mriedem: around ? | 13:41 |
bauzas | mriedem: I replied to the thread but I wonder about something | 13:42 |
mriedem | yes | 13:42 |
bauzas | in both http://paste.openstack.org/show/734146/ and https://github.com/larsks/os-placement-tools/blob/master/check_placement.py they first ask about all the instances | 13:42 |
bauzas | and then they loop around allocations by checking some persisted object | 13:42 |
bauzas | but then, we could have some races | 13:43 |
bauzas | plus, it could be super huge to get all the instances for CERN eg. | 13:43 |
bauzas | so, I wonder whether I should call the API for every instance | 13:43 |
bauzas | erh | 13:43 |
bauzas | I mean, checking the DB | 13:43 |
mriedem | i'd think you'd avoid races by checking if an instance has a task_state != None, and if looking at migrations, is the migration in progress | 13:46 |
mriedem | mnaser's script doesn't loop over instances, | 13:48 |
*** Luzi has quit IRC | 13:48 | |
mriedem | it gets hypervisors and the servers on those hypervisors and compares to the resource provider inventory/allocation per hypervisor/server | 13:48 |
*** eharney has joined #openstack-nova | 13:49 | |
*** amodi has joined #openstack-nova | 13:50 | |
mriedem | i'd think it would be easiest to first determine what kinds of problems the audit command is going to try and find | 13:50 |
mriedem | and then figure out the best way to implement that | 13:50 |
*** awalende has quit IRC | 13:50 | |
mriedem | i think we know the two main issues are missing allocations (which heal_allocations should fix) and leaked / orphaned allocations, so i'd start by focusing on the latter issue | 13:51 |
*** awalende has joined #openstack-nova | 13:51 | |
*** BjoernT has joined #openstack-nova | 13:52 | |
*** efried_pto is now known as efried | 13:54 | |
mriedem | i think we could pretty easily identify leaked allocations by starting from the provider allocations and working backward - looking for servers or migration records with the allocation consumer id (since we don't have the type recorded yet) | 13:55 |
*** awalende has quit IRC | 13:55 | |
*** awalende_ has joined #openstack-nova | 13:55 | |
mriedem | so i'd probably start by getting all hypervisors (nodes/resource providers) and for each hypervisor, get the allocations - similar to what mnaser's script does | 13:59 |
mriedem | getting the hypervisors + servers in the same call like his script would also tell you if allocations on that provider was a server or not - and if not, it's either leaked or it's a migration allocation | 14:00 |
mriedem | so no i don't think you need to loop over all instances | 14:00 |
mriedem | maybe i should just write this thing :) | 14:00 |
*** awalende_ has quit IRC | 14:00 | |
*** _alastor_ has joined #openstack-nova | 14:02 | |
*** priteau has joined #openstack-nova | 14:02 | |
*** FlorianFa has quit IRC | 14:05 | |
bauzas | mriedem: I'm pretty sure I can do it | 14:06 |
*** hongbin has joined #openstack-nova | 14:07 | |
openstackgerrit | François Palin proposed openstack/nova master: WIP - call volume detach even if terminate_connection fails https://review.opendev.org/669674 | 14:16 |
kashyap | mriedem: You're right ... indeed I didn't hear any responses on the list so far (http://lists.openstack.org/pipermail/openstack-discuss/2019-July/007521.html) | 14:19 |
kashyap | alex_xu: Hiya, when you're about, do you have a comment on the above thread? (And the change wich Matt Cced you on, which removes Intel CMT events) | 14:19 |
* kashyap goes back to the Other Yak(tm) | 14:19 | |
efried | kashyap: As predicted, my wife immediately stole the sleep book | 14:20 |
efried | and it is life-changing | 14:20 |
kashyap | efried: Hehe. Sharing is caring. I'm sure she will not hide her fascination with the book | 14:20 |
efried | I don't know why, but stuff I've been telling her for years for some reason is having more of an impact coming from a complete stranger | 14:20 |
kashyap | efried: Ah, she already finished it? | 14:20 |
efried | no, hasn't finished yet, but every chapter brings new insight. | 14:21 |
efried | she has bought a copy for her parents | 14:21 |
efried | and recommended it to a number of people. | 14:21 |
kashyap | :-) | 14:21 |
efried | I got a couple chapters in before she stole it, and I'm definitely sold | 14:21 |
kashyap | efried: I know, you must've become like a "broken record", and people zone out the moment broken reel start spinning :D | 14:21 |
kashyap | Sorry, just teasing. | 14:22 |
efried | though for me it's more fascination with the science than altering lifestyle, which was already pretty close to right sleep-wise. | 14:22 |
kashyap | Yeah, agreed. | 14:23 |
kashyap | efried: Also, I'd highly recommend to watch him give a tech talk. He's one of the most effective speakers I've ever seen. | 14:23 |
efried | noted | 14:24 |
kashyap | The way he pauses for the words to sink in, diction, is all worth observing. At least to me. | 14:24 |
kashyap | https://www.youtube.com/watch?v=aXflBZXAucQ | 14:24 |
*** dpawlik has quit IRC | 14:24 | |
kashyap | efried: If you click, look how quickly (within 5 seconds) he escalates the talk. :D | 14:24 |
* kashyap stops drumming the beat and goes to do muck around with Secure Boot patches | 14:25 | |
*** purplerbot has quit IRC | 14:28 | |
prometheanfire | was pointed here for nova adding a cinder volume to a server without (the record/db at least) but not actually adding it to the running VM (client reports a 504 when trying to add the volume) | 14:29 |
*** purplerbot has joined #openstack-nova | 14:30 | |
prometheanfire | it shows up in the volumes_attached field, but can't remove any of them as they either do not exist or..... | 14:30 |
prometheanfire | Invalid volume: Invalid input received: Invalid volume: Unable to detach volume. Volume status must be 'in-use' and attach_status must be 'attached' to detach. (HTTP 400) (Request-ID: req-01d8a3c5-66f4-43a4-bac5-9f1104a292fe) (HTTP 400) (Request-ID: req-358132e4-38a1-493a-b87b-d135791bae2d) | 14:30 |
*** ricolin has joined #openstack-nova | 14:35 | |
*** mlavalle has joined #openstack-nova | 14:36 | |
*** gouthamr has quit IRC | 14:47 | |
*** awalende has joined #openstack-nova | 14:55 | |
*** gouthamr has joined #openstack-nova | 14:56 | |
*** awalende has quit IRC | 14:59 | |
efried | kashyap: Would you mind taking a look at this please? | 15:02 |
efried | https://review.opendev.org/#/c/667975/ | 15:02 |
efried | It seems straightforward enough, but I'm not sure if there could e.g. be security problems, or if there's another accepted way to do this, etc. | 15:02 |
kashyap | efried: In a meeting; will look | 15:03 |
efried | no hurry | 15:03 |
efried | thanks | 15:03 |
stephenfin | lyarwood: Could you send this on its merry way? https://review.opendev.org/#/c/667355/ | 15:09 |
lyarwood | stephenfin: looking | 15:11 |
lyarwood | stephenfin: sorry I did mean to sort that out last week once the stein change landed | 15:11 |
*** BjoernT_ has joined #openstack-nova | 15:11 | |
lyarwood | stephenfin: done | 15:11 |
mriedem | gibi: comments in https://review.opendev.org/#/c/669188/ just so i'm sure i'm following it correctly (i know this patch was because i brought it up elsewhere) | 15:13 |
*** dklyle has joined #openstack-nova | 15:13 | |
gibi | mriedem: looking.. | 15:13 |
*** BjoernT has quit IRC | 15:14 | |
*** ivve has quit IRC | 15:16 | |
*** gyee has joined #openstack-nova | 15:17 | |
*** mdbooth has quit IRC | 15:21 | |
gibi | mriedem: replied. I only added some extra explanation but I think we agree. | 15:26 |
gibi | mriedem: I can re-spin the patch to fix the test redundancies | 15:27 |
*** wwriverrat has joined #openstack-nova | 15:28 | |
*** dklyle has quit IRC | 15:28 | |
*** dklyle has joined #openstack-nova | 15:28 | |
mriedem | yeah just fix the test and i'm +2 | 15:28 |
gibi | mriedem: OK, I will do it quickly | 15:28 |
*** amodi has quit IRC | 15:29 | |
cdent | mriedem, efried : I have a situation where under high load the _update_to_placement call made at the end of instance_claim in the compute manager can fail on ResourceProviderConflict all 4 retry attempts (because of nova-scheduler simultaneously making many allocations to the same resource provider). This fails the instance build even though the instance did build. | 15:29 |
efried | cdent: That is very interesting. | 15:30 |
cdent | I'm considering a hack to make instance_claim not call _update_to_placement and just letting the periodic job do that. The reason _update_to_placement is actually talking to placement is because inventory (DISK_GB max_unit) does change after many (most) instance creations | 15:31 |
cdent | ideally that max_unit thing wouldn't be in place but that's not really any immediate option | 15:31 |
efried | I'm guessing backing off the retry interval wouldn't be of help, because the problem is the timing between the GET and the PUT(s). | 15:31 |
sean-k-mooney | cdent: were we not looking at disabling the periodic job at one point | 15:31 |
efried | wait, what? | 15:31 |
efried | max_unit changes? | 15:32 |
cdent | to account for multiple different source of one disk_gb in a cluster | 15:32 |
mriedem | cdent: is the vcenter driver reporting different max_unit DISK_GB from update_provider_tree during the claim? | 15:32 |
cdent | being presented as one disk | 15:32 |
sean-k-mooney | cdent: you mean for things like ceph | 15:32 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Remove assumption of http error if consumer not exists https://review.opendev.org/669188 | 15:32 |
mriedem | this is likely not libvirt/ceph | 15:33 |
mriedem | it's vcenter with shared storage provider or something? | 15:33 |
cdent | vcenter _faking_ shared stored providers | 15:33 |
sean-k-mooney | cdent: normally it should not need to change and should stay at the total capasity right | 15:33 |
cdent | sean-k-mooney: yes | 15:33 |
efried | oh, this isn't in libvirt, okay. Yeah, in libvirt max_unit == total and doesn't seem to change. | 15:33 |
cdent | but if you have N datastores in one cluster, no single request can be more than the free space on one of those datastores | 15:33 |
efried | okay, but as you say, fixing that is unrelated to the 409 bounce | 15:34 |
cdent | (this is why I've been eager for shared providers, as it could fix this (except that storage "policies" break all that) | 15:34 |
sean-k-mooney | cdent: well if you have over allcoation set to somthing other then 1 it could | 15:34 |
mriedem | sure, but why would max_unit change between initially reporting the inventory and then during the claim? | 15:34 |
sean-k-mooney | but ya i get you point | 15:34 |
mriedem | when the service starts up, it runs upt which should report the max_unit from the Nth datastore with the most amount of total disk right? | 15:34 |
efried | mriedem: the claim itself reduces the amount of free space on one of those disks | 15:34 |
sean-k-mooney | we physically could not allocate more then the free space on the data store | 15:35 |
cdent | let's assume for the moment that the max_unit hack is immutable as that's not really germane to the point, it's just the proximate cause | 15:35 |
mriedem | efried: couldn't we make that same argument for the libvirt driver then? | 15:35 |
kashyap | melwitt: efried: On that 'initenv' change (https://review.opendev.org/#/c/667976/2), doesn't it seem like a broken detection by 'cloud-init'? | 15:36 |
gibi | mriedem: done https://review.opendev.org/669188 | 15:36 |
cdent | the point is that a failed _update_to_placement during instance claim (which is about inventory, traits and aggregates, not allocations) can fail an already suceeded instance | 15:36 |
sean-k-mooney | mriedem: i dont think we should be adjusting max_unit in either case | 15:36 |
efried | mriedem: In libvirt it's not an issue because we're only looking at one disk, so the (total - usage) alread ylimits. | 15:36 |
kashyap | [Ignore me, I'll add a comment in the change.] | 15:36 |
mriedem | cdent: what do you mean by "already succeeded? | 15:36 |
efried | kashyap: Note the subsequent patch and the bug it points to. | 15:36 |
mriedem | we do the claim before the driver.spawn right? | 15:36 |
sean-k-mooney | it should remian at total size and really on palcment to do the free calulation + allcoation raitio | 15:36 |
cdent | mriedem: let me look at my notes | 15:37 |
sean-k-mooney | or total size of largest data store i gues for vcenter | 15:37 |
mriedem | yeah driver.spawn is way late in the build process, it's like the last thing | 15:37 |
mriedem | first we claim, then we build volumes and networking, then we call driver.spawn | 15:37 |
efried | anyway, I think the point here is whether we can maybe do without that _update_to_placement at the end of instance_claim? What else is it doing? | 15:37 |
mriedem | instance_claim calls _update which was historically for updating info on the compute node record | 15:38 |
cdent | mriedem: yeah, sorry, not that it did work, but _would_ work | 15:38 |
mriedem | for things like free_vcpus and whatnot | 15:38 |
mriedem | yeah i was going to say... https://github.com/openstack/nova/blob/86524773b8cd3a52c98409c7ca183b4e1873e2b8/nova/compute/manager.py#L2223 :) | 15:38 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: ksa auth conf and client for cyborg access https://review.opendev.org/631242 | 15:38 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Add Cyborg device profile groups to request spec. https://review.opendev.org/631243 | 15:38 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Create and bind Cyborg ARQs. https://review.opendev.org/631244 | 15:38 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Get resolved Cyborg ARQs and add PCI BDFs to VM's domain XML. https://review.opendev.org/631245 | 15:39 |
*** amodi has joined #openstack-nova | 15:39 | |
mriedem | so i think what's being asked is a kwarg on _update to say whether or not to call _update_to_placement | 15:39 |
*** mdbooth has joined #openstack-nova | 15:39 | |
efried | though the fact that it's doing something in this case is an argument in favor of keeping it. Kinda wondering what breaks if we don't have that. Presumably it would be possible for the disk allocation to be unsatisfiable late in the spawn process. | 15:39 |
mriedem | which in the startup and update_available_resource periodic would be yes, | 15:40 |
mriedem | but in instance_claim you're saying no | 15:40 |
cdent | mriedem: pretty much, yes | 15:40 |
efried | configurable ^ perhaps | 15:40 |
cdent | you'd have to know that if you chose to do that you have to keep the periodic regular | 15:40 |
cdent | s/regular/frequent/ | 15:40 |
cdent | what I really want to do is not have inventory changing all the time... | 15:41 |
mriedem | reshapes are in that flow but it looks like reshape should only be allowed on startup | 15:41 |
kashyap | efried: Yeah, just read the bug. Miguel is right. And this change makes sense. I don't see any security implications here (checked also w/ a libvirt dev) | 15:41 |
sean-k-mooney | cdent: as mriedem asked earlier is there any reason on start that we could not set teh max_unit to the larges datasotre and then not update it | 15:41 |
sean-k-mooney | or not update it outside the periodic | 15:42 |
mriedem | sean-k-mooney: from the upt perspective i'm not sure if the driver knows if it's called from the periodic, on startup, or during a claim | 15:42 |
sean-k-mooney | e.g. does it really need to be set to the free space | 15:42 |
mriedem | note that _move_claim will have the same issue here i'd think | 15:42 |
cdent | a) what mriedem said, b) the idea is to make sure placement has a way to prevent someone asking for more physical disk than is available | 15:43 |
cdent | mriedem: it's less of a risk there as they a) don't happen as much, b) happen even less in the vmware environment, c) this problem really only shows up when throwing > 2000 vms as the same nova-compute | 15:43 |
*** helenafm has quit IRC | 15:43 | |
cdent | s/as the/at the/ | 15:43 |
sean-k-mooney | cdent: ya i get that. i wonder if this is somthing we coudl do when we make the inital allocation | 15:44 |
mriedem | the initial allocation doesn't involve the driver | 15:44 |
sean-k-mooney | e.g. is this somethign we shoudl consider for all driver in general | 15:44 |
sean-k-mooney | true | 15:44 |
efried | kashyap: Thank you. Your votes/comments on those patches will be beneficial. Sounded like melwitt will also appreciate it. | 15:44 |
cdent | no other driver (that I'm aware of) would need this as they report real and true inventory | 15:44 |
sean-k-mooney | but shoudl we be doing it for libvirt with ceph/nfs | 15:44 |
cdent | it's only when you have non-contiguous disk(s) | 15:45 |
sean-k-mooney | cdent: ok would this be solved in the futrue by representing | 15:45 |
mriedem | in the libvirt+ceph case, we report total and max_unit the same, and placement will reject requests that are too big because of the capacity calculation in placement, right? | 15:45 |
sean-k-mooney | the datastores as different shareing providers | 15:46 |
cdent | sean-k-mooney: yes, except for the thing I said above about something called "storage policy" which would disrupt placement's understanding of where things are | 15:46 |
*** tesseract has quit IRC | 15:46 | |
sean-k-mooney | mriedem: yes which is fine for ceph as we have a singel pool for vms | 15:46 |
mriedem | i'd prefer not to have a config option for this behavior, | 15:47 |
mriedem | you could add a variable on the driver itself, | 15:47 |
mriedem | which could be controlled by the driver depending on if the vcenter driver is doing this shared storage pool modeling thing | 15:47 |
cdent | yeah, I'm not even clear if it should ever be upstreamed | 15:47 |
mriedem | the RT would check that, and by default we do as we do today | 15:47 |
* cdent nods | 15:47 | |
mriedem | i've done similar hacks in the RT for ironic | 15:47 |
mriedem | b/c they are a special unicorn as well | 15:47 |
mriedem | *we've done | 15:48 |
*** tesseract has joined #openstack-nova | 15:48 | |
cdent | unicorn's are a PITA | 15:48 |
cdent | hmm unicorns too | 15:48 |
mriedem | always pooping out frosting on my yard and such | 15:48 |
cdent | Thanks mriedem, sean-k-mooney, efried this has been very useful. | 15:48 |
cdent | and such | 15:49 |
kashyap | efried: Yes, will review | 15:49 |
cdent | If it turns out I can extract a little bit of 'if this virtdriver magic' i'll make it so | 15:49 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: ksa auth conf and client for cyborg access https://review.opendev.org/631242 | 15:50 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Add Cyborg device profile groups to request spec. https://review.opendev.org/631243 | 15:50 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Create and bind Cyborg ARQs. https://review.opendev.org/631244 | 15:50 |
openstackgerrit | Sundar Nadathur proposed openstack/nova master: WIP: Get resolved Cyborg ARQs and add PCI BDFs to VM's domain XML. https://review.opendev.org/631245 | 15:50 |
*** lpetrut has quit IRC | 15:50 | |
*** tssurya has quit IRC | 15:53 | |
mriedem | gibi: looks like https://review.opendev.org/#/c/637955/31 and https://review.opendev.org/#/c/669188/2 conflict, maybe you want to rebase the fix underneath the bigger change? | 15:53 |
sean-k-mooney | cdent: by the way i assume there are buisness reasons that vmware would prefer not to map each compute node to a specifc data store and use cinder with volume type to express storage policies for everything else | 15:57 |
cdent | sean-k-mooney: yeah, the basic root of all these things is "let the DRS continue to work" | 15:57 |
sean-k-mooney | i know vmware have lots of fancy feature built into the hypervior and storage solution so im guessign the current approch is trying to expose them | 15:57 |
sean-k-mooney | right ok makes sense | 15:58 |
cdent | it creates a huge impedance mismatch between the way nova thinks and the way vmware thinks, but that's the way it goes | 15:58 |
sean-k-mooney | ya hyperv also has some limited clustering of vms and storage pools but its not as advanced as what vcenter does so the impedence miss match is less of an issue for them i guess | 15:59 |
cdent | there's been discussion of tying to listen to events from the DRS via some agent that would then "correct" placement when DRS makes a change | 16:00 |
cdent | but that would require more effort than currently available resources | 16:01 |
gibi | mriedem: will do | 16:01 |
sean-k-mooney | that might require you to reshap allocation if it moved things right if you had a shareing provider per datastore | 16:01 |
sean-k-mooney | although if you had just one RP and hid the datastore i guess it woul just have to modify the max unit | 16:02 |
sean-k-mooney | as it does today | 16:02 |
sean-k-mooney | cdent: also i assume you are more or less the resouces that are available to work on it | 16:03 |
cdent | sean-k-mooney: I consider myself already fully booked | 16:04 |
sean-k-mooney | cdent: yep proably over booked | 16:05 |
prometheanfire | which db field is the 'volumes_attached' field sourced from? | 16:07 |
prometheanfire | smcginnis: you might know? ^ | 16:07 |
mriedem | prometheanfire: block_device_mappings | 16:08 |
smcginnis | mriedem is the expert there. ;) | 16:08 |
prometheanfire | mriedem: thanks, I can't reproduce it, but cinder created a volume fine, but nova was 504 on the attach (it shows up in the db but no action taken on the compute host) | 16:09 |
prometheanfire | restart nova services with debug/verbose and it attaches fine | 16:09 |
prometheanfire | since it was only half added, the api can't delete it | 16:09 |
*** whoami-rajat has quit IRC | 16:10 | |
mriedem | prometheanfire: meaning nova-compute asked cinder to create the volume, right? | 16:10 |
mriedem | bdm source_type != 'volume' | 16:10 |
mriedem | "the api can't delete it" - you mean the cinder api can't delete the volume? or the compute api can't delete the server? | 16:11 |
prometheanfire | mriedem: no, volume created manually (worked), then nova was asked to attach volume (failed) | 16:11 |
prometheanfire | api can't remove it from the instance | 16:11 |
prometheanfire | the failed part half attached it to the instance | 16:11 |
*** BjoernT has joined #openstack-nova | 16:11 | |
prometheanfire | added to block_device_mapping table but no action taken on compute host | 16:12 |
openstackgerrit | Merged openstack/os-vif master: Add Python 3 Train unit tests https://review.opendev.org/669438 | 16:12 |
mriedem | so there is a bdm, and the volume shows up for the server but when you try to detach it what happens? nova gets an error from cinder? | 16:12 |
mriedem | saying it's already detached (not attached) or something? | 16:12 |
prometheanfire | nova questions cinder if it's attached, cinder says no, nova does nothing | 16:13 |
mriedem | as in nova doesn't delete the bdm so it stops showing up as attached to the server | 16:14 |
prometheanfire | does nothing -> says it's not attached, why are you asking me to detach something that's not attached (returned to user) | 16:14 |
mriedem | which release? | 16:14 |
prometheanfire | stein | 16:14 |
prometheanfire | I haven't been able to reproduce this after restarting nova stuff infra side | 16:14 |
prometheanfire | so maybe some timeout or hanging connection? | 16:15 |
*** whoami-rajat has joined #openstack-nova | 16:15 | |
*** BjoernT_ has quit IRC | 16:15 | |
mriedem | my guess is cinder is failing on this call: https://github.com/openstack/nova/blob/86524773b8cd3a52c98409c7ca183b4e1873e2b8/nova/compute/api.py#L4175 | 16:15 |
prometheanfire | maybe, for the detach part | 16:15 |
mriedem | well what error do you get from the compute API? | 16:16 |
mriedem | "Invalid volume: %(reason)s" | 16:16 |
prometheanfire | No volume with a name or ID of '3e92f9b6-1a3b-4a7e-8487-6ff253e888db' exists. | 16:16 |
prometheanfire | since I removed the volumes | 16:17 |
prometheanfire | Invalid volume: Invalid input received: Invalid volume: Unable to detach volume. Volume status must be 'in-use' and attach_status must be 'attached' to detach. (HTTP 400) (Request-ID: req-01d8a3c5-66f4-43a4-bac5-9f1104a292fe) (HTTP 400) (Request-ID: req-358132e4-38a1-493a-b87b-d135791bae2d) | 16:17 |
mriedem | msg = _("Unable to detach volume. Volume status must be 'in-use' " | 16:17 |
mriedem | "and attach_status must be 'attached' to detach.") | 16:17 |
mriedem | yup | 16:17 |
mriedem | that's the begin_detaching call failing | 16:17 |
prometheanfire | because it never actually attached :D | 16:18 |
openstackgerrit | sean mooney proposed openstack/os-vif master: Sync Sphinx requirement https://review.opendev.org/666387 | 16:18 |
mriedem | right i get it | 16:18 |
*** tesseract has quit IRC | 16:18 | |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: nova-manage: heal port allocations https://review.opendev.org/637955 | 16:18 |
gibi | mriedem: rebased ^^ | 16:18 |
mriedem | prometheanfire: interesting you wouldn't hit this in the compute service when the attach fails and delete the bdm https://github.com/openstack/nova/blob/86524773b8cd3a52c98409c7ca183b4e1873e2b8/nova/compute/manager.py#L5722 | 16:19 |
prometheanfire | haven't been able to reproduce after restart of services, but am leaving it in debug mode in case it happens again | 16:20 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Move consts from neutronv2/api to constants module https://review.opendev.org/668945 | 16:21 |
mriedem | were there errors in the compute log after the failed attach about not being able to delete the bdm? | 16:21 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Use neutron contants in cmd/manage.py https://review.opendev.org/668946 | 16:21 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Add 'resource_request' to neutronv2/constants https://review.opendev.org/668947 | 16:21 |
prometheanfire | compute didn't even attempt to attach from what I could see | 16:22 |
prometheanfire | didn't get that far | 16:22 |
*** davidsha has quit IRC | 16:22 | |
mriedem | so...the rpc cast from nova api to nova compute failed? | 16:22 |
mriedem | you said you got a 504 somewhere | 16:22 |
prometheanfire | 504 from nova-api to client | 16:22 |
prometheanfire | openstackclient | 16:23 |
mriedem | ok i don't know why that would happen | 16:23 |
prometheanfire | I'm fine waiting til I can reproduce, I've left stuff in debug | 16:24 |
sean-k-mooney | stephenfin: jangutter so as i was saying on the placmeent channel ... i adressed my nit in https://review.opendev.org/#/c/666387/2 if ye want to take a look at that | 16:26 |
openstackgerrit | Balazs Gibizer proposed openstack/nova master: Translatable output strings in heal allocation https://review.opendev.org/668925 | 16:28 |
*** cdent has quit IRC | 16:33 | |
sean-k-mooney | aspiers: so you know that domaincap api you added for sev ... its broken | 16:33 |
*** dtantsur is now known as dtantsur|afk | 16:34 | |
sean-k-mooney | we cant default to q35 here https://github.com/openstack/nova/blob/master/nova/virt/libvirt/host.py#L744-L745 the logic breaks if you have emulators installed that dont support it | 16:34 |
sean-k-mooney | currently we dont call this code in nova yet because your sev stuff that uses it has not merged yet but i tried to use it for my device model filter stuff and it expolodes beautifully | 16:35 |
*** igordc has joined #openstack-nova | 16:37 | |
efried | mriedem: I'm going to bump current runways by, what say, 2 days due to last week's holiday? | 16:38 |
mriedem | shrug | 16:39 |
mriedem | not sure how much it matters | 16:39 |
efried | aspiers: SEV series is in merge conflict - any chance of getting that rebased soon? | 16:40 |
*** ttsiouts has quit IRC | 16:43 | |
*** ttsiouts has joined #openstack-nova | 16:44 | |
*** ivve has joined #openstack-nova | 16:48 | |
*** ttsiouts has quit IRC | 16:49 | |
*** rpittau is now known as rpittau|afk | 16:50 | |
*** luksky11 has quit IRC | 16:51 | |
sean-k-mooney | mriedem: so quick question. https://review.opendev.org/#/c/659703/8 change the behavior of force config drive to remove the config drive after first boot | 16:54 |
sean-k-mooney | to me that seams like a regression as it could lead to data lose in the form of injected files if the user does not copy the injected files form the config_drive or if the operator customerises the config dirve contentce with data scripts that run on each boot | 16:55 |
*** psachin has quit IRC | 16:55 | |
sean-k-mooney | if you are using the dynamic vendor data and do not have teh metadta service deployed is this not an issue? | 16:57 |
sean-k-mooney | the injected files case might be ok if that sets instance.config_drive automaticall but i think https://review.opendev.org/#/c/659703/8/nova/virt/configdrive.py@169 might break some usecauses | 16:58 |
sean-k-mooney | as such im not sure how safe it is to backport this | 16:59 |
*** derekh has quit IRC | 17:00 | |
*** mdbooth has quit IRC | 17:02 | |
mriedem | "change the behavior of force config drive to remove the config drive after first boot" - you mean if you boot the server with force_config_drive=True, then change to force_config_drive=False? the bug is actually the opposite - they created a server without a config drive, then changed to force_config_drive=True and after that they can't reboot the servers on the host w/o a config drive since the file doesn't exist | 17:02 |
mriedem | i said on lyarwood's stein backport, "I'd like to move a bit slowly with this one since it's a very latent issue and is a bit of a behavior change, though justified for the reboot issue. It should also be pretty rare (I don't imagine lots of people are changing the force_config_drive value on their computes once they are deployed)." | 17:03 |
mriedem | so there it's been sitting | 17:03 |
sean-k-mooney | mriedem: no what im concerned about is the comment sugges that on a reboot the config drive will be removed | 17:04 |
mriedem | sean-k-mooney: i'm not really following you on what specific use case you think this regressed | 17:04 |
sean-k-mooney | e.g. its only present on first boot | 17:04 |
mriedem | what comment? | 17:04 |
sean-k-mooney | https://review.opendev.org/#/c/660914/1/nova/virt/configdrive.py | 17:05 |
mriedem | from the bug, if you go from no config drive to a forced config drive and reboot the vm, it pukes in libvirt | 17:05 |
mriedem | that's the bug they are fixing | 17:05 |
sean-k-mooney | if we reboot the vm that was started with a config drive launched_at will not be null right | 17:05 |
sean-k-mooney | no im talkinbg about if you boot a vme with force_config=ture | 17:06 |
sean-k-mooney | then you reboot it it should continute to have a config drive | 17:06 |
*** dasp has quit IRC | 17:06 | |
sean-k-mooney | but instance.launched_at will not be None on the second boot | 17:06 |
mriedem | you mean on the reboot | 17:06 |
sean-k-mooney | yes | 17:07 |
sean-k-mooney | boot with force it will have a config drive | 17:07 |
mriedem | so boot with force_config_drive=true, change force_config_drive=false, reboot the vm, the config drive isn't in the vm | 17:07 |
sean-k-mooney | then if you hard reboot it will go away right | 17:07 |
sean-k-mooney | no config change | 17:07 |
sean-k-mooney | if you leave it at force_config_drive=true | 17:07 |
sean-k-mooney | always | 17:07 |
sean-k-mooney | boot a vm | 17:08 |
sean-k-mooney | then hard reboot it | 17:08 |
sean-k-mooney | it should still have a config drive after the reboot | 17:08 |
sean-k-mooney | but on the reboot launched_at would be non None | 17:08 |
sean-k-mooney | so and not instance.launched_at would be false | 17:09 |
mriedem | yeah i see what you're saying, and instance.config_drive is only set from the API? | 17:09 |
sean-k-mooney | i think its set in the flavor or image | 17:09 |
sean-k-mooney | actully no | 17:09 |
sean-k-mooney | you are right | 17:09 |
sean-k-mooney | the api | 17:09 |
sean-k-mooney | so im thinking about deployment where the metadata service is not deployed | 17:09 |
sean-k-mooney | so you use force_config_drive=true to make cloud init work and for things like vendor data or device role tagging | 17:10 |
sean-k-mooney | or file injection with the v2.1 api | 17:10 |
*** igordc has quit IRC | 17:10 | |
sean-k-mooney | the last point is less relevent beacue that is depercated so i would be ok with that on master but im not sure we should backport that change and it might break people on upgrade that dont deploy the metadata service | 17:11 |
mriedem | yeah as i said i wasn't totally comfortable with backporting it | 17:11 |
mriedem | oh wait | 17:12 |
mriedem | i remember digging into this | 17:12 |
mriedem | https://github.com/openstack/nova/blob/master/nova/compute/manager.py#L1758 | 17:12 |
mriedem | instance.config_drive will be set on compute if force_config_drive=True and the user doesn't specify it in the api | 17:13 |
mriedem | so i don't think your scenario holds | 17:13 |
mriedem | https://review.opendev.org/#/c/659703/4/nova/virt/configdrive.py@169 | 17:13 |
mriedem | so tl;dr, you likely need to recreate whatever regression you think there is and report a bug if we're going to talk about reverting that change | 17:14 |
mriedem | and i've got a hungry kid here and need to make lunch | 17:14 |
sean-k-mooney | https://github.com/openstack/nova/blob/master/nova/virt/configdrive.py#L181 | 17:15 |
sean-k-mooney | so if we have already set instace.launched_at then it wont | 17:15 |
sean-k-mooney | mriedem: sure take care of lunch :) | 17:15 |
mriedem | on first create, if the host has force_config_drive=True, we'll set instance.config_drive=True, and on subsequent calls to update_instance we won't update it because "if not True" is False | 17:16 |
mriedem | anyway, like i said, this would be easier if you can actually recreate a problem and report a bug rather than both of us mostly just guessing based on code | 17:16 |
mriedem | but i did look into this earlier on the patch | 17:17 |
sean-k-mooney | i think we need to reorder 1757 and 1758 https://github.com/openstack/nova/blob/master/nova/compute/manager.py#L1758 | 17:17 |
sean-k-mooney | mriedem: ok no worries | 17:17 |
sean-k-mooney | i can test this in my devstack setup and see if it causes an issue or not | 17:17 |
sean-k-mooney | go make food :) ill let you know if i find anything | 17:18 |
*** ralonsoh has quit IRC | 17:19 | |
*** priteau has quit IRC | 17:21 | |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Add host and hypervisor_hostname flag to create server https://review.opendev.org/645520 | 17:21 |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Update AZ admin doc to mention the new way to specify hosts https://review.opendev.org/666767 | 17:21 |
openstackgerrit | Merged openstack/nova stable/rocky: Ignore hw_vif_type for direct, direct-physical vNIC types https://review.opendev.org/667355 | 17:35 |
*** spatel has quit IRC | 17:38 | |
sean-k-mooney | ya so with master if you set force_config_drive=true, then boot a new vm and hard reboot it it will not have the config dirve on teh second boot | 17:40 |
sean-k-mooney | so that intoduced a new bug | 17:40 |
sean-k-mooney | which should be fixable by swapping instance.launched_at and configdrive.update_instace here https://github.com/openstack/nova/blob/master/nova/compute/manager.py#L1758 | 17:41 |
sean-k-mooney | which ill go test now | 17:41 |
*** tbachman has joined #openstack-nova | 17:41 | |
*** igordc has joined #openstack-nova | 17:41 | |
*** maciejjozefczyk has quit IRC | 17:42 | |
*** dasp has joined #openstack-nova | 17:42 | |
*** igordc has quit IRC | 17:43 | |
*** maciejjozefczyk has joined #openstack-nova | 17:45 | |
sean-k-mooney | yep that fixes it | 17:45 |
sean-k-mooney | ill file a bug and push a patch | 17:46 |
*** igordc has joined #openstack-nova | 17:46 | |
sean-k-mooney | the order didnt matter befoce because required_by did not depend on the launched_at field not it does so we need to set that after | 17:47 |
sean-k-mooney | ill try these change teh config to diable force and see if ti does the right thing now | 17:47 |
sean-k-mooney | yep it works corectly after config updte too. old vms keep the config drive an new ones dont have them. and if i go back from false to true old vms dont get a config drive but new vms do | 17:54 |
*** maciejjozefczyk has quit IRC | 17:54 | |
*** ociuhandu_ has joined #openstack-nova | 18:01 | |
*** ociuhandu has quit IRC | 18:03 | |
*** BjoernT_ has joined #openstack-nova | 18:05 | |
*** ociuhandu_ has quit IRC | 18:06 | |
*** BjoernT has quit IRC | 18:06 | |
*** whoami-rajat has quit IRC | 18:20 | |
*** priteau has joined #openstack-nova | 18:23 | |
*** spatel has joined #openstack-nova | 18:28 | |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Add host and hypervisor_hostname flag to create server https://review.opendev.org/645520 | 18:30 |
openstackgerrit | Matt Riedemann proposed openstack/nova master: Update AZ admin doc to mention the new way to specify hosts https://review.opendev.org/666767 | 18:30 |
spatel | sean-k-mooney: one of my vm stuck in powering-on stat so i did | 18:32 |
spatel | nova reset-state --active 368f2b45-c4b8-460e-9269-023ef80a69d1 | 18:32 |
spatel | now now when i start vm it is saying - Cannot 'start' instance 368f2b45-c4b8-460e-9269-023ef80a69d1 while it is in vm_state active (HTTP 409) (Request-ID: req-29b635ac-4d8f-4139-9170-19824f019806) | 18:33 |
mriedem | i'm +2 on "Add host and hypervisor_hostname flag to create server https://review.opendev.org/645520" now | 18:33 |
sean-k-mooney | spatel: you will need to hard reboot it | 18:33 |
spatel | what is the option for command line to hard-reboot? | 18:36 |
spatel | let me try from GUI | 18:36 |
sean-k-mooney | "openstack server reboot --hard" but its available in horizon too | 18:37 |
openstackgerrit | sean mooney proposed openstack/nova master: libvirt: make config drives sticky bug 1835822 https://review.opendev.org/669738 | 18:42 |
openstack | bug 1835822 in OpenStack Compute (nova) "vms loose acess to config drive with CONF.force_config_drive=True after hard reboot" [Medium,Confirmed] https://launchpad.net/bugs/1835822 - Assigned to sean mooney (sean-k-mooney) | 18:42 |
sean-k-mooney | mriedem: ^ that is the fix for the config drive issue | 18:42 |
spatel | sean-k-mooney: that works :)_ | 18:43 |
spatel | you are awesome | 18:43 |
sean-k-mooney | no i just have hit the same proablem | 18:43 |
sean-k-mooney | if you use reset-state to set it to active you need to use hard-reboot to fix it | 18:44 |
*** ociuhandu has joined #openstack-nova | 18:47 | |
*** whoami-rajat has joined #openstack-nova | 18:57 | |
*** BjoernT_ has quit IRC | 19:05 | |
*** ricolin has quit IRC | 19:21 | |
*** wwriverrat has left #openstack-nova | 19:24 | |
openstackgerrit | Matt Riedemann proposed openstack/python-novaclient master: Add host and hypervisor_hostname to create servers https://review.opendev.org/647671 | 19:26 |
*** BjoernT has joined #openstack-nova | 19:28 | |
*** factor has quit IRC | 19:30 | |
*** factor has joined #openstack-nova | 19:30 | |
*** BjoernT_ has joined #openstack-nova | 19:41 | |
*** BjoernT has quit IRC | 19:44 | |
*** icarusfactor has joined #openstack-nova | 19:45 | |
*** eharney has quit IRC | 19:46 | |
*** factor has quit IRC | 19:47 | |
*** factor has joined #openstack-nova | 20:01 | |
*** ociuhandu has quit IRC | 20:02 | |
*** icarusfactor has quit IRC | 20:03 | |
*** bbowen has joined #openstack-nova | 20:14 | |
openstackgerrit | Merged openstack/os-vif master: Sync Sphinx requirement https://review.opendev.org/666387 | 20:23 |
*** luksky11 has joined #openstack-nova | 20:25 | |
*** icarusfactor has joined #openstack-nova | 20:25 | |
*** factor has quit IRC | 20:27 | |
openstackgerrit | Lee Yarwood proposed openstack/nova master: nova-lvm: Disable [validation]/run_validation in tempest.conf https://review.opendev.org/662176 | 20:34 |
openstackgerrit | Lee Yarwood proposed openstack/nova master: nova-lvm: Disable [validation]/run_validation in tempest.conf https://review.opendev.org/662176 | 20:34 |
*** priteau has quit IRC | 20:44 | |
*** spatel has quit IRC | 20:48 | |
*** priteau has joined #openstack-nova | 20:50 | |
*** pcaruana has quit IRC | 20:50 | |
mriedem | oh lyarwood | 20:59 |
mriedem | wrong location for that variable | 20:59 |
*** whoami-rajat has quit IRC | 21:00 | |
*** nicolasbock has joined #openstack-nova | 21:04 | |
*** BjoernT_ has quit IRC | 21:07 | |
sean-k-mooney | mriedem: im going to leave it for this evening but any idea why http://paste.openstack.org/show/754180/ would fail on the last assert. | 21:10 |
sean-k-mooney | it kind of looks like the compute manager code is not running | 21:11 |
sean-k-mooney | i might move it to the fake libvirt functional tests | 21:11 |
sean-k-mooney | but i would have expected this to work the the fake driver | 21:11 |
mriedem | sean-k-mooney: you're not waiting for the server to be ACTIVE for one | 21:12 |
mriedem | unless this is using the CastAsCall fixtured, but i'd avoid that for new tests if possible | 21:12 |
sean-k-mooney | ah yes | 21:12 |
mriedem | you also shouldn't need more than one compute for this test | 21:12 |
sean-k-mooney | oh ya i know im just using a for because it was quick but i belive it just uses self.compute | 21:13 |
sean-k-mooney | i will change that | 21:13 |
sean-k-mooney | but ill try adding teh wait that is proably the issue | 21:13 |
sean-k-mooney | hum no that does not seam to fix it | 21:19 |
sean-k-mooney | oh i didnt wait correctly | 21:23 |
*** priteau has quit IRC | 21:27 | |
*** priteau has joined #openstack-nova | 21:27 | |
*** priteau has quit IRC | 21:29 | |
*** brault has quit IRC | 21:30 | |
sean-k-mooney | ok ill add the rest of the test cases tommorow but adding self._wait_for_state_change(created_server, 'BUILD') solve my issue thanks | 21:30 |
*** ivve has quit IRC | 21:39 | |
*** mriedem has quit IRC | 21:53 | |
*** slaweq has quit IRC | 22:07 | |
*** tjgresha has joined #openstack-nova | 22:10 | |
*** mlavalle has quit IRC | 22:13 | |
*** luksky11 has quit IRC | 22:19 | |
*** slaweq has joined #openstack-nova | 22:23 | |
*** slaweq has quit IRC | 22:28 | |
*** lbragstad has quit IRC | 22:48 | |
*** lbragstad has joined #openstack-nova | 22:50 | |
*** tkajinam has quit IRC | 23:01 | |
*** tkajinam has joined #openstack-nova | 23:01 | |
*** hongbin has quit IRC | 23:11 | |
*** andreaf has quit IRC | 23:15 | |
*** andreaf has joined #openstack-nova | 23:15 | |
*** _alastor_ has quit IRC | 23:23 | |
*** hoonetorg has quit IRC | 23:32 | |
*** rcernin has joined #openstack-nova | 23:36 | |
*** spatel has joined #openstack-nova | 23:41 | |
*** gyee has quit IRC | 23:42 | |
*** hoonetorg has joined #openstack-nova | 23:45 | |
*** BjoernT has joined #openstack-nova | 23:48 | |
*** BjoernT has quit IRC | 23:48 | |
*** gyee has joined #openstack-nova | 23:57 |
Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!