*** ianychoi has joined #airshipit | 00:01 | |
*** sthussey has quit IRC | 00:24 | |
*** kaspars__ has quit IRC | 02:41 | |
*** irclogbot_2 has quit IRC | 03:01 | |
*** irclogbot_3 has joined #airshipit | 03:01 | |
*** roman_g has quit IRC | 04:35 | |
*** aojea has joined #airshipit | 06:34 | |
*** mnaser has quit IRC | 07:06 | |
*** mnaser has joined #airshipit | 07:06 | |
*** happyhemant has joined #airshipit | 07:44 | |
*** lemko has joined #airshipit | 08:44 | |
*** roman_g has joined #airshipit | 09:30 | |
*** dimitris__ has quit IRC | 10:23 | |
*** fdegir has quit IRC | 12:45 | |
*** georgk has quit IRC | 12:45 | |
*** fdegir has joined #airshipit | 12:46 | |
*** georgk has joined #airshipit | 12:46 | |
*** altlogbot_0 has joined #airshipit | 13:03 | |
*** aaronsheffield has joined #airshipit | 13:13 | |
*** kranthikirang has joined #airshipit | 13:28 | |
*** altlogbot_0 has quit IRC | 13:32 | |
*** kranthikirang1 has joined #airshipit | 13:32 | |
*** altlogbot_0 has joined #airshipit | 13:32 | |
*** kranthikirang has quit IRC | 13:35 | |
*** altlogbot_0 has quit IRC | 13:38 | |
*** altlogbot_0 has joined #airshipit | 13:38 | |
kranthikirang1 | @team, anyone help me to figure out problem with MAAS deploy node step? I am seeing a failure while actually booting from local disk; When I try bootstrapping the node manually through MAAS using same partition table, disks and interfaces, everything works properly; | 13:46 |
---|---|---|
*** sthussey has joined #airshipit | 13:52 | |
openstackgerrit | Merged openstack/airship-pegleg master: Ensure cryptostrings contain all char types https://review.openstack.org/651812 | 13:56 |
*** altlogbot_0 has quit IRC | 14:00 | |
kranthikirang1 | The main difference I see is when we boot via dry-dock it does download bootactions/files from maas for that particular node and try to execute those actions and fails; I wonder if there is a way to control them using the site specs; Where as if we I do manually its not actually getting the bootactions | 14:02 |
*** altlogbot_1 has joined #airshipit | 14:25 | |
openstackgerrit | Dimitrios Markou proposed openstack/airship-treasuremap master: [WIP] Create documentation for airsloop site https://review.openstack.org/651652 | 14:26 |
*** kaspars__ has joined #airshipit | 14:27 | |
*** altlogbot_1 has quit IRC | 14:29 | |
*** altlogbot_1 has joined #airshipit | 14:29 | |
*** altlogbot_1 has quit IRC | 14:33 | |
*** altlogbot_1 has joined #airshipit | 14:33 | |
*** altlogbot_1 has quit IRC | 14:33 | |
*** nick_kar has quit IRC | 14:34 | |
openstackgerrit | Dimitrios Markou proposed openstack/airship-treasuremap master: [WIP] Create documentation for airsloop site https://review.openstack.org/651652 | 14:35 |
openstackgerrit | Kaspars Skels proposed openstack/airship-treasuremap master: [wip] Add sloop type and airsloop site https://review.openstack.org/649195 | 14:37 |
kranthikirang1 | Is there anyway we can modify drydock bootactions using site spces? I see they are needed to continue with kubernetes join ..etc but how can I at least see what its trying to do with boot scripts | 14:46 |
openstackgerrit | Samuel Pilla proposed openstack/airship-specs master: [WIP] (armada) Chart Time Metrics https://review.openstack.org/652092 | 14:47 |
*** altlogbot_0 has joined #airshipit | 14:47 | |
openstackgerrit | Merged openstack/airship-promenade master: Log client-id in UCP API endpoints https://review.openstack.org/634071 | 14:51 |
*** altlogbot_0 has quit IRC | 14:51 | |
*** altlogbot_0 has joined #airshipit | 14:52 | |
*** altlogbot_0 has quit IRC | 14:55 | |
*** altlogbot_0 has joined #airshipit | 14:56 | |
*** altlogbot_0 has quit IRC | 14:56 | |
*** altlogbot_1 has joined #airshipit | 14:58 | |
*** lemko has quit IRC | 15:03 | |
*** pkaralis has quit IRC | 15:06 | |
openstackgerrit | Kaspars Skels proposed openstack/airship-treasuremap master: Sloop type and airsloop site https://review.openstack.org/649195 | 15:21 |
openstackgerrit | Dimitrios Markou proposed openstack/airship-treasuremap master: [WIP] Create documentation for airsloop site https://review.openstack.org/651652 | 15:24 |
openstackgerrit | Kaspars Skels proposed openstack/airship-treasuremap master: Sloop type and Airsloop site https://review.openstack.org/649195 | 15:25 |
*** roman_g has quit IRC | 15:32 | |
*** roman_g has joined #airshipit | 15:34 | |
openstackgerrit | Merged openstack/airship-treasuremap master: Update docs to include generated certs into collected dir https://review.openstack.org/639181 | 15:39 |
*** aojea has quit IRC | 15:59 | |
openstackgerrit | Scott Hussey proposed openstack/airship-promenade master: (apiserver) [WIP] support key rotation https://review.openstack.org/631935 | 16:23 |
openstackgerrit | Dimitrios Markou proposed openstack/airship-treasuremap master: [WIP] Create documentation for airsloop site https://review.openstack.org/651652 | 16:52 |
openstackgerrit | Dimitrios Markou proposed openstack/airship-treasuremap master: [WIP] Create documentation for airsloop site https://review.openstack.org/651652 | 16:55 |
*** happyhemant has quit IRC | 17:19 | |
openstackgerrit | Dmitrii Kabanov proposed openstack/airship-drydock master: Add possibility to check response code in auth test https://review.openstack.org/651935 | 18:05 |
*** irclogbot_3 has quit IRC | 18:08 | |
*** irclogbot_3 has joined #airshipit | 18:10 | |
*** roman_g has quit IRC | 18:26 | |
openstackgerrit | Scott Hussey proposed openstack/airship-promenade master: apiserver support for etcd encryption https://review.openstack.org/628290 | 18:54 |
openstackgerrit | Scott Hussey proposed openstack/airship-promenade master: (apiserver) [WIP] support key rotation https://review.openstack.org/631935 | 18:54 |
openstackgerrit | Dmitrii Kabanov proposed openstack/airship-drydock master: Add possibility to check response code in auth test https://review.openstack.org/651935 | 18:59 |
kaspars__ | @kranthikirang | 19:18 |
kaspars__ | you can look in the MAAS node logs | 19:18 |
kaspars__ | it should show some errors related to pulling bootaction, etc | 19:19 |
kaspars__ | if the node actually manage to deploy | 19:19 |
kaspars__ | you can SSH into the server and then do | 19:19 |
kaspars__ | grep -R prom /var/log/syslog | 19:19 |
kaspars__ | it should show promenade/bootaction related logs | 19:19 |
openstackgerrit | Evgeniy L proposed openstack/airship-treasuremap master: Enable nested virtualization by default https://review.openstack.org/652139 | 19:40 |
openstackgerrit | Drew Walters proposed openstack/airship-armada master: CI: Add Airskiff check https://review.openstack.org/599020 | 19:58 |
openstackgerrit | Drew Walters proposed openstack/airship-armada master: CI: Add Airskiff check https://review.openstack.org/599020 | 20:38 |
openstackgerrit | Sean Eagan proposed openstack/airship-armada master: Introduce v2 docs https://review.openstack.org/648246 | 20:43 |
openstackgerrit | Merged openstack/airship-drydock master: Add possibility to check response code in auth test https://review.openstack.org/651935 | 20:52 |
openstackgerrit | Kaspars Skels proposed openstack/airship-treasuremap master: Sloop type and Airsloop site https://review.openstack.org/649195 | 21:11 |
openstackgerrit | Merged openstack/airship-promenade master: Change image pull policy from Always to IfNotPresent. https://review.openstack.org/626832 | 21:19 |
openstackgerrit | Drew Walters proposed openstack/airship-armada master: CI: Add Airskiff check https://review.openstack.org/599020 | 21:25 |
kranthikirang1 | kaspars__: Node is not being deployed completely; When it tries to boot from local hard disk it fails with "mdadm: CREATE group disk not found"; I think its happening when its try to boot to the kernel_package we pass; | 21:39 |
kaspars__ | interesting - so I have had some strange issues when missing HWE and GA for comissioning/deployment. In other words they both need to match. I have used GA kernel for airsloop (simplified demo/lab reference site) setup which looks like this | 21:43 |
kaspars__ | - maas.yaml - https://review.openstack.org/#/c/649195/51/type/sloop/charts/ucp/comps/maas.yaml | 21:43 |
kaspars__ | - https://review.openstack.org/#/c/649195/51/site/airsloop/profiles/host/compute.yaml | 21:43 |
kaspars__ | I used the same kernel as you suggested yesterday and DELL 720xd server deployed well (I only have 1 of those in this setup) | 21:43 |
kaspars__ | it could be something related to MAAS/hardware and kernels - so you may want to try different variations, etc | 21:44 |
kaspars__ | also - in some cases if your server has non-writable flash drive - you may want to disable that in bios - I have seen that causing issues | 21:45 |
kaspars__ | otherwise - I think I'm out of tips - this could be something related to MAAS and particular HW support related.. | 21:45 |
openstackgerrit | Sean Eagan proposed openstack/airship-armada master: [WIP]: Improve wait API and semantics in v2 docs https://review.openstack.org/636440 | 21:52 |
kranthikirang1 | kaspars__: OK, When I use MAAS to deploy using ga-16.04 manually everything works like charm. Only change I have observed in while passing the kernel_package as a kernel parameter while deploying the node; I am not sure if that is causing any issues with HP gen9 | 22:08 |
kranthikirang1 | kaspars__: However when I use hwe-16.04 manually also enlisting the node itself is failing in one of the node with kernel panic | 22:09 |
kranthikirang1 | Now I am trying to see what happens if we remove kernel_package parameter in profile | 22:09 |
kaspars__ | I mean make sure that you actually set MAAS to use GA for comissioing also | 22:27 |
kaspars__ | if you see my samples there - you need to update 2 files | 22:27 |
kaspars__ | 1) the maas.yaml as well as host profile | 22:28 |
kaspars__ | sure - you can also remove kernel paramater completely - so it would use some default/latest | 22:28 |
kaspars__ | and that might work better. | 22:28 |
kaspars__ | I like locking it down as it would not change over time - or let's say on new deployment after a month or so Ubuntu may already have a newer kernel version | 22:29 |
kaspars__ | (when you deploy it manually - you may check which kernel version it used and might as well directly use that..) | 22:30 |
kranthikirang1 | kaspars__: I have tried using matching kernel version(linux-image-4.4.0-142-generic)\ but that didn't solve my problem; | 22:35 |
kranthikirang1 | I did change in maas.yaml and host profile | 22:36 |
openstackgerrit | Alexander Noskov proposed openstack/airship-promenade master: Replace kubectl and kubelet binary with hyperkube image https://review.openstack.org/652162 | 22:52 |
openstackgerrit | Alexander Noskov proposed openstack/airship-promenade master: Add shell autocompletion for kubectl https://review.openstack.org/652163 | 22:52 |
kranthikirang1 | kaspars__: Disabling kernel_pacakges solved my problem; Now able to deploy the nodes | 22:58 |
kaspars__ | awesome! promjoin worked? did nodes join k8s cluster? | 23:13 |
kaspars__ | maybe you can mark which kernel worked - so I might update airsloop to use "potentially more boradly working" kernel | 23:13 |
kaspars__ | just doing 'sudo kubectl get nodes -o wide' should show which kernel is deployed | 23:14 |
kranthikirang1 | kaspars__: I see nodes are joined and ceph-osds are up now :) | 23:16 |
kranthikirang1 | 4.4.0-145-generic | 23:17 |
kranthikirang1 | but my genesis is using 4.4.0-142-generic; I was a bit skeptical to boot genesis to newly installed kernel | 23:17 |
kranthikirang1 | for this POC; I can live with that :) | 23:18 |
kaspars__ | haha sure! sounds like a plan! this is good news for a Friday :) | 23:18 |
kaspars__ | thanks for the info on the kernel - I might use that as well.. for the airsloop simplified site. | 23:18 |
kranthikirang1 | I have used ga-16.04 (for enlisting and commission, no minimum kernel) and no kernel_package supplied for deploy node | 23:19 |
kranthikirang1 | kaspars__: What is the PS for airsloop? Perhaps I can also contribute more there :) would love to | 23:19 |
kaspars__ | Sloop type and Airsloop site Sloop type/site is a minimalistic approach to Airship with reduced requirements towards hardware and external dependencies while keeping all the functional features. | 23:20 |
kaspars__ | something I'm working on right now | 23:20 |
kaspars__ | https://review.openstack.org/#/c/649195/ | 23:20 |
kranthikirang1 | Can I add HP gen9 templates to airship-seaworthy so that people can use? | 23:20 |
kaspars__ | it has 1 control, and 1 compute by default - should be good for demo/lab setups to easier get onto using airship | 23:20 |
kranthikirang1 | kaspars__: sounds good; Will follow you | 23:20 |
kaspars__ | yes, I think host profiles would fit into global/profiles/host | 23:21 |
kaspars__ | where we could create library of different servers - so feel free to create a PS | 23:21 |
kaspars__ | let's see if others like it as well | 23:21 |
kranthikirang1 | OK | 23:21 |
kranthikirang1 | will do that | 23:21 |
kaspars__ | (I would put it on global, not seaworthy site level) | 23:21 |
kranthikirang1 | sure; but in seaworthy we have dell example as well | 23:22 |
kaspars__ | yeah - I'm not sure if that was the best idea. | 23:22 |
kaspars__ | I think it was meant at the time more like when people would create custom sites | 23:22 |
kaspars__ | they would know when to place their profiles, etc | 23:22 |
kaspars__ | as it may be hard to look around global, etc | 23:23 |
kaspars__ | (potentially even if the same HW it may differ in terms of disks, and number of NIC cards) | 23:23 |
kaspars__ | in other words - global is something that can be shared between multiple sites | 23:24 |
kaspars__ | and host/hardware profiles seem to fit the bill.. | 23:24 |
kranthikirang1 | OK; I see in global cp.yaml and dp.yaml already | 23:28 |
kranthikirang1 | and generic.yaml for hardware | 23:28 |
kaspars__ | right, exactly.. so we might move the DELL stuff over as well | 23:28 |
kranthikirang1 | I thought r720.yaml for dell in site specs was an example | 23:28 |
kaspars__ | airship-seaworthy including host/hardware profiles are definetely working site documents | 23:29 |
kaspars__ | the manifests are gated with airship-seaworthy - so it's a working 6 server site | 23:29 |
kaspars__ | they are much like a reference/examples for sure | 23:30 |
kaspars__ | I'm not exactly sure how compatible those would be to other DELL r720xd servers | 23:30 |
kranthikirang1 | yeah, have to modify the profile based on disks, interfaces and other variables like bios version ..etc | 23:31 |
kaspars__ | right! have a good weekend and congrats on getting already this far! :) | 23:32 |
*** aaronsheffield has quit IRC | 23:40 | |
kranthikirang1 | kaspars__: Thank you; I see my compute nodes are also part of k8s cluster now; Need to see how openstack-helm comes up | 23:46 |
*** sthussey has quit IRC | 23:47 | |
kranthikirang1 | kaspars__: I see Armada misses proxy information and no site specs/global specs have this configuration | 23:54 |
kranthikirang1 | I will create a PS on Monday | 23:54 |
kranthikirang1 | 2 23:53:48,533 INFO base_task_runner.py:101:_read_task_logs base_task_runner Job 64: Subtask armada_post_apply armada.exceptions.api_exceptions.ClientError: Error - received 500: {"type": "error", "message": "Failed to apply manifest: ('Git exception occurred, [', 'https://git.openstack.org/openstack/openstack-helm-infra', '] may not be a valid git repository.')", "retry": false} | 23:55 |
kranthikirang1 | but I have to re-do everything | 23:55 |
Generated by irclog2html.py 2.15.3 by Marius Gedminas - find it at mg.pov.lt!