*** k_mouza has joined #openstack-containers | 01:04 | |
*** k_mouza has quit IRC | 01:09 | |
*** openstacking_123 has quit IRC | 01:21 | |
*** threestrands has joined #openstack-containers | 01:21 | |
*** born2bake has quit IRC | 01:34 | |
*** rcernin has quit IRC | 02:01 | |
*** rcernin has joined #openstack-containers | 02:03 | |
*** rcernin has quit IRC | 02:20 | |
*** rcernin has joined #openstack-containers | 02:52 | |
*** dasp has quit IRC | 02:54 | |
*** ricolin has joined #openstack-containers | 02:57 | |
*** dasp has joined #openstack-containers | 03:00 | |
*** sapd1 has joined #openstack-containers | 03:01 | |
*** hongbin has joined #openstack-containers | 03:30 | |
*** threestrands has quit IRC | 03:36 | |
openstackgerrit | Simon Merrick proposed openstack/magnum-ui master: Fix formatting issue in workflow message https://review.opendev.org/732474 | 03:39 |
---|---|---|
*** hongbin has quit IRC | 03:40 | |
*** vishalmanchanda has joined #openstack-containers | 04:05 | |
*** ykarel|away is now known as ykarel | 04:08 | |
*** rcernin has quit IRC | 04:19 | |
*** rcernin has joined #openstack-containers | 04:22 | |
*** udesale has joined #openstack-containers | 04:41 | |
*** k_mouza has joined #openstack-containers | 05:04 | |
*** k_mouza has quit IRC | 05:09 | |
openstackgerrit | Simon Merrick proposed openstack/magnum-ui master: Fix formatting issue in workflow message https://review.opendev.org/732474 | 05:19 |
*** threestrands has joined #openstack-containers | 05:26 | |
openstackgerrit | Simon Merrick proposed openstack/magnum-ui master: Fix formatting issue in workflow message https://review.opendev.org/732474 | 05:42 |
*** nikparasyr has joined #openstack-containers | 05:47 | |
*** faizy98 has quit IRC | 05:54 | |
*** xinliang has joined #openstack-containers | 06:48 | |
*** xinliang has quit IRC | 06:55 | |
*** sapd1 has quit IRC | 06:57 | |
*** ttsiouts has joined #openstack-containers | 07:12 | |
*** born2bake has joined #openstack-containers | 07:25 | |
*** rcernin has quit IRC | 07:48 | |
*** ttsiouts has quit IRC | 08:16 | |
*** threestrands has quit IRC | 08:32 | |
*** strigazi has quit IRC | 08:34 | |
*** strigazi has joined #openstack-containers | 08:35 | |
*** ykarel is now known as ykarel|lunch | 08:43 | |
*** k_mouza has joined #openstack-containers | 09:16 | |
*** ttsiouts has joined #openstack-containers | 09:32 | |
*** ykarel|lunch is now known as ykarel | 09:36 | |
tobias-urdin | im staring myself blind on issues with getting magnum 9.3.0 to properly deploy k8s on Fedora CoreOS 31 | 09:45 |
tobias-urdin | it stops here according to the heat-container-agent journal log | 09:45 |
tobias-urdin | Jun 02 07:57:37 tty-lxwwrmrpn7jt-master-0 podman[2647]: [2020-06-02 07:57:37,575] (heat-config) [DEBUG] Running /var/lib/heat-config/hooks/script < /var/lib/heat-config/deployed/5fc20446-73c6-489a-bbd2-62d45ab49114.json | 09:45 |
tobias-urdin | Jun 02 07:58:05 tty-lxwwrmrpn7jt-master-0 podman[2647]: Command failed, will not cache new data. Command 'os-refresh-config' died with <Signals.SIGTERM: 15>. | 09:45 |
tobias-urdin | Jun 02 07:58:05 tty-lxwwrmrpn7jt-master-0 systemd[1]: Stopping Run heat-container-agent... | 09:45 |
tobias-urdin | all scripts are properly concatenated in /var/lib/heat-config/heat-config-script/5fc20446-73c6-489a-bbd2-62d45ab49114 | 09:45 |
tobias-urdin | so the data is properly read by heat-container-agent and placed in there | 09:46 |
tobias-urdin | but it stops on this line | 09:46 |
tobias-urdin | https://github.com/openstack/magnum/blob/9.3.0/magnum/drivers/common/templates/kubernetes/fragments/make-cert.sh#L171 | 09:46 |
tobias-urdin | but the last curl request is successful https://github.com/openstack/magnum/blob/9.3.0/magnum/drivers/common/templates/kubernetes/fragments/make-cert.sh#L130 | 09:46 |
tobias-urdin | and the /etc/kubernetes/certs/kubelet.crt file contains the certificate | 09:46 |
tobias-urdin | [02/Jun/2020:07:57:51 +0200] "GET /v1/certificates/d3b3cd30-d34a-401b-ad9b-4bc2aba30fe5 HTTP/1.1" 200 1421 "-" "curl/7.70.0" | 09:46 |
tobias-urdin | [02/Jun/2020:07:57:58 +0200] "POST /v1/certificates HTTP/1.1" 201 4200 "-" "curl/7.70.0" | 09:46 |
tobias-urdin | Using these versions; Fedora CoreOS 31.20200505.3.0 Magnum 9.3.0 Heat 13.0.1 | 09:47 |
tobias-urdin | anybody has a clue? since it never continues to the echo commands after the function that calls curl I assume it thinks curl command failed | 09:47 |
tobias-urdin | but none of the logs refer to why the script exited/died so it's very hard to troubleshoot, copied the rest of the script from the /var/lib/heat-config/heat-config-script/5fc20446-73c6-489a-bbd2-62d45ab49114 file and executed it and k8s was properly deployed | 09:48 |
tobias-urdin | there also seems to be a race conditions sometimes when ssh on CoreOS is not ready causing the $ssh_cmd way to get a connection refused on port 22 but that happens more rarely | 09:49 |
tobias-urdin | strigazi: brtknr | 09:49 |
brtknr | tobias-urdin: look inside /var/log/heat-config/heat-config-script | 09:51 |
tobias-urdin | brtknr: yeah, it stops here https://github.com/openstack/magnum/blob/9.3.0/magnum/drivers/common/templates/kubernetes/fragments/make-cert.sh#L171 | 09:57 |
tobias-urdin | sec i'll post a censored log from that line | 09:57 |
brtknr | tobias-urdin: which version of heat container agent are you using? | 09:58 |
tobias-urdin | the default hardcoded in there heat-container-agent:ussuri-dev | 09:58 |
tobias-urdin | http://paste.openstack.org/show/794231/ from the log | 09:58 |
*** udesale has quit IRC | 09:59 | |
tobias-urdin | the log output doesn't help much, other than it stopping at the last curl request there, but api log says it's successful | 09:59 |
tobias-urdin | "POST /v1/certificates HTTP/1.1" 201 4200 "-" "curl/7.70.0" | 09:59 |
tobias-urdin | and /etc/kubernetes/certs/kubelet.crt file is populated | 09:59 |
*** k_mouza has quit IRC | 10:00 | |
brtknr | tobias-urdin: did you censor the api IP address? http://api:9511/v1/certificates | 10:00 |
tobias-urdin | yeah, it's correct there | 10:00 |
brtknr | I dont see any error messages? | 10:01 |
tobias-urdin | me neither, that's why im thrown off, it just stops | 10:02 |
tobias-urdin | heat stack says "timed out" on kube_masters | 10:03 |
tobias-urdin | but the /etc/kubernetes/certs/kubelet.crt is populated from that curl request, and api log (the POST line above) was 201 | 10:03 |
tobias-urdin | do you know if there is any recent fixes I might not have in 9.3.0? that's usually been the case that before, some missed backports, missing a new release to get the new changes out | 10:04 |
tobias-urdin | but i've never got fedora coreos to work, only the very old fedora atomic 27 | 10:05 |
*** ramishra has quit IRC | 10:05 | |
jakeyip | hmm anyone's cluster at train yet? | 10:08 |
tobias-urdin | brtknr: tried another deploy now and noticed something | 10:15 |
tobias-urdin | Jun 02 10:10:07 newtest-z4kds5tnljh2-master-0 zincati[1018]: [INFO ] staged deployment '31.20200505.3.0' available, proceeding to finalize it | 10:15 |
tobias-urdin | Jun 02 10:10:10 newtest-z4kds5tnljh2-master-0 rpm-ostree[1027]: Initiated txn FinalizeDeployment for client(dbus:1.194 unit:zincati.service uid:980): /org/projectatomic/rpmostree1/fedora_coreos | 10:15 |
tobias-urdin | Jun 02 10:10:10 newtest-z4kds5tnljh2-master-0 rpm-ostree[1027]: Finalized deployment; rebooting into 01f074cc6cd88d8d2b43f821da692f2367c101eb4377802cb35092bde0ef02f7 | 10:15 |
tobias-urdin | Jun 02 10:10:10 newtest-z4kds5tnljh2-master-0 systemd-logind[1279]: System is rebooting. | 10:15 |
tobias-urdin | this commit "fcos: Disable zincati auto-updates" https://github.com/openstack/magnum/commit/56f3be8bcf42e7772385e843355a5963705e9f2b | 10:15 |
tobias-urdin | is not released so the node is interrupted by auto updates on deploy | 10:15 |
tobias-urdin | jakeyip: yes | 10:15 |
brtknr | tobias-urdin: ah yes, you need that patch but we have yet to release 9.4.0 | 10:15 |
brtknr | waiting for some things to merge | 10:15 |
brtknr | tobias-urdin: workaround is to use latest version of fcos | 10:16 |
brtknr | tobias-urdin: workaround is to use latest stable version of | 10:16 |
tobias-urdin | ok, the above would explain why it stops working at different places all the time, sometimes almost a full deploy | 10:16 |
tobias-urdin | i'll upload new image | 10:16 |
tobias-urdin | and try again | 10:16 |
*** k_mouza has joined #openstack-containers | 10:18 | |
*** ramishra has joined #openstack-containers | 10:27 | |
*** k_mouza has quit IRC | 10:36 | |
*** ramishra has quit IRC | 10:36 | |
*** k_mouza has joined #openstack-containers | 10:38 | |
*** ramishra has joined #openstack-containers | 10:40 | |
*** ramishra has quit IRC | 11:02 | |
*** mgariepy has quit IRC | 11:03 | |
*** ramishra has joined #openstack-containers | 11:03 | |
*** ramishra has quit IRC | 11:08 | |
*** ramishra has joined #openstack-containers | 11:08 | |
openstackgerrit | Bharat Kunwar proposed openstack/magnum master: Source /etc/bashrc where kubectl when kubectl is used https://review.opendev.org/732524 | 11:12 |
brtknr | tobias-urdin: you might also need ^ | 11:14 |
tobias-urdin | brtknr: ok, only 1.18 related thought right, i will only test with 1.17 as latest according to the compatibility matrix | 11:42 |
tobias-urdin | i will test everything later today and report back | 11:42 |
*** sapd1 has joined #openstack-containers | 11:43 | |
*** ramishra has quit IRC | 11:52 | |
*** k_mouza has quit IRC | 11:58 | |
*** k_mouza has joined #openstack-containers | 11:58 | |
born2bake | Hi brtknr this can be closed - https://storyboard.openstack.org/#!/story/2007741 | 11:59 |
born2bake | errors were due to cinder issues I had | 12:00 |
brtknr | born2bake: ok cool | 12:04 |
brtknr | born2bake: can you briefly explain in the story what the issue you had was? | 12:04 |
born2bake | done | 12:09 |
born2bake | brtknr I am just wondering, how magnum k8s cluster differ from kubespray-vanilla-kubeadm clusters? | 12:13 |
brtknr | Magnum doesnt use kubeadm for a start born2bake | 12:14 |
*** mgariepy has joined #openstack-containers | 12:14 | |
brtknr | Magnum cluster comes bootstrapped with OCCM and allows you to use openstack credential for auth | 12:14 |
*** ramishra has joined #openstack-containers | 12:16 | |
*** ttsiouts has quit IRC | 12:36 | |
*** sapd1 has quit IRC | 12:42 | |
*** ttsiouts has joined #openstack-containers | 12:45 | |
*** ramishra has quit IRC | 12:54 | |
*** ramishra has joined #openstack-containers | 12:55 | |
*** munimeha1 has joined #openstack-containers | 13:25 | |
*** openstacking_123 has joined #openstack-containers | 13:26 | |
*** sapd1 has joined #openstack-containers | 13:46 | |
tobias-urdin | brtknr: stuck on this now, I assume kubernetes v1.18 client is in ussuri-dev heat-container-agent? | 13:53 |
tobias-urdin | sh-5.0# kubectl patch node ${INSTANCE_NAME} --patch '{"metadata": {"labels": {"node-role.kubernetes.io/master": ""}}}' | 13:53 |
tobias-urdin | error: no configuration has been provided, try setting KUBERNETES_MASTER environment variable | 13:53 |
tobias-urdin | setting KUBERNETES_MASTER=http://localhost:8080 fixes it | 13:53 |
tobias-urdin | becomes and infinite loop here: https://github.com/openstack/magnum/blob/stable/train/magnum/drivers/common/templates/kubernetes/fragments/enable-services-master.sh#L33 | 13:54 |
tobias-urdin | same as KUBERNETES_MASTER being set for minion should be set for master https://github.com/openstack/magnum/blob/stable/train/magnum/drivers/common/templates/kubernetes/fragments/configure-kubernetes-minion.sh#L305 | 13:54 |
*** nikparasyr has quit IRC | 14:11 | |
*** jmlowe has quit IRC | 14:17 | |
*** udesale has joined #openstack-containers | 14:18 | |
*** jmlowe has joined #openstack-containers | 14:18 | |
*** openstacking_123 has quit IRC | 14:40 | |
*** belmoreira has joined #openstack-containers | 14:55 | |
*** ttsiouts has quit IRC | 15:04 | |
*** belmoreira has quit IRC | 15:07 | |
*** ttsiouts has joined #openstack-containers | 15:12 | |
brtknr | tobias-urdin: this is a known issue | 15:12 |
brtknr | use train-stable-2 tag for now | 15:13 |
brtknr | i am trying to fix it this morning but needs to more: https://review.opendev.org/732524 | 15:13 |
*** jmlowe has quit IRC | 15:13 | |
*** jmlowe has joined #openstack-containers | 15:16 | |
*** ttsiouts has quit IRC | 15:23 | |
tobias-urdin | brtknr: thanks, it's working now when using train-stable-2, i patched scripts to set KUBERNETES_MASTER, will try removing it since might not be needed when using train-stable-2 | 15:42 |
*** ykarel is now known as ykarel|away | 15:43 | |
tobias-urdin | yeah worked without that workaround | 15:52 |
*** sapd1 has quit IRC | 15:59 | |
*** udesale has quit IRC | 16:04 | |
*** sapd1 has joined #openstack-containers | 16:37 | |
*** mgariepy has quit IRC | 17:05 | |
*** yolanda has quit IRC | 17:11 | |
*** k_mouza has quit IRC | 17:48 | |
*** sapd1 has quit IRC | 17:53 | |
born2bake | brtknr I am wondering, is octavia supported? If I want to use LoadBalancer service types within k8s cluster, cloud provider does support it right? Is magnum able to deploy it automatically as well? | 18:06 |
*** mgariepy has joined #openstack-containers | 18:25 | |
*** ttsiouts has joined #openstack-containers | 19:25 | |
*** k_mouza has joined #openstack-containers | 19:48 | |
*** ttsiouts has quit IRC | 19:52 | |
*** k_mouza has quit IRC | 19:53 | |
*** k_mouza has joined #openstack-containers | 19:57 | |
*** k_mouza has quit IRC | 20:02 | |
*** k_mouza has joined #openstack-containers | 20:03 | |
*** vishalmanchanda has quit IRC | 20:05 | |
*** ttsiouts has joined #openstack-containers | 20:06 | |
*** ttsiouts has quit IRC | 20:10 | |
*** ricolin has quit IRC | 20:38 | |
*** k_mouza has quit IRC | 21:00 | |
*** rcernin has joined #openstack-containers | 22:57 | |
*** rcernin has quit IRC | 22:59 | |
*** rcernin has joined #openstack-containers | 23:23 | |
openstackgerrit | Feilong Wang proposed openstack/magnum master: [WIP]Fix proxy issue for etcd and kubelet https://review.opendev.org/733027 | 23:32 |
*** k_mouza has joined #openstack-containers | 23:52 | |
*** k_mouza has quit IRC | 23:52 | |
jakeyip | tobias-urdin: how are you setting up your cloud? puppet / ansible / something else? | 23:54 |
*** born2bake has quit IRC | 23:56 |
Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!