@nexn:matrix.org | Thank you NilesX. I did everything according to the document, but could you clarify the RAM requirement? I have only 48 GB of RAM, while the document suggests 64 GB. Is that why I am unable to unlock the controller host? | 06:32 |
---|---|---|
@maloute:matrix.org | I ran my test with only 40 GB of RAM and it worked fine; at least I succeeded in unlocking | 06:38 |
@maloute:matrix.org | And from this screenshot, I'm not sure, but it seems to indicate that the Docker images were not in the registry and that it downloaded them successfully, so to me that does not seem to be the problem | 06:45 |
@nexn:matrix.org | Okay, that's fine. While running the bootstrap playbook, it reports a 404 client error and "manifest unknown". What could be the reason? | 06:45 |
@maloute:matrix.org | StarlingX maintains a local Docker registry for managing the Docker images for your cluster; on a first-time installation it needs to populate it because it's empty | 06:47 |
@nexn:matrix.org | When I try to unlock the host, it raises a major alarm saying that the configuration is out of date. May I know why? | 06:49 |
@maloute:matrix.org | Did you go through to the end of the install process before unlocking? | 06:51 |
@nexn:matrix.org | Yes, we did it. | 06:52 |
@maloute:matrix.org | I don't have an answer to your question, but I did a couple of reinstalls before having a functional cluster | 06:52 |
@nexn:matrix.org | When I check the health status of the k8s cluster, it shows that the "all hosts provisioned" check has failed. What could be the reason? | 06:55 |
@maloute:matrix.org | What does the command `system host-unlock controller-0` say? | 06:59 |
@maloute:matrix.org | Did it reboot? Normally your controller should reboot after a couple of minutes | 07:17 |
@nexn:matrix.org | Should I reboot it myself, or should it reboot automatically after the `system host-unlock controller-0` command? | 07:19 |
@nexn:matrix.org | After running `system host-unlock controller-0`, it shows a "manifest timeout" notification and does not reboot automatically | 07:22 |
@maloute:matrix.org | It should reboot automatically, but it's not instant; it rebooted after 3-4 minutes for me | 07:23 |
@nexn:matrix.org | It doesn't reboot even after 10-15 minutes; instead it throws the "manifest timeout" notification | 07:25 |
@maloute:matrix.org | OK, so it does seem related to the screenshot with the "manifest unknown" message. I thought that message appeared before the images were downloaded successfully and could be disregarded, but that seems not to be the case. | 07:35 |
@maloute:matrix.org | Did you try a cluster reinstall to see if you have the same problem? | 07:35 |
@nexn:matrix.org | Kindly share how to do a cluster reinstall, because I am not aware of it | 08:56 |
@maloute:matrix.org | Well, it's the same as your first install: boot from a USB key with StarlingX on it and install 😁 The installer will wipe your current installation | 09:01 |
@nexn:matrix.org | Okay, I got it. I have done it many times. | 09:02 |
@nexn:matrix.org | Do SR-IOV and DPDK have any influence on unlocking the host? | 09:03 |
@maloute:matrix.org | Not sure about that, I tried without SR-IOV and DPDK | 09:04 |
@nexn:matrix.org | I am also doing it without SR-IOV and DPDK | 09:05 |
@nexn:matrix.org | Kindly share any documents or references for troubleshooting this issue | 09:07 |
@nexn:matrix.org | Is the above step necessary? | 09:13 |
@maloute:matrix.org | No I didn't use the vault steps, only used a localhost.yaml file | 09:47 |
@nexn:matrix.org | Okay, so you created a localhost.yaml file with a familiar text editor like vi, vim, or nano. Am I right? | 09:49 |
@maloute:matrix.org | Yep | 09:49 |
@nexn:matrix.org | Cool. Did you edit /usr/share/ansible/stx-ansible/playbooks/bootstrap.yml or not? | 09:51 |
@nexn:matrix.org | So you just create the localhost.yaml file and follow the procedure mentioned in the link, is that right? | 09:53 |
@maloute:matrix.org | I didn't edit this bootstrap file; I just created an empty /home/sysadmin/localhost.yml file and put some config lines in it | 09:53 |
@nexn:matrix.org | Could you please share the full contents of your localhost.yaml? | 09:55 |
@maloute:matrix.org | ``` | 09:57 |
sysadmin@localhost:~$ cat <<EOF > localhost.yml | ||
system_mode: simplex | ||
dns_servers: | ||
- 1.1.1.1 | ||
external_oam_subnet: 10.X.22.0/24 | ||
external_oam_gateway_address: 10.X.22.254 | ||
external_oam_floating_address: 10.X.22.10X | ||
admin_username: admin | ||
admin_password: <yourwebadminpassword> | ||
ansible_become_pass: <sysadminaccountpassword> | ||
EOF | ||
``` | ||
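The override file above could also be generated from shell variables, so that the OAM addresses and passwords are filled in at one place. This is only a minimal sketch using the same keys maloute shared; the function name `write_overrides`, the `ADMIN_PASSWORD` and `SYSADMIN_PASSWORD` variables, and the example addresses are assumptions made for illustration and are not part of StarlingX.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not a StarlingX tool): writes a bootstrap override
# file containing the same keys as the localhost.yml shown above.
# Usage: write_overrides <output-file> <oam-subnet> <oam-gateway> <oam-floating-ip>
write_overrides() {
    local out="$1" subnet="$2" gateway="$3" floating="$4"

    # Refuse to write a file with empty passwords.
    if [ -z "${ADMIN_PASSWORD:-}" ] || [ -z "${SYSADMIN_PASSWORD:-}" ]; then
        echo "set ADMIN_PASSWORD and SYSADMIN_PASSWORD first" >&2
        return 1
    fi

    cat > "$out" <<EOF
system_mode: simplex
dns_servers:
  - 1.1.1.1
external_oam_subnet: ${subnet}
external_oam_gateway_address: ${gateway}
external_oam_floating_address: ${floating}
admin_username: admin
admin_password: ${ADMIN_PASSWORD}
ansible_become_pass: ${SYSADMIN_PASSWORD}
EOF
}
```

It might be called as `write_overrides "$HOME/localhost.yml" 192.0.2.0/24 192.0.2.254 192.0.2.10` after exporting the two password variables; the 192.0.2.0/24 addresses are documentation placeholders, not values from this conversation.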
@nexn:matrix.org | ansible-playbook --ask-vault-pass /usr/share/ansible/stx-ansible/playbooks/bootstrap.yml | 09:58 |
Does this command work fine for you? | ||
@nexn:matrix.org | Thanks. | 09:59 |
@maloute:matrix.org | I did not use the vault part; I created the localhost.yml file and launched this command: `sysadmin@localhost:~$ ansible-playbook /usr/share/ansible/stx-ansible/playbooks/bootstrap.yml` | 10:00 |
@nexn:matrix.org | Okay, fine. Does the vault part affect the unlocking of my host? | 10:01 |
@maloute:matrix.org | From my understanding, the vault part is not linked to your problem; it's just a way to store passwords in a vault so they are not in clear text | 10:03 |
@nexn:matrix.org | Yeah. You are right | 10:03 |
@nexn:matrix.org | Could you kindly share the commands you ran during your deployment? | 10:04 |
@nexn:matrix.org | I followed this link: https://docs.starlingx.io/r/stx.8.0/deploy_install_guides/release/bare_metal/aio_simplex_install_kubernetes.html | 10:05 |
@nexn:matrix.org | If you share them, it would help me find where I went wrong. I just need the sequence of commands. Is that possible? | 10:09 |
@maloute:matrix.org | Let me see if I can compile this | 10:13 |
@nexn:matrix.org | Yes, please. I am also compiling a simple document of what I did, and I will send it to you as well | 10:16 |
@maloute:matrix.org | Here you go: | 10:17 |
``` | ||
export CONTROLLER0_OAM_CIDR=10.X.22.102/24 | ||
export DEFAULT_OAM_GATEWAY=10.X.22.254 | ||
sudo ip address add $CONTROLLER0_OAM_CIDR dev eno2 | ||
sudo ip link set up dev eno2 | ||
sudo ip route add default via $DEFAULT_OAM_GATEWAY dev eno2 | ||
 | ||
sudo vi /etc/resolv.conf | ||
nameserver 1.1.1.1 | ||
sysadmin@localhost:~$ sudo ntpdate pool.ntp.org | ||
 | ||
sysadmin@localhost:~$ cat <<EOF > localhost.yml | ||
system_mode: simplex | ||
dns_servers: | ||
- 1.1.1.1 | ||
external_oam_subnet: 10.X.22.0/24 | ||
external_oam_gateway_address: 10.X.22.254 | ||
external_oam_floating_address: 10.X.22.10X | ||
admin_username: admin | ||
admin_password: <yourwebadminpassword> | ||
ansible_become_pass: <sysadminaccountpassword> | ||
EOF | ||
 | ||
sysadmin@localhost:~$ ansible-playbook /usr/share/ansible/stx-ansible/playbooks/bootstrap.yml | ||
 | ||
sysadmin@localhost:~$ source /etc/platform/openrc | ||
[sysadmin@localhost ~(keystone_admin)]$ OAM_IF=eno2 | ||
[sysadmin@localhost ~(keystone_admin)]$ system host-if-modify controller-0 $OAM_IF -c platform | ||
[sysadmin@localhost ~(keystone_admin)]$ system interface-network-assign controller-0 $OAM_IF oam | ||
[sysadmin@localhost ~(keystone_admin)]$ system ntp-modify ntpservers=0.pool.ntp.org,1.pool.ntp.org | ||
 | ||
system host-label-assign controller-0 openstack-control-plane=enabled | ||
system host-label-assign controller-0 openstack-compute-node=enabled | ||
system host-label-assign controller-0 openvswitch=enabled | ||
 | ||
system host-cpu-modify -f platform -p0 6 controller-0 | ||
 | ||
system host-fs-list controller-0 | ||
system host-lvg-list controller-0 | ||
system host-fs-modify controller-0 docker=60 | ||
system modify --vswitch_type none | ||
 | ||
export NODE=controller-0 | ||
[sysadmin@localhost ~(keystone_admin)]$ system host-fs-add ${NODE} instances=50 | ||
 | ||
system storage-backend-add ceph --confirmed | ||
system host-disk-list controller-0 | ||
system host-stor-add controller-0 osd <disk-uuid> | ||
system host-stor-list controller-0 | ||
~(keystone_admin)$ system host-unlock controller-0 | ||
``` | ||
@maloute:matrix.org | But it doesn't have any comments | 10:18 |
@nexn:matrix.org | Thanks. What about your external connection for running the bootstrap playbook? | 10:19 |
@nexn:matrix.org | Are your OAM network and the external connectivity used to run the bootstrap playbook the same? | 10:20 |
@maloute:matrix.org | yep | 10:20 |
@maloute:matrix.org | with my 10.1.22.0/24 network, I had external connectivity | 10:21 |
@nexn:matrix.org | So this 10.1.22.0/24 network is used not only for OAM but also for external connectivity? | 10:23 |
@maloute:matrix.org | it was the same for me | 10:23 |
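Since the bootstrap playbook has to download images over the same network as OAM in this setup, external access can be sanity-checked before running it. The following is a hedged sketch, not a StarlingX tool: the function name `check_external_access` is hypothetical, and which registry host names to test is an assumption left to the reader. It only relies on the standard `ip route show default` and `getent hosts` commands.

```shell
#!/usr/bin/env bash
# Hypothetical pre-bootstrap check: verifies that a default route exists
# and that each host name given as an argument resolves through DNS.
# Returns 0 only if every check passes.
check_external_access() {
    local host rc=0

    # A default route is needed to reach anything outside the OAM subnet.
    if ! ip route show default | grep -q '^default'; then
        echo "no default route configured" >&2
        rc=1
    fi

    # Each registry host name must resolve with the configured nameserver.
    for host in "$@"; do
        if getent hosts "$host" >/dev/null; then
            echo "resolves: $host"
        else
            echo "cannot resolve: $host" >&2
            rc=1
        fi
    done

    return "$rc"
}
```

For example, `check_external_access docker.io` could be run after the `ip route add default` and `/etc/resolv.conf` steps. A passing result only shows routing and DNS are in place; it does not prove that image pulls will succeed.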
@nexn:matrix.org | Done my friend. | 10:23 |
@nexn:matrix.org | I will follow your steps and let you know what happens. | 10:24 |
@nexn:matrix.org | May I know how many disks (storage) you have? | 10:26 |
@maloute:matrix.org | I had two disks: one for the system and another one for Ceph | 10:27 |
@nexn:matrix.org | In my environment I have three disks | 10:27 |
@nexn:matrix.org | Were these steps run on your 40 GB machine? | 12:10 |
@maloute:matrix.org | Yes | 12:11 |
@nexn:matrix.org | Okayyy... | 12:11 |
@nexn:matrix.org | How about the data network? It seems that you did not follow the data network steps during your host configuration | 12:13 |
@nexn:matrix.org | How about your hardware specification? | 12:14 |
@maloute:matrix.org | Are you using an application? If not you don't need data interfaces | 12:15 |
@nexn:matrix.org | Are you asking about containerized applications? | 12:16 |
@maloute:matrix.org | Yes, like OpenStack | 12:17 |
@maloute:matrix.org | I'm asking about StarlingX applications | 12:19 |
@maloute:matrix.org | Not Kubernetes apps | 12:19 |
@nexn:matrix.org | Yes, we are using OpenStack. You already said that for OpenStack we need data network interfaces. | 12:20 |
@maloute:matrix.org | Yes, in that doc I had not installed stx-openstack, so there were no data interfaces | 12:22 |
@maloute:matrix.org | But you can add your data interfaces after the first unlock | 12:23 |
@nexn:matrix.org | Yeah I noticed. Thank you | 12:26 |
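The unlock symptoms reported earlier in this log (the out-of-date configuration alarm, the failed health check, and the manifest timeout) can be captured in one file before asking for help. This is a minimal sketch, assuming the `system` and `fm` command-line clients are available after `source /etc/platform/openrc`; the function name `collect_status` and the default output file name are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: saves the output of a few status commands to a
# single file so it can be attached to a question.
# Usage: collect_status [output-file] [host-name]
collect_status() {
    local out="${1:-stx-status.txt}" host="${2:-controller-0}"
    {
        echo "== system host-list =="
        system host-list || echo "(command failed)"
        echo "== system host-show ${host} =="
        system host-show "$host" || echo "(command failed)"
        echo "== system health-query =="
        system health-query || echo "(command failed)"
        echo "== fm alarm-list =="
        fm alarm-list || echo "(command failed)"
    } > "$out" 2>&1
    echo "wrote ${out}"
}
```

Running `collect_status` once before `system host-unlock controller-0` and again after a timeout gives two snapshots to compare. It only gathers output and does not diagnose the cause.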