Thursday, 2020-10-08

ianwclarkb: oh, how confusing.  it seems that xenial hosts have docker setup with journalctl logging, while bionic hosts have json-file00:23
clarkbthat is unexpected00:23
ianwthat's why eavesdrop is spamming itself to syslog, and the other hosts are not recording their output00:24
*** openstackgerrit has joined #opendev00:32
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660500:32
openstackgerritIan Wienand proposed opendev/system-config master: Add initial borg backup server  https://review.opendev.org/75660700:55
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660501:08
*** tkajinam has quit IRC01:52
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660501:53
*** tkajinam has joined #opendev01:53
*** hamalq has quit IRC02:08
*** hamalq has joined #opendev02:09
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Add containerfile element  https://review.opendev.org/70008302:26
openstackgerritIan Wienand proposed openstack/diskimage-builder master: WIP: boot test of containerfile image  https://review.opendev.org/72214802:31
openstackgerritIan Wienand proposed openstack/diskimage-builder master: WIP: boot test of containerfile image  https://review.opendev.org/72214802:32
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660502:36
openstackgerritIan Wienand proposed openstack/diskimage-builder master: Add containerfile element  https://review.opendev.org/70008302:38
openstackgerritIan Wienand proposed openstack/diskimage-builder master: WIP: boot test of containerfile image  https://review.opendev.org/72214802:38
*** hamalq has quit IRC03:01
*** hamalq has joined #opendev03:01
*** hamalq has quit IRC03:04
*** hamalq has joined #opendev03:05
*** ysandeep|away is now known as ysandeep03:16
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660503:33
openstackgerritIan Wienand proposed opendev/system-config master: Remove Ubuntu Xenial ARM64 base testing  https://review.opendev.org/75662703:33
openstackgerritIan Wienand proposed opendev/system-config master: Remove rsyslogd xconsole workaround  https://review.opendev.org/75662803:33
ianwThis configuration file is valid at least on Ubuntu Oneiric,03:34
ianw    Precise and Quantal, which covers the releases we currently use.03:34
ianwit survived until i started fiddling on focal nodes, so pretty good03:34
openstackgerritIan Wienand proposed openstack/diskimage-builder master: WIP: boot test of containerfile image  https://review.opendev.org/72214803:43
openstackgerritIan Wienand proposed opendev/system-config master: ARM64 : run base test on Focal  https://review.opendev.org/75662903:59
openstackgerritIan Wienand proposed opendev/system-config master: ARM64 : run base test on Focal  https://review.opendev.org/75662904:09
openstackgerritIan Wienand proposed opendev/system-config master: Remove rsyslogd xconsole workaround  https://review.opendev.org/75662804:09
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660504:09
*** hamalq has quit IRC04:23
openstackgerritIan Wienand proposed openstack/diskimage-builder master: WIP: boot test of containerfile image  https://review.opendev.org/72214804:49
*** ykarel has joined #opendev04:55
*** marios has joined #opendev05:56
openstackgerritIan Wienand proposed opendev/system-config master: Remove rsyslogd xconsole workaround  https://review.opendev.org/75662806:20
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660506:20
*** Eighth_Doctor has quit IRC06:35
*** mordred has quit IRC06:35
*** ralonsoh has joined #opendev06:37
*** Eighth_Doctor has joined #opendev06:44
*** eolivare has joined #opendev06:52
*** hamalq has joined #opendev06:56
*** hamalq_ has joined #opendev06:58
openstackgerritIan Wienand proposed openstack/diskimage-builder master: WIP: boot test of containerfile image  https://review.opendev.org/72214807:00
*** hamalq has quit IRC07:01
*** slaweq has joined #opendev07:01
*** hashar has joined #opendev07:02
*** fressi has joined #opendev07:04
*** mordred has joined #opendev07:09
*** fressi has quit IRC07:17
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660507:21
*** fressi has joined #opendev07:23
*** rpittau|afk is now known as rpittau07:27
*** andrewbonney has joined #opendev07:27
*** hamalq_ has quit IRC07:33
*** auristor has quit IRC07:33
*** ttx has quit IRC07:33
*** hamalq has joined #opendev07:34
*** auristor has joined #opendev07:38
openstackgerritJavier Peña proposed openstack/diskimage-builder master: Ensure yum-utils is installed in epel element  https://review.opendev.org/75601007:50
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660507:51
*** tosky has joined #opendev07:54
*** moppy has quit IRC08:01
*** moppy has joined #opendev08:02
*** ttx has joined #opendev08:02
*** DSpider has joined #opendev08:05
*** roman_g has joined #opendev08:19
*** hamalq has quit IRC08:26
*** hamalq has joined #opendev08:34
*** eolivare has quit IRC08:39
*** eolivare has joined #opendev08:39
*** fressi has quit IRC08:54
*** ysandeep is now known as ysandeep|lunch08:55
*** fressi has joined #opendev09:01
*** johnsom has quit IRC09:25
*** johnsom has joined #opendev09:25
*** ysandeep|lunch is now known as ysandeep09:27
*** dulek has quit IRC09:58
*** dulek has joined #opendev09:58
*** hamalq has quit IRC10:03
*** hashar has quit IRC10:21
openstackgerritCarlos Goncalves proposed openstack/project-config master: Add nested-virt-centos-8 label  https://review.opendev.org/75667410:28
openstackgerritCarlos Goncalves proposed openstack/project-config master: Add nested-virt-centos-8-stream label  https://review.opendev.org/75667410:32
*** paramite has joined #opendev11:06
*** lpetrut has joined #opendev11:25
*** hashar has joined #opendev12:49
openstackgerritNicolas Alvarez proposed openstack/project-config master: Add initial files to project-config repo.  https://review.opendev.org/75671714:02
openstackgerritNicolas Alvarez proposed openstack/project-config master: Add initial files to project-config repo.  https://review.opendev.org/75671714:04
*** lpetrut has quit IRC14:21
*** hashar has quit IRC14:27
openstackgerritJavier Peña proposed openstack/project-config master: Create acls for rpm-packaging-release group  https://review.opendev.org/75672214:29
*** slaweq has quit IRC14:38
*** slaweq has joined #opendev14:42
openstackgerritNicolas Alvarez proposed openstack/project-config master: Add SNMP Armada app to StarlingX.  https://review.opendev.org/75671714:50
openstackgerritJeremy Stanley proposed opendev/storyboard master: Be explicit about base container image  https://review.opendev.org/71522914:52
*** ykarel is now known as ykarel|away14:57
*** priteau has joined #opendev14:58
*** mlavalle has joined #opendev15:03
*** ysandeep is now known as ysandeep|away15:04
openstackgerritMerged openstack/project-config master: Create acls for rpm-packaging-release group  https://review.opendev.org/75672215:12
*** ykarel|away has quit IRC15:24
*** hashar has joined #opendev15:43
*** marios is now known as marios|out15:55
*** rpittau is now known as rpittau|afk16:01
*** marios|out has quit IRC16:06
*** yoctozepto has quit IRC16:16
*** yoctozepto has joined #opendev16:17
*** eolivare has quit IRC16:17
*** eolivare has joined #opendev16:17
*** fressi has quit IRC16:20
*** eolivare_ has joined #opendev16:21
*** eolivare has quit IRC16:23
clarkbfor half a second I was thinking we could make our own dib chroot bases using builders of various distros in zuul jobs. Then I realized thats essentially what the container bootstrapping idea is and now I like the idea even more :)16:56
*** mlavalle has quit IRC17:08
*** mlavalle has joined #opendev17:09
*** auristor has quit IRC17:17
*** auristor has joined #opendev17:23
*** Topner has joined #opendev18:03
*** Topner has quit IRC18:07
*** roman_g has quit IRC18:10
*** qchris has quit IRC18:21
*** qchris has joined #opendev18:55
*** andrewbonney has quit IRC19:00
*** roman_g has joined #opendev19:20
*** priteau has quit IRC19:31
*** ralonsoh has quit IRC19:50
*** iurygregory has quit IRC20:00
*** iurygregory has joined #opendev20:02
*** slaweq has quit IRC20:26
*** roman_g has quit IRC20:53
*** roman_g has joined #opendev20:53
*** roman_g has quit IRC20:54
*** roman_g has joined #opendev20:54
*** roman_g has quit IRC20:54
*** roman_g has joined #opendev20:55
*** roman_g has joined #opendev20:56
*** roman_g has quit IRC20:56
ianwyeah i'll keep poking, i feel like there was talk the base element was working but not the bootloader bits, maybe?  i'm sure we can get something working20:57
*** roman_g has joined #opendev20:57
*** roman_g has joined #opendev20:57
*** roman_g has quit IRC20:58
*** roman_g has joined #opendev20:58
clarkbI don't recall20:58
*** roman_g has quit IRC20:58
clarkbthe notedb migration is not fast if anyone is wondering20:59
clarkbits been running for 4.5 hours now20:59
*** roman_g has joined #opendev20:59
clarkbit complained about one change so far because it doesn't have any patchsets and it bascially says meh in that case20:59
*** roman_g has quit IRC20:59
clarkb(I think we can live with that20:59
*** roman_g has joined #opendev21:00
*** roman_g has quit IRC21:00
fungithat change is similarly inaccessible in production anyway21:00
*** roman_g has joined #opendev21:01
*** roman_g has quit IRC21:01
clarkbThe rough upgrade plan I'm starting to formulate in my head is long weekend outage with upgrade and checkpoint at 2.16 pre notedb so that we can fall back there if necessary, but try and get to 3.221:01
*** roman_g has joined #opendev21:01
*** roman_g has quit IRC21:02
fungior 3.3 if it's ga by then21:02
clarkbon day one we can do that 2.13 backups, and similar prep work, then upgrade to 2.16 pretty quickly giving time to turn on 2.16 but maybe not expose it publicly and if working do another set of backups there. Then kick off notedb migration and go enjoy the evening/sleep21:02
clarkbthen hopeflly on day 2 we can do the 2.16 -> 3.2 upgrade process21:03
clarkb(granted still have more to learn about the upgrade at this point but that is roughly what I'm thinking right now)21:03
fungiand then roll back to 2.16 if absolutely necessary21:11
clarkbya21:11
ianw++ it will be a big relief to have it upgraded21:13
ianwfungi/clarkb: if you have sec to check the borg backup server addition that i didn't miss anything would be great -> https://review.opendev.org/75660721:15
ianwif no objections i might try borg backups on review-test?21:15
clarkbianw: I think using review-test would be fine but I've put it in emergency to avoid having its docker-compose file updated or gerrit config files updated21:16
clarkbso you may need to manually trigger things also might be good to wait for when we're done thrashing the disk :)21:16
clarkbI can give you an all clear for that assuming this finishes before I go to bed :)21:17
ianwok, no rush :)21:18
ianwclarkb: i ended up with a small stack from https://review.opendev.org/#/c/756605/ to implement collecting docker container logs21:19
ianwi think it's quite compelling for gitea, where currently we're using json logging and we destroy the container logs when they're refreshed21:19
ianwsince we're trying to capture backtraces etc.21:20
ianw(that's -1 because it looks for the gitea ssh log, which isn't there .. i think it only gets created when a line is logged so we might need to exercise port 222 in the test)21:20
clarkbreading the commit message that sounds great21:20
clarkbI'll try to make time to review it in a bit21:20
ianwhaha no, it will probably work now that i've spelt "/var/log/containers" correctly21:22
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660521:23
fungisometimes that does matter21:23
openstackgerritIan Wienand proposed opendev/system-config master: docker: install rsyslog to capture container output  https://review.opendev.org/75660521:25
*** eolivare_ has quit IRC21:29
ianwit would be pretty easy to send the container logs somewhere central too21:32
ianwsplunk was a big deal when i was at vmware, basically it's logstash21:33
*** hamalq has joined #opendev21:58
clarkbianw: in your logging change instead of needing to edit docker-compose for every container can we set up our docker json config file to do that for all containers by default? or does the docker- prefix addition require we do it more explicitly?22:10
*** hashar has quit IRC22:11
ianwclarkb: yeah, i think to have the consistent docker-<sensible-container-name> we need to do it for each container individually22:13
clarkband I guess without a consistent prefix we can't have the auto dumping to specific files22:13
clarkbhrm22:13
clarkbI guess I can live with the extra config then22:14
*** qchris has quit IRC22:42
clarkbbased on how long the reindex at the end of the note db migration is taking compared to the one we did as part of the 2.16 upgrade I think the aggressive git gc really makes a big difference22:43
clarkbunfortunately we can't do a gc here because its all baked into the one command22:43
clarkbwe are at 63% and it started about an hour and 20 minutes ago. The reindex we did as part of the 2.16 upgrade took 35 minutes22:44
clarkbit has also complained about a couple changes but look at them in prod I think the problems were preexisting22:44
clarkbhttps://review.opendev.org/#/c/463526/ is one example and if you load them up and try to view a list of patchsets you never get one22:44
clarkbthe reindexing seems to fail on patchsets that it can't find22:45
clarkb(the change is abandoned too which makes me wonder if something weird happened there)22:45
clarkbwhen this is done my next step is going to be another round of gc'ing so that the 3.0 3.1 and 3.2 upgrades can go more smoothly22:46
ianwwhat is it with the specs repo ... i remember a bunch of issues there at some point22:46
ianw2018-03-07 nova-specs recovery22:47
ianwhttp://lists.openstack.org/pipermail/openstack-dev/2017-June/119166.html22:47
clarkbya it could be related to that I suppose22:47
clarkbiirc they corrupted the repo somehow and github complained but gerrit didn't22:47
clarkbmaybe newer gerrit now is mad about it22:47
clarkbalso as soon as i start talking about this stuff it complains about https://review.opendev.org/#/c/628296/22:47
clarkbonly a mergeability check on ^ though22:48
ianwthat's a huge bunch of renames22:48
clarkbya I bet it hit a timeout and isn't a real error22:49
ianwyeah, i'd consider nova-specs a very suspect repo22:49
clarkbbecause we're running with like 16 threads and the disk is very busy right now reindexing a timeout or 3 isn't unexpected22:49
clarkbwe can always trigger an online reindex later and have it update things more slowly once we're steady state post upgrade22:50
clarkbI plan to be around this evenign due to the rax db outages22:51
clarkbso I'll also try to get the 3.2 upgrade path on -test as far as I can22:51
clarkbthen hopefully tomorrow can be spent double checking repos like nova-specs and openstack/openstack as well as beginning the testing of replication to gitea (I plan to hold a system-config-run-gitea job node and replicate to it) and other things like cla signing and general git review comaptibility and change pushing22:52
clarkbso many things22:52
*** qchris has joined #opendev22:53
clarkbI also need to go through my notes and start drafting WIP changes for the things we need to update in gerrit config (disable change.move for example), the apache config, and I forget what else22:53
*** mlavalle has quit IRC22:54
clarkblatest thing it is mad about is https://review.opendev.org/#/c/11544/ which is a zuul change that also errors trying to load it in prod22:58
clarkbseems like so far every issue (of which we've had maybe 6 out of 550k) has a semi reasonable explanation even if it isn't ideal22:58
*** tosky has quit IRC22:59
clarkbhttps://review.opendev.org/#/c/22128/ is another with the patchset discovery problem23:01
clarkbps2 seems to not exist for ^23:02
clarkbthere are fewer than 600 changes to reindex now I assume all for nova23:51
*** DSpider has quit IRC23:55

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!