Sunday, 2018-01-14

*** lbragstad has joined #openstack-infra00:07
*** jkilpatr has joined #openstack-infra00:10
*** xarses has joined #openstack-infra00:23
*** tosky has quit IRC00:26
*** markvoelker has joined #openstack-infra00:36
*** spzala has joined #openstack-infra00:49
*** Apoorva has joined #openstack-infra00:53
*** spzala has quit IRC00:54
*** edmondsw has joined #openstack-infra00:59
*** edmondsw has quit IRC01:03
*** r-daneel has quit IRC01:03
*** hongbin_ has quit IRC01:04
*** hongbin has joined #openstack-infra01:05
*** hongbin has quit IRC01:05
*** hongbin has joined #openstack-infra01:06
*** Apoorva has quit IRC01:06
*** hongbin has quit IRC01:07
*** hongbin has joined #openstack-infra01:07
*** kmalloc has joined #openstack-infra01:07
*** hongbin has quit IRC01:07
*** hongbin has joined #openstack-infra01:08
<mnaser> another spamwave 01:08
*** markvoelker has quit IRC01:10
*** hongbin has quit IRC01:13
*** sbra has quit IRC01:31
*** sbra has joined #openstack-infra01:32
*** kiennt26 has joined #openstack-infra01:35
*** kiennt26 has quit IRC01:35
*** kiennt26 has joined #openstack-infra01:36
*** r-daneel has joined #openstack-infra01:58
*** Goneri has joined #openstack-infra01:59
*** xinliang has joined #openstack-infra02:02
*** markvoelker has joined #openstack-infra02:02
*** r-daneel has quit IRC02:05
*** Goneri has quit IRC02:16
*** armax has joined #openstack-infra02:41
*** jascott1 has quit IRC02:43
*** armax has quit IRC02:44
*** armax has joined #openstack-infra02:45
*** armax has quit IRC02:45
*** armax has joined #openstack-infra02:45
*** armax has quit IRC02:46
*** armax has joined #openstack-infra02:46
*** armax has quit IRC02:46
*** armax has joined #openstack-infra02:47
*** armax has quit IRC02:47
*** armax has joined #openstack-infra02:48
*** armax has quit IRC02:48
*** olaph has joined #openstack-infra03:00
*** spzala has joined #openstack-infra03:01
*** spzala has quit IRC03:06
*** yamamoto has joined #openstack-infra03:08
*** yamamoto has quit IRC03:12
*** yamamoto has joined #openstack-infra03:12
*** yamamoto has quit IRC03:17
*** bobh has joined #openstack-infra03:17
*** yamamoto has joined #openstack-infra03:17
*** olaph1 has joined #openstack-infra03:18
*** olaph has quit IRC03:18
*** claudiub has quit IRC03:18
*** yamamoto has quit IRC03:20
*** ijw has quit IRC03:23
*** kmalloc has quit IRC03:36
*** jascott1 has joined #openstack-infra03:44
*** kiennt26 has quit IRC03:44
*** ramishra has joined #openstack-infra04:17
*** yamamoto has joined #openstack-infra04:21
*** ihrachys has quit IRC04:24
*** bobh has quit IRC04:29
*** yamamoto has quit IRC04:29
*** yamamoto has joined #openstack-infra04:30
*** rosmaita has quit IRC04:33
*** rosmaita has joined #openstack-infra04:34
*** edmondsw has joined #openstack-infra04:35
*** yamamoto has quit IRC04:38
*** edmondsw has quit IRC04:39
*** armax has joined #openstack-infra04:41
*** spzala has joined #openstack-infra05:00
*** spzala has quit IRC05:01
*** markvoelker has quit IRC05:03
*** kiennt26 has joined #openstack-infra05:06
*** yamamoto has joined #openstack-infra05:08
*** yamamoto has quit IRC05:09
*** yamamoto has joined #openstack-infra05:09
*** yamamoto has quit IRC05:11
*** yamamoto has joined #openstack-infra05:12
*** yamamoto has quit IRC05:17
*** olaph has joined #openstack-infra05:23
*** ijw has joined #openstack-infra05:24
*** olaph1 has quit IRC05:25
*** ijw has quit IRC05:30
*** olaph1 has joined #openstack-infra05:38
*** olaph has quit IRC05:39
*** kiennt26 has quit IRC05:53
*** armax has quit IRC06:01
*** olaph has joined #openstack-infra06:03
*** olaph1 has quit IRC06:05
*** armaan has joined #openstack-infra06:14
*** yamamoto has joined #openstack-infra06:17
*** edmondsw has joined #openstack-infra06:23
*** yamamoto has quit IRC06:25
*** dsariel has joined #openstack-infra06:26
*** edmondsw has quit IRC06:28
*** dbecker has quit IRC06:31
*** dbecker has joined #openstack-infra06:36
*** larainema has quit IRC06:40
*** sbra has quit IRC06:41
*** sbra has joined #openstack-infra06:41
*** vsaienk0 has joined #openstack-infra06:46
*** vsaienk0 has quit IRC06:56
*** vsaienk0 has joined #openstack-infra07:01
*** markvoelker has joined #openstack-infra07:03
*** olaph1 has joined #openstack-infra07:10
*** vsaienk0 has quit IRC07:11
*** olaph has quit IRC07:12
*** r-daneel has joined #openstack-infra07:16
*** cshastri has joined #openstack-infra07:16
*** olaph has joined #openstack-infra07:17
*** olaph1 has quit IRC07:18
*** ijw has joined #openstack-infra07:26
*** psachin has joined #openstack-infra07:28
*** psachin has quit IRC07:31
*** psachin has joined #openstack-infra07:33
*** markvoelker has quit IRC07:38
*** psachin has quit IRC07:41
*** edmondsw has joined #openstack-infra08:11
*** edmondsw has quit IRC08:16
*** frickler has quit IRC08:23
*** ijw has quit IRC08:36
*** frickler has joined #openstack-infra08:37
*** lathiat has quit IRC08:52
*** lathiat has joined #openstack-infra08:57
*** sshnaidm|afk has quit IRC08:58
*** e0ne has joined #openstack-infra09:16
*** ykarel|away has joined #openstack-infra09:20
*** markvoelker has joined #openstack-infra09:35
*** e0ne has quit IRC09:35
*** ijw has joined #openstack-infra09:37
*** olaph1 has joined #openstack-infra09:37
*** olaph has quit IRC09:38
*** ijw has quit IRC09:42
*** bkero-lounge has joined #openstack-infra09:58
*** edmondsw has joined #openstack-infra09:59
*** edmondsw has quit IRC10:03
*** markvoelker has quit IRC10:09
*** slaweq has joined #openstack-infra10:10
*** pbourke has quit IRC10:20
*** pbourke has joined #openstack-infra10:21
*** olaph has joined #openstack-infra10:23
*** olaph1 has quit IRC10:24
*** olaph1 has joined #openstack-infra10:28
*** olaph has quit IRC10:30
*** pcichy has quit IRC10:33
*** pcichy has joined #openstack-infra10:33
*** adriant has quit IRC10:39
*** adriant has joined #openstack-infra10:41
*** adriant has quit IRC10:43
*** slaweq has quit IRC10:44
*** adriant has joined #openstack-infra10:45
*** claudiub has joined #openstack-infra10:48
*** yamamoto has joined #openstack-infra10:57
*** masber has quit IRC11:10
*** masber has joined #openstack-infra11:10
*** sshnaidm|afk has joined #openstack-infra11:13
*** yamamoto has quit IRC11:20
*** masber has quit IRC11:35
*** ijw has joined #openstack-infra11:37
*** olaph has joined #openstack-infra11:38
*** masuberu has joined #openstack-infra11:38
*** olaph1 has quit IRC11:39
*** edmondsw has joined #openstack-infra11:47
*** edmondsw has quit IRC11:52
*** yamamoto has joined #openstack-infra11:56
*** yamamoto has quit IRC12:04
*** olaph1 has joined #openstack-infra12:04
*** olaph has quit IRC12:05
*** markvoelker has joined #openstack-infra12:06
*** yamamoto has joined #openstack-infra12:07
*** yamamoto has quit IRC12:12
*** ijw has quit IRC12:22
*** slaweq has joined #openstack-infra12:23
*** dsariel has quit IRC12:29
*** dsariel has joined #openstack-infra12:31
*** markvoelker has quit IRC12:39
*** ramishra has quit IRC12:44
*** e0ne has joined #openstack-infra12:46
*** yamamoto has joined #openstack-infra12:57
*** dhill__ has quit IRC13:15
*** sshnaidm|afk has quit IRC13:18
*** e0ne has quit IRC13:19
*** tosky has joined #openstack-infra13:19
*** nicolasbock has joined #openstack-infra13:22
*** edmondsw has joined #openstack-infra13:35
*** e0ne has joined #openstack-infra13:36
*** dsariel has quit IRC13:36
*** e0ne has quit IRC13:37
*** edmondsw has quit IRC13:40
*** sshnaidm|afk has joined #openstack-infra13:48
*** ykarel|away has quit IRC13:54
*** psachin has joined #openstack-infra14:00
*** armaan has quit IRC14:00
*** armaan has joined #openstack-infra14:00
*** dsariel has joined #openstack-infra14:05
*** dhill_ has joined #openstack-infra14:22
*** ijw has joined #openstack-infra14:22
*** ijw has quit IRC14:27
*** dsariel has quit IRC14:30
*** markvoelker has joined #openstack-infra14:36
*** katkapilatova1 has joined #openstack-infra14:42
*** jascott1 has quit IRC14:57
*** markvoelker has quit IRC15:10
*** dsariel has joined #openstack-infra15:13
*** caphrim007_ has joined #openstack-infra15:18
*** pcichy has quit IRC15:19
*** pcichy has joined #openstack-infra15:19
*** freerunner has quit IRC15:19
*** gtmanfred has quit IRC15:20
*** tobiash has quit IRC15:20
*** caphrim007 has quit IRC15:21
*** freerunner has joined #openstack-infra15:21
*** gtmanfred has joined #openstack-infra15:22
*** tobiash has joined #openstack-infra15:22
*** swest has quit IRC15:23
*** swest has joined #openstack-infra15:24
*** dsariel has quit IRC15:34
*** dsariel has joined #openstack-infra15:43
*** armaan has quit IRC15:55
*** armaan has joined #openstack-infra15:55
*** psachin has quit IRC15:56
*** psachin has joined #openstack-infra15:58
*** armaan has quit IRC16:02
*** markvoelker has joined #openstack-infra16:07
*** psachin has quit IRC16:11
*** Goneri has joined #openstack-infra16:14
*** ociuhandu_ has joined #openstack-infra16:20
*** sbra has quit IRC16:20
*** sbra has joined #openstack-infra16:21
*** dsariel has quit IRC16:23
*** ijw has joined #openstack-infra16:23
*** dsariel has joined #openstack-infra16:29
*** dsariel has quit IRC16:34
*** psachin has joined #openstack-infra16:34
*** dsariel has joined #openstack-infra16:35
*** Goneri has quit IRC16:39
*** markvoelker has quit IRC16:40
*** ijw has quit IRC16:41
*** psachin has quit IRC16:43
<fungi> corvus: i'm just starting to diagnose now, but rndc zonestatus says the signed serial is older than the actual serial: http://paste.openstack.org/show/644245/ 16:44
<fungi> and syslog definitely shows the slaves doing an axfr of that older serial 16:46
<fungi> also named on the master logs that it's loading the newer unsigned serial and the older signed serial, so that seems consistent 16:47
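The check fungi describes above (signed serial older than the unsigned serial) can be sketched in a few lines. This is only an illustration: the sample text and its serial values are hypothetical, the "serial:" and "signed serial:" field names are assumed to match what `rndc zonestatus` prints, and the comparison is a plain integer one that ignores serial-number wraparound.

```python
def parse_zonestatus(text):
    """Collect 'key: value' lines from zonestatus-style output into a dict."""
    fields = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            fields[key.strip()] = value.strip()
    return fields


def signed_serial_is_stale(text):
    """Return True when the signed serial lags behind the unsigned serial."""
    fields = parse_zonestatus(text)
    return int(fields["signed serial"]) < int(fields["serial"])


# Hypothetical sample; the real output is in the paste linked above.
SAMPLE = """\
name: zuul-ci.org
type: master
serial: 2018011402
signed serial: 2018011401
"""

if __name__ == "__main__":
    print("stale signed zone:", signed_serial_is_stale(SAMPLE))
```

With inline signing the signed serial would normally be at or ahead of the unsigned one, so a True result here corresponds to the symptom seen on adns1.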
*** bobh has joined #openstack-infra16:48
*** psachin has joined #openstack-infra16:56
<fungi> digging through the puppet modules, it looks like we actually do a service bind9 restart each time a zonefile is modified 17:02
<fungi> i wonder if that's partly to blame, though if it is i'm not entirely sure how (service stop races the signature update maybe?) 17:04
<fungi> i'd like to stop puppet for adns1, manually bump the serial on the zone, and see what happens (then catch the git repo up with that serial bump) 17:05
<fungi> but first i have some chores i need to go do around here. will get back to this in a bit 17:05
<corvus> fungi: oh, interesting theory.  that sounds like a good plan. 17:07
*** cshastri has quit IRC17:08
*** dsariel has quit IRC17:08
*** edmondsw has joined #openstack-infra17:11
*** edmondsw has quit IRC17:16
*** psachin has quit IRC17:25
<prometheanfire> did gate restart between the 12th and now? https://review.openstack.org/532874 17:32
*** slaweq_ has joined #openstack-infra17:39
*** ijw has joined #openstack-infra17:42
*** ijw has quit IRC17:47
*** psachin has joined #openstack-infra17:53
*** slaweq_ has quit IRC17:59
*** nicolasbock has quit IRC18:08
*** nicolasbock has joined #openstack-infra18:09
*** psachin has quit IRC18:13
*** sshnaidm|afk is now known as sshnaidm18:27
*** nicolasbock has quit IRC18:28
*** bobh has quit IRC18:29
*** markvoelker has joined #openstack-infra18:37
*** ijw has joined #openstack-infra18:40
*** olaph1 is now known as olaph18:41
*** bobh has joined #openstack-infra18:41
*** yamamoto has quit IRC18:42
<AJaeger> prometheanfire: best check https://wiki.openstack.org/wiki/Infrastructure_Status 18:54
*** slaweq has quit IRC18:54
<AJaeger> prometheanfire: so, yes it did - and a recheck is needed on that one 18:55
<prometheanfire> k 18:55
*** edmondsw has joined #openstack-infra19:00
*** dbecker has quit IRC19:00
*** sree has joined #openstack-infra19:01
*** edmondsw has quit IRC19:04
*** panda has quit IRC19:05
*** sree has quit IRC19:06
*** panda has joined #openstack-infra19:06
*** sree has joined #openstack-infra19:08
*** markvoelker has quit IRC19:11
*** sree has quit IRC19:13
*** ijw has quit IRC19:15
*** sree has joined #openstack-infra19:16
*** markvoelker has joined #openstack-infra19:19
*** dbecker has joined #openstack-infra19:19
*** sree has quit IRC19:21
*** slaweq has joined #openstack-infra19:21
*** e0ne has joined #openstack-infra19:23
*** slaweq has quit IRC19:26
*** bobh has quit IRC19:41
*** yamamoto has joined #openstack-infra19:42
*** jamesmcarthur has joined #openstack-infra19:47
<fungi> corvus: not _conclusive_ but here's the zonestatus after editing /var/lib/bind/zones/zuul-ci.org/zone.db to increase the serial to 1515959169 and then running `sudo rndc reload zuul-ci.org`: http://paste.openstack.org/show/644247/ 19:48
<fungi> i think we probably need to replace the "notify  => Service[$::dns::namedservicename]," with an exec resource which does an rndc reload? 19:49
*** jamesmca_ has joined #openstack-infra19:49
<fungi> in the /var/lib/bind/zones/${name} file resource definition 19:49
*** yamamoto has quit IRC19:49
*** bobh has joined #openstack-infra19:50
* pabelanger subscribes to new MLs 19:51
<openstackgerrit> Jeremy Stanley proposed openstack-infra/zone-zuul-ci.org master: Update zone serial after manual testing  https://review.openstack.org/533429 19:51
<fungi> corvus: ^ 19:51
<fungi> infra-root: ^ i'm self-approving that so i can reenable puppet on adns1.o.o 19:52
<pabelanger> fungi: wfm 19:52
*** jamesmcarthur has quit IRC19:53
<openstackgerrit> Merged openstack-infra/zone-zuul-ci.org master: Update zone serial after manual testing  https://review.openstack.org/533429 19:53
*** olaph has quit IRC19:53
*** olaph has joined #openstack-infra19:53
*** bobh has quit IRC19:54
<fungi> okay, puppet reenabled for adns1 again 19:56
*** sree has joined #openstack-infra19:56
<fungi> i'll see if i can get a few minutes to work on the puppet fix in a bit 19:56
*** bobh has joined #openstack-infra19:59
*** sree has quit IRC20:01
*** bobh has quit IRC20:04
*** bobh has joined #openstack-infra20:08
*** bobh has quit IRC20:10
*** bobh has joined #openstack-infra20:10
*** sree has joined #openstack-infra20:15
*** sree has quit IRC20:19
*** sree has joined #openstack-infra20:20
*** sree has quit IRC20:25
<ssbarnea> is it normal to see POST_FAILURE? Got this a couple of minutes ago on https://review.openstack.org/#/c/533430/ --- did recheck, got same error. 20:25
*** dsariel has joined #openstack-infra20:27
<prometheanfire> I've been getting a bunch of them 20:28
<prometheanfire> https://review.openstack.org/532874 20:28
<clarkb> there is or was an issue with the tox jobs where proper failures result in post failure 20:30
<clarkb> check ara to see if the run.yaml fails 20:30
*** olaph1 has joined #openstack-infra20:31
*** sree has joined #openstack-infra20:33
*** olaph has quit IRC20:33
*** jamesmca_ has quit IRC20:34
*** pcichy has quit IRC20:37
*** sree has quit IRC20:38
*** jamesmcarthur has joined #openstack-infra20:42
<ssbarnea> somehow i see only some finger:// urls which are useless to me. i had the impression that this protocol was deprecated a long time ago. 20:45
*** slaweq has joined #openstack-infra20:45
*** edmondsw has joined #openstack-infra20:48
*** slaweq has quit IRC20:49
*** edmondsw has quit IRC20:52
*** jamesmcarthur has quit IRC20:56
*** olaph has joined #openstack-infra21:13
*** Goneri has joined #openstack-infra21:14
*** olaph1 has quit IRC21:15
*** ijw has joined #openstack-infra21:15
*** armaan has joined #openstack-infra21:27
*** rhallisey has quit IRC21:29
*** bobh has quit IRC21:30
*** aeng has joined #openstack-infra21:34
*** threestrands has joined #openstack-infra21:34
*** ijw has quit IRC21:34
*** Goneri has quit IRC21:35
*** Goneri has joined #openstack-infra21:37
*** bobh has joined #openstack-infra21:41
*** bobh has quit IRC21:45
*** ralonsoh has joined #openstack-infra21:49
*** ralonsoh has quit IRC21:49
*** jascott1 has joined #openstack-infra21:49
<hjensas> I see POST_FAILURES - there is a read-only filesystem it is trying to place the logs onto, it seems: http://paste.openstack.org/show/644296/ 21:59
*** pcaruana has quit IRC22:00
*** bobh has joined #openstack-infra22:01
*** armax has joined #openstack-infra22:03
*** bobh has quit IRC22:05
*** bobh has joined #openstack-infra22:10
*** slaweq has joined #openstack-infra22:11
*** bobh has quit IRC22:15
*** slaweq has quit IRC22:17
*** rlandy has joined #openstack-infra22:17
<fungi> hjensas: good eye. looks like the server lost contact with one of the cinder volumes that filesystem spans 22:17
<fungi> Jan 14 16:47:47 static kernel: [325191.787817] EXT4-fs (dm-2): Remounting filesystem read-only 22:18
*** rcernin has joined #openstack-infra22:18
<fungi> so ~5.5 hours ago 22:18
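The "~5.5 hours" figure can be checked from the two timestamps quoted above. A minimal sketch, assuming the kernel log timestamp and the channel log time are in the same timezone (the later alert gives the remount time as 16:47 UTC):

```python
from datetime import datetime

# Kernel message: filesystem remounted read-only.
remounted = datetime(2018, 1, 14, 16, 47, 47)
# Channel time at which the kernel message was quoted.
noticed = datetime(2018, 1, 14, 22, 18)

elapsed_hours = (noticed - remounted).total_seconds() / 3600

if __name__ == "__main__":
    print(f"{elapsed_hours:.1f} hours")
```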
<fungi> i'll unmount it and get the fsck started under a screen session 22:19
*** markvoelker has quit IRC22:19
<fungi> infra-root: ^ heads up, logical volume for logs.o.o is read-only due to write errors on /dev/xvdg 22:20
*** bobh has joined #openstack-infra22:21
<fungi> had to stop apache2 and kill the log maintenance cron script to be able to umount it 22:22
*** rlandy has quit IRC22:23
<fungi> though in retrospect, i should probably reboot the instance to make sure /dev/xvdg reattaches sanely 22:23
<fungi> i've commented out /srv/static/logs in /etc/fstab so it won't get mounted at boot 22:24
<fungi> #status log rebooted static.openstack.org to make sure disconnected volume /dev/xvdg reattaches correctly 22:25
<openstackstatus> fungi: finished logging 22:25
*** bobh has quit IRC22:25
<fungi> #status log a `fsck -y` of /dev/mapper/main-logs is underway in a root screen session on static.openstack.org 22:27
<openstackstatus> fungi: finished logging 22:27
*** bobh has joined #openstack-infra22:28
<ianw> fungi: ok ... ping if i can help 22:29
<fungi> "This message is to inform you that our monitoring systems have detected a problem with the server which hosts your Cloud Block Storage device 'static.openstack.org/main05' at 17:02 UTC." 22:30
<ianw> heh, thanks, problem noted :) 22:30
<fungi> according to subsequent updates, they got the device restored to normal 22:30
<fungi> so in theory we should be fine once this fsck completes in... some... hours 22:30
*** ameliac has joined #openstack-infra22:32
*** bobh has quit IRC22:32
<fungi> #status alert The filesystem for the logs.openstack.org site was marked read-only at 2018-01-14 16:47 UTC due to an outage incident at the service provider; a filesystem recovery is underway, but job logs uploaded between now and completion are unlikely to be retained so please refrain from rechecking due to POST_FAILURE results until this alert is rescinded. 22:33
<openstackstatus> fungi: sending alert 22:33
*** sree has joined #openstack-infra22:34
*** jkilpatr has quit IRC22:34
<ianw> fungi: if we moved /srv/static/logs somewhere, when we remount the cinder volume it might be easier to copy back these intermediate logs? 22:34
<ianw> i mean, make it a symlink for now, say? 22:34
<fungi> ianw: yes, we can in theory use rsync to copy the logs back in, but in the past when people have done that they've neglected to properly retain/fix up filesystem permissions 22:34
<fungi> and there's ~100gb of space in the volume mounted at /srv/static anyway 22:35
*** ijw has joined #openstack-infra22:35
-openstackstatus- NOTICE: The filesystem for the logs.openstack.org site was marked read-only at 2018-01-14 16:47 UTC due to an outage incident at the service provider; a filesystem recovery is underway, but job logs uploaded between now and completion are unlikely to be retained so please refrain from rechecking due to POST_FAILURE results until this alert is rescinded. 22:35
*** ChanServ changes topic to "The filesystem for the logs.openstack.org site was marked read-only at 2018-01-14 16:47 UTC due to an outage incident at the service provider; a filesystem recovery is underway, but job logs uploaded between now and completion are unlikely to be retained so please refrain from rechecking due to POST_FAILURE results until this alert is rescinded." 22:35
*** edmondsw has joined #openstack-infra22:36
<fungi> if whoever catches the tail end of the recovery wants to try and save the next few hours of logs, the logs volume can be mounted on a temporary mountpoint, rsync used to copy the newer logs into it, uploads temporarily disabled, a second rsync run to catch any stragglers, deletion of the contents of /srv/static/logs/, then unmount the logs volume and remount it there before reallowing uploads 22:37
*** sree has quit IRC22:38
<fungi> also, unmounting or mounting /srv/static/logs has to be done with apache2 service stopped since it will hold an open file descriptor in the docroot of the vhost 22:38
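The recovery sequence fungi outlines above can be written down as an ordered dry-run plan. This is a sketch, not the procedure that was actually run: the temporary mountpoint `/mnt/logs-recovered` is hypothetical, the step for disabling uploads is left as a comment because the log does not say how that is done, and `rsync -a` is chosen here because archive mode keeps permissions and ownership, which is the concern raised earlier. Nothing is executed; the function only returns command strings.

```python
def recovery_plan(device="/dev/mapper/main-logs",
                  live_dir="/srv/static/logs",
                  temp_mount="/mnt/logs-recovered"):  # hypothetical mountpoint
    """Return the recovery steps, in order, as shell command strings."""
    # Archive mode preserves permissions, ownership and timestamps.
    sync = f"rsync -a {live_dir}/ {temp_mount}/"
    return [
        f"mount {device} {temp_mount}",
        sync,                                  # first pass: bulk of the new logs
        "# disable log uploads (mechanism not stated in the log)",
        sync,                                  # second pass: stragglers
        "service apache2 stop",                # apache holds the docroot open
        f"find {live_dir} -mindepth 1 -delete",
        f"umount {temp_mount}",
        f"mount {device} {live_dir}",
        "service apache2 start",
        "# re-enable log uploads",
    ]


if __name__ == "__main__":
    for step in recovery_plan():
        print(step)
```

Keeping the plan as data makes it easy to review the ordering before anyone runs a single command by hand.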
<openstackstatus> fungi: finished sending alert 22:38
<ianw> i've got the screen open and will keep an eye 22:39
<fungi> no promises that rsync'ing into that volume will be even remotely timely 22:40
*** ijw has quit IRC22:40
<fungi> thanks for watching it ianw! 22:40
*** edmondsw has quit IRC22:41
*** bobh has joined #openstack-infra22:41
<ianw> no ... i'm pretty sure i have symlinked out the logs dir during this happening before, then moved things back in slowly 22:41
<ianw> but it's one more failure point.  let's just kiss 22:41
<fungi> might also not be a terrible idea to periodically check `df /srv/static` to make sure it doesn't fill up in the interim 22:42
<ianw> ok, seems like enough headroom (inodes too) 22:43
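The periodic `df` check (space and inodes) mentioned above could also be scripted. A minimal sketch using `os.statvfs`, which is available on Unix only; the function name and the returned keys are made up for this illustration, and `/srv/static` is the path from the discussion:

```python
import os


def headroom(path):
    """Report free space and free inodes for the filesystem holding path."""
    st = os.statvfs(path)
    return {
        "free_bytes": st.f_bavail * st.f_frsize,  # space usable by non-root
        "free_inodes": st.f_favail,               # inodes usable by non-root
        "total_inodes": st.f_files,               # may be 0 on some filesystems
    }


if __name__ == "__main__":
    target = "/srv/static" if os.path.isdir("/srv/static") else "/"
    info = headroom(target)
    print(target, f"{info['free_bytes'] / 1e9:.1f} GB free,",
          info["free_inodes"], "inodes free")
```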
*** armax has quit IRC22:45
*** bobh has quit IRC22:45
<fungi> i bet after recovery we could just mount the logs volume back at /srv/static/logs and then do a second temporary read-only mount of the static volume somewhere other than /srv/static so we have an unshadowed copy of the filesystem to copy out of 22:46
<fungi> but again, i'm not too concerned with preserving those 22:46
<fungi> more interested in making sure we get the logs volume back into operation as soon as we can manage 22:46
*** sree has joined #openstack-infra22:46
<ianw> ++ yep 22:48
*** bobh has joined #openstack-infra22:48
*** sree has quit IRC22:51
*** e0ne has quit IRC22:56
*** bobh has quit IRC22:58
*** bobh has joined #openstack-infra23:08
*** bobh has quit IRC23:12
*** bobh has joined #openstack-infra23:15
*** sree has joined #openstack-infra23:17
*** bobh has quit IRC23:20
*** sree has quit IRC23:21
*** sree has joined #openstack-infra23:39
*** sree has quit IRC23:44
*** dhajare has joined #openstack-infra23:48
*** bobh has joined #openstack-infra23:52
*** dingyichen has joined #openstack-infra23:55
*** bobh has quit IRC23:56
