Datum a čas: 2020-10-09 10:09 CEST Očekavaná délka: 35 minut Oznámení se týká serverů: node15.prg Typ výpadku: reset Důvod: Sluzby jadra prilis pomale Výpadek řeší: Pavel Šnajdr
Pravdepodobne kvuli vzrustajicimu poctu zombie procesu v par kontejnerech :(
ENGLISH: Date and time: 2020-10-09 10:09 CEST Expected duration: 35 minutes Affected systems: node15.prg Outage type: reset Reason: Kernel services too slow to respond Handled by: Pavel Šnajdr
Increasing number of zombie processes in few containers caused the node to slow down in responses to a crawl.
-----BEGIN BASE64 ENCODED PARSEABLE JSON----- eyJpZCI6NjkxLCJwbGFubmVkIjpmYWxzZSwiYmVnaW5zX2F0IjoiMjAyMC0x MC0wOVQxMDowOTowMCswMjowMCIsImR1cmF0aW9uIjozNSwidHlwZSI6InJl c2V0IiwiZW50aXRpZXMiOlt7Im5hbWUiOiJOb2RlIiwiaWQiOjExNiwibGFi ZWwiOiJub2RlMTUucHJnIn1dLCJoYW5kbGVycyI6WyJQYXZlbCDFoG5hamRy Il0sInRyYW5zbGF0aW9ucyI6eyJlbiI6eyJzdW1tYXJ5IjoiS2VybmVsIHNl cnZpY2VzIHRvbyBzbG93IHRvIHJlc3BvbmQiLCJkZXNjcmlwdGlvbiI6Iklu Y3JlYXNpbmcgbnVtYmVyIG9mIHpvbWJpZSBwcm9jZXNzZXMgaW4gZmV3IGNv bnRhaW5lcnMgY2F1c2VkIHRoZSBub2RlIHRvIHNsb3cgZG93biBpbiByZXNw b25zZXMgdG8gYSBjcmF3bC4ifSwiY3MiOnsic3VtbWFyeSI6IlNsdXpieSBq YWRyYSBwcmlsaXMgcG9tYWxlIiwiZGVzY3JpcHRpb24iOiJQcmF2ZGVwb2Rv Ym5lIGt2dWxpIHZ6cnVzdGFqaWNpbXUgcG9jdHUgem9tYmllIHByb2Nlc3Ug diBwYXIga29udGVqbmVyZWNoIDooIn19fQ== -----END BASE64 ENCODED PARSEABLE JSON-----