It seems the EventStreams service has started accumulating enormous amounts of memory on SCB nodes in CODFW. Just one example from scb2002:
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 4232 eventst+ 20 0 9927.1m 66888 22488 S 2.3 0.2 0:14.11 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 4044 eventst+ 20 0 9868.6m 65568 20288 S 0.3 0.2 0:05.46 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 25405 eventst+ 20 0 3106056 27476 7228 S 0.3 0.1 9:07.04 /usr/bin/nodejs src/server.js -c /etc/eventstreams/config.yaml 3545 eventst+ 20 0 10.285g 694560 22404 S 0.0 2.1 1:28.05 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 3577 eventst+ 20 0 10.455g 872428 22636 S 0.0 2.7 3:42.07 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 3597 eventst+ 20 0 10.276g 685192 22628 S 0.0 2.1 1:34.94 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 3624 eventst+ 20 0 9866.7m 64108 20132 S 0.0 0.2 0:05.51 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 3637 eventst+ 20 0 9918.4m 66332 22692 S 0.0 0.2 0:18.12 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 3798 eventst+ 20 0 9925.2m 75112 22628 S 0.0 0.2 0:36.99 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 3908 eventst+ 20 0 10.301g 712684 22764 S 0.0 2.2 4:52.33 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 3953 eventst+ 20 0 9922.2m 69192 22696 S 0.0 0.2 0:20.33 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 3985 eventst+ 20 0 9922.1m 70896 22396 S 0.0 0.2 0:23.61 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 4058 eventst+ 20 0 11.166g 1.543g 22592 S 0.0 4.9 3:47.59 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 4177 eventst+ 20 0 10.291g 700132 22552 S 0.0 2.1 1:34.04 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 4186 eventst+ 20 0 9919.1m 65492 22756 S 0.0 0.2 0:06.58 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 4209 eventst+ 20 0 10.901g 1.279g 22568 S 0.0 4.1 5:35.37 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 4225 eventst+ 20 0 10.658g 1.035g 22560 S 0.0 3.3 3:46.86 /usr/bin/nodejs /srv/deployment/eventstreams/deploy-cache/revs/6b013f9a10a7d96b40225ab313b22cb4a19e03d2/node_modules/service-runner/service-r+ 25402 eventst+ 20 0 18272 240 4 S 0.0 0.0 0:00.00 /usr/bin/firejail --blacklist=/root --blacklist=/home --caps --seccomp /usr/bin/nodejs src/server.js -c /etc/eventstreams/config.yaml 25403 eventst+ 20 0 18272 268 0 S 0.0 0.0 0:00.00 /usr/bin/firejail --blacklist=/root --blacklist=/home --caps --seccomp /usr/bin/nodejs src/server.js -c /etc/eventstreams/config.yaml
Note the processes with 2+% mem (some reaching 5%) ! This has caused, only today, all of the services to crash on scb200[236]. The same does not occur in EQIAD, mysteriously.