fix goroutine leaks in events and logs backend

When running a single podman logs this is not really important since we will exit when we finish reading the logs. However for the system service this is very important. Leaking goroutines will cause an increased memory and CPU ussage over time. Both the the event and log backend have goroutine leaks with both the file and journald drivers. The journald backend has the problem that journal.Wait(IndefiniteWait) will block until we get a new journald event. So when a client closes the connection the goroutine would still wait until there is a new journal entry. To fix this we just wait for a maximum of 5 seconds, after that we can check if the client connection was closed and exit correctly in this case. For the file backend we can fix this by waiting for either the log line or context cancel at the same time. Currently it would block waiting for new log lines and only check afterwards if the client closed the connection and thus hang forever if there are no new log lines. [NO NEW TESTS NEEDED] I am open to ideas how we can test memory leaks in CI. To test manually run a container like this: `podman run --log-driver $driver --name test -d alpine sh -c 'i=1; while [ "$i" -ne 1000 ]; do echo "line $i"; i=$((i + 1)); done; sleep inf'` where `$driver` can be either `journald` or `k8s-file`. Then start the podman system service and use: `curl -m 1 --output - --unix-socket $XDG_RUNTIME_DIR/podman/podman.sock -v 'http://d/containers/test/logs?follow=1&since=0&stderr=1&stdout=1' &>/dev/null` to get the logs from the API and then it closes the connection after 1 second. Now run the curl command several times and check the memory usage of the service. Fixes #14879 Signed-off-by: Paul Holzinger <pholzing@redhat.com>
author: Paul Holzinger <pholzing@redhat.com> 2022-07-20 12:36:02 +0200
committer: Paul Holzinger <pholzing@redhat.com> 2022-07-20 12:55:34 +0200
commit: 4e72aa58604f9384c3d2913b16baa41f00d0a9e1 (patch)
tree: b399da0cebfa736c1d8a5bfcaa0265ec46cd4d34 /libpod/container_log.go
parent: 98b22e29c9a769020de1ae8d2bf9a7630e930e78 (diff)
download: podman-4e72aa58604f9384c3d2913b16baa41f00d0a9e1.tar.gz
podman-4e72aa58604f9384c3d2913b16baa41f00d0a9e1.tar.bz2
podman-4e72aa58604f9384c3d2913b16baa41f00d0a9e1.zip
1 files changed, 10 insertions, 4 deletions
diff --git a/libpod/container_log.go b/libpod/container_log.go
index a9e0fe065..c49b54eb1 100644
--- a/libpod/container_log.go
+++ b/libpod/container_log.go
@@ -10,6 +10,7 @@ import (
 	"github.com/containers/podman/v4/libpod/define"
 	"github.com/containers/podman/v4/libpod/events"
 	"github.com/containers/podman/v4/libpod/logs"
+	"github.com/nxadm/tail"
 	"github.com/nxadm/tail/watch"
 	"github.com/sirupsen/logrus"
 )
@@ -74,14 +75,19 @@ func (c *Container) readFromLogFile(ctx context.Context, options *logs.LogOption
 
 	go func() {
 		defer options.WaitGroup.Done()
-
-		for line := range t.Lines {
+		var line *tail.Line
+		var ok bool
+		for {
 			select {
 			case <-ctx.Done():
 				// the consumer has cancelled
+				t.Kill(errors.New("hangup by client"))
 				return
-			default:
-				// fallthrough
+			case line, ok = <-t.Lines:
+				if !ok {
+					// channel was closed
+					return
+				}
 			}
 			nll, err := logs.NewLogLine(line.Text)
 			if err != nil {
author	Paul Holzinger <pholzing@redhat.com>	2022-07-20 12:36:02 +0200
committer	Paul Holzinger <pholzing@redhat.com>	2022-07-20 12:55:34 +0200
commit	4e72aa58604f9384c3d2913b16baa41f00d0a9e1 (patch)
tree	b399da0cebfa736c1d8a5bfcaa0265ec46cd4d34 /libpod/container_log.go
parent	98b22e29c9a769020de1ae8d2bf9a7630e930e78 (diff)
download	podman-4e72aa58604f9384c3d2913b16baa41f00d0a9e1.tar.gz podman-4e72aa58604f9384c3d2913b16baa41f00d0a9e1.tar.bz2 podman-4e72aa58604f9384c3d2913b16baa41f00d0a9e1.zip