
High memory usage in redis #715

Closed · mauri870 opened this issue Dec 3, 2019 · 18 comments

Comments
mauri870 commented Dec 3, 2019

  • Horizon Version: v3.4.3
  • Laravel Version: v6.6.0
  • PHP Version: 7.3.11
  • Redis Driver & Version: predis 1.1.1 or the phpredis extension 5.1.1 (same result with both)
  • Database Driver & Version:

Description:

After we upgraded from Laravel 5.8 to 6.6 and Horizon 3.2.2 to 3.4.3, the memory consumption of our Redis 4 server started to grow exponentially and now sits at around 5 GB.

Horizon Dashboard:
[screenshot: horizon-dashboard]

Redis instance dedicated to horizon:
[screenshot: redis-usage]

Previous version vs new (the release was pushed Nov 25):

[screenshot: Screenshot_1575402597]

Steps To Reproduce:

We have no clue what is causing this behavior. Our number of jobs is almost the same as before; the only difference is the framework and Horizon versions.

SDekkers (Contributor) commented Dec 4, 2019

I have the same issue: finished jobs appear not to be removed from Redis memory, resulting in high memory usage. We run about 500k jobs per day.

What are your trim values in horizon.php?

mauri870 (Author) commented Dec 4, 2019

    'trim' => [
        'recent' => 30,
        'recent_failed' => 30,
        'failed' => 60,
        'monitored' => 0
    ],

I can't find a trim option for completed jobs, though.

mauri870 changed the title from "High cpu and memory usage in redis" to "High memory usage in redis" on Dec 4, 2019
mauri870 (Author) commented Dec 4, 2019

The problem seems to be that trim.recent is used both to expire jobs that have been pushed (but not yet processed) and jobs that are already completed. Maybe a new trim.completed option should be added to expire completed jobs without losing jobs that are still in the queue.

mauri870 (Author) commented Dec 4, 2019

For now, our solution was to simply expire the completed job's payload to free up some memory:

// Expire the completed job's Horizon payload one minute after processing.
Queue::after(function (JobProcessed $event) {
    Redis::expireat(config('horizon.prefix') . $event->job->getJobId(), Carbon::now()->addMinute()->timestamp);
});
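
For anyone copying this workaround: the callback has to be registered during application boot. A minimal sketch of one possible placement, assuming the default Redis connection reaches the same keys Horizon writes (the AppServiceProvider location and the imports are our illustration, not part of the original snippet):

    <?php

    namespace App\Providers;

    use Carbon\Carbon;
    use Illuminate\Queue\Events\JobProcessed;
    use Illuminate\Support\Facades\Queue;
    use Illuminate\Support\Facades\Redis;
    use Illuminate\Support\ServiceProvider;

    class AppServiceProvider extends ServiceProvider
    {
        public function boot()
        {
            // After each job finishes, expire its Horizon payload key one
            // minute from now instead of waiting for trim.recent to elapse.
            Queue::after(function (JobProcessed $event) {
                Redis::expireat(
                    config('horizon.prefix') . $event->job->getJobId(),
                    Carbon::now()->addMinute()->timestamp
                );
            });
        }
    }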

mauri870 (Author) commented Dec 4, 2019

That's our fix in a Horizon fork, with trim.completed set to 1:

[screenshot: Screenshot_1575487248]

diff --git a/config/horizon.php b/config/horizon.php
index b9803a8..318a945 100644
--- a/config/horizon.php
+++ b/config/horizon.php
@@ -98,6 +98,7 @@ return [
         'recent_failed' => 10080,
         'failed' => 10080,
         'monitored' => 10080,
+        'completed' => 60,
     ],
 
     /*
diff --git a/src/Repositories/RedisJobRepository.php b/src/Repositories/RedisJobRepository.php
index 171b040..dfb2cac 100644
--- a/src/Repositories/RedisJobRepository.php
+++ b/src/Repositories/RedisJobRepository.php
@@ -66,6 +66,7 @@ class RedisJobRepository implements JobRepository
     {
         $this->redis = $redis;
         $this->recentJobExpires = config('horizon.trim.recent', 60);
+        $this->completedJobExpires = config('horizon.trim.completed', 60);
         $this->failedJobExpires = config('horizon.trim.failed', 10080);
         $this->recentFailedJobExpires = config('horizon.trim.recent_failed', $this->failedJobExpires);
         $this->monitoredJobExpires = config('horizon.trim.monitored', 10080);
@@ -405,7 +406,7 @@ class RedisJobRepository implements JobRepository
             ? $pipe->hmset($id, ['status' => 'failed'])
             : $pipe->hmset($id, ['status' => 'completed', 'completed_at' => str_replace(',', '.', microtime(true))]);
 
-        $pipe->expireat($id, Chronos::now()->addMinutes($this->recentJobExpires)->getTimestamp());
+        $pipe->expireat($id, Chronos::now()->addMinutes($this->completedJobExpires)->getTimestamp());
     }
 
     /**
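
With the fork applied, completed-job retention becomes tunable in config/horizon.php independently of pending jobs. For illustration, the trim block that the "trim.completed of 1" setup implies (values in minutes; everything except 'completed' is left at the defaults visible in the diff above):

    'trim' => [
        'recent'        => 60,
        'recent_failed' => 10080,
        'failed'        => 10080,
        'monitored'     => 10080,
        'completed'     => 1, // minutes to keep completed job payloads
    ],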

travisaustin commented Dec 4, 2019

I have similar issues. I was able to get jobs to expire correctly by setting horizon.trim.recent in my config. (Check the __construct() function in RedisJobRepository.php to see where it's accessed.)

That said, I still have an issue of memory slowly filling up, and I think it's because keys are left behind in the horizon:recent:TAGNAME Redis keys. Right now, for example, I only have about 100 recent jobs listed, but the horizon:recent:TAGNAME keys contain over 2,000,000 entries, all referencing IDs of long-expired jobs.
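
(For illustration, one way to confirm this from a Tinker session. A sketch that assumes the default "horizon:" prefix, no extra key prefix on the Redis connection, and "my-tag" as a placeholder tag name:)

    use Illuminate\Support\Facades\Redis;

    // horizon:recent:TAGNAME is a sorted set of job IDs; ZCARD counts its entries.
    $entries = Redis::zcard('horizon:recent:my-tag');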

driesvints (Member)

Please see #625

Are you all monitoring tags?

travisaustin commented Dec 5, 2019 via email

mauri870 (Author) commented Dec 5, 2019

Neither am I. The problem is indeed that Horizon does not clean up completed jobs until trim.recent expires. In our case, with 440k jobs every 30 minutes, the completed jobs cause Redis to fill up memory quickly and also increase CPU usage due to the sheer number of keys. Please refer to the diff above, which introduces a mechanism to control how long completed jobs are persisted.

#715 (comment)

travisaustin commented Dec 5, 2019

I think there are two issues here.

First, as reported by @mauri870, completed jobs are retained for one week by default. This is easily solved by using the undocumented configuration option horizon.trim.recent. @mauri870 - I don't think your diff is necessary if you set the configuration item horizon.trim.recent to a low value. Is that correct?

Second, all new job IDs are added to the key horizon:recent:TAGNAMEHERE (where TAGNAMEHERE is the name of a tag). Even if these tags are not monitored, these keys fill with the job ID of every job that is dispatched with that tag. Horizon never cleans out this list of job IDs, and these keys continue to grow until they are manually cleared.

Edit: there are two places that fill up: horizon:recent:TAGNAMEHERE and horizon:failed:TAGNAMEHERE.

Edit again: I just realized that the configuration option horizon.trim.recent sets the TTL on the Redis job payload when it's created. If the job isn't processed before horizon.trim.recent expires, the job payload will disappear from Redis before it can be processed. Am I understanding that right?

mauri870 (Author) commented Dec 6, 2019

@travisaustin I think you are, at least from reading the source code. That's why I added trim.completed in my fork; it's working as expected now.

#715 (comment)

themsaid (Member) commented Dec 9, 2019

A solution is proposed in #720

eKevinHoang commented Jan 13, 2020

I have the same issue. My config is:

    'waits' => [
        'redis:default' => 60,
    ],

    /*
    |--------------------------------------------------------------------------
    | Job Trimming Times
    |--------------------------------------------------------------------------
    |
    | Here you can configure for how long (in minutes) you desire Horizon to
    | persist the recent and failed jobs. Typically, recent jobs are kept
    | for one hour while all failed jobs are stored for an entire week.
    |
    */

    'trim' => [
        'recent'        => 60,
        'recent_failed' => 10080,
        'failed'        => 10080,
        'monitored'     => 10080,
    ],

I used gdb to dump the memory of a horizon:work process, and I can see that many job payloads are not released from memory even hours after the jobs have finished. It seems to be an issue with the JobMetrics feature.

My horizon version: v3.4.3
My Laravel version: v6.6

TheOneDaveYoung

I'm not understanding why #720 was closed. It seems to me that the current situation, with very high and possibly runaway resource utilization by Redis, is a larger issue than possibly wonky pagination. Am I missing something here?

@mauri870 it's been a couple of months since your forked solution. How is it holding up, and are you experiencing the pagination issues discussed in #720?

mauri870 (Author)

@TheOneDaveYoung I don't know why Taylor closed that PR; he mentioned that something was not right with pagination. But after we switched to my fork, the OOM problems ceased and everything seems to be working great. It's been more than two months now without problems.

driesvints (Member)

#720 was merged. This will unfortunately break pagination. We're currently considering separating the different types of jobs into separate screens to solve this problem.

xwiz commented Aug 31, 2020

Still kind of experiencing this issue. I noticed Horizon uses sorted sets and hashes. Maybe there are some sane tunings one could use to improve memory consumption and performance, since by default, if you're running millions of jobs, these generic data types waste a lot of memory.
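
(For illustration only: Redis's standard encoding thresholds are presumably the kind of tuning meant here. These redis.conf directives are real, and the values shown are the Redis defaults; actual values would need benchmarking against your payload sizes.)

    # Small hashes and sorted sets stay in the compact ziplist encoding while
    # under these thresholds; fields larger than *-value bytes (common for
    # Horizon job payloads) force the costlier hashtable/skiplist encodings.
    hash-max-ziplist-entries 128
    hash-max-ziplist-value 64
    zset-max-ziplist-entries 128
    zset-max-ziplist-value 64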

SumitChowjar

I am also struggling with high memory usage, and my system crashed.

Laravel: v8.35.1. Horizon: v5.7

I have around 40k records. I chunk them into groups of 300 records and process each chunk via a job.

Users::chunk(300, function ($users) {
    // Dispatch a job for this chunk of 300 users.
    MigrateUsers::dispatch($users->all());
});
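
(An aside, not from the original comment: dispatching $users->all() serializes 300 hydrated models into every queued payload, which inflates Redis memory by itself. A common alternative, sketched below on the assumption that MigrateUsers could be changed to accept an array of IDs and re-query them, is to dispatch only primary keys:)

    Users::chunkById(300, function ($users) {
        // Each payload now holds 300 integers instead of 300 serialized
        // models; MigrateUsers would re-fetch the users by ID when it runs.
        MigrateUsers::dispatch($users->pluck('id')->all());
    });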

My horizon config is:

'waits' => [
    'redis:default' => 60,
],

'trim' => [
    'recent'        => 60,
    'pending'       => 2880,
    'completed'     => 60,
    'recent_failed' => 1440,
    'failed'        => 2880,
    'monitored'     => 2880,
],

'memory_limit' => 128,

'defaults' => [
    'supervisor-1' => [
        'connection' => 'redis',
        'queue' => ['default'],
        'balance' => 'auto',
        'minProcesses' => 1,
        'maxProcesses' => 2,
        'memory' => 128,
        'tries' => 2,
        'nice' => 0,
    ],
],

'environments' => [
    'local' => [
        'supervisor-1' => [
            'maxProcesses' => 2,
            'balanceMaxShift' => 1,
            'balanceCooldown' => 3,
            'timeout' => 900, // Timeout after 15 minutes
        ],
    ],
],

And during this process, 7 to 8 GB of RAM is consumed and the system reboots partway through.

@mauri870
