51

While issuing a new build to update code in workers, how do I restart Celery workers gracefully?

Edit: What I intend to do is something like this (see the sketch after the list).

  • Worker is running, probably uploading a 100 MB file to S3
  • A new build comes
  • Worker code has changes
  • Build script fires a signal to the worker(s)
  • New workers are started with the new code
  • Worker(s) that got the signal exit after finishing their current job
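
In signal terms, roughly what I'm after (a sketch only; the pidfile path and the app name proj are made up, and TERM is Celery's documented warm-shutdown signal):

    # Warm shutdown: each worker finishes its current task, then exits
    kill -TERM $(cat /var/run/celery/worker.pid)
    # Start replacement workers from the freshly deployed code
    celery worker -A proj --detach --pidfile=/var/run/celery/worker-new.pid -l info
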
Quintin Par

9 Answers

51

According to https://docs.celeryq.dev/en/stable/userguide/workers.html#restarting-the-worker you can restart a worker by sending a HUP signal

 ps auxww | grep celeryd | grep -v "grep" | awk '{print $2}' | xargs kill -HUP
armonge
    `sudo ps auxww | grep celeryd | grep -v "grep" | awk '{print $2}' | sudo xargs kill -HUP` exclude grep :-) – Quintin Par Mar 24 '12 at 07:59
  • 5
    You can replace grep celeryd | grep -v "grep" with grep [c]eleryd. Just saying. – chanux Oct 11 '13 at 07:42
  • 4
    It seems that it is not a graceful restart, is it? As the docs say: "Other than stopping then starting the worker to restart, you can also restart the worker using the HUP signal, but note that the worker will be responsible for restarting itself so this is prone to problems and is not recommended in production" So what is the best way to reload Celery in production to avoid failures? – mennanov Oct 08 '14 at 11:57
  • 2
    For celery multi: "For production deployments you should be using init scripts or other process supervision systems". As for HUP: "this is prone to problems and **is not recommended in production**" – webjunkie Aug 02 '16 at 10:12
  • 1
    The celery documentation appears to contradict itself on this subject; here it says don't use `celery multi` in production, but in the daemonization section the example systemd config file uses `celery multi`. – GDorn Jan 11 '21 at 17:27
15
celery multi start 1 -A proj -l info -c4 --pidfile=/var/run/celery/%n.pid
celery multi restart 1 --pidfile=/var/run/celery/%n.pid

http://docs.celeryproject.org/en/latest/userguide/workers.html#restarting-the-worker
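
As the comments note, the daemonization guide wraps these same celery multi commands in a systemd unit for production use. A simplified sketch of that pattern (the binary path, node name w1, app name proj, and pidfile location are all placeholders; the docs' real example uses an EnvironmentFile):

    [Unit]
    Description=Celery worker
    After=network.target

    [Service]
    Type=forking
    # celery multi forks and exits, hence Type=forking
    ExecStart=/usr/local/bin/celery multi start w1 -A proj --pidfile=/var/run/celery/w1.pid
    ExecStop=/usr/local/bin/celery multi stopwait w1 --pidfile=/var/run/celery/w1.pid
    ExecReload=/usr/local/bin/celery multi restart w1 --pidfile=/var/run/celery/w1.pid

    [Install]
    WantedBy=multi-user.target

A deploy can then run sudo systemctl reload celery, which triggers the ExecReload line under a proper supervisor.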

zengr
  • 5
    Uugh... it says right there "The easiest way to manage workers for *development* is by using celery multi. For **production deployments** you should be using *init scripts or other process supervision systems*". This answer does not apply to running in production! – webjunkie Aug 02 '16 at 10:09
  • 1
    @webjunkie The OP didn't say "in production deployment", so I'm not sure why you would downvote it when that was not mentioned in the original question. – zengr Aug 02 '16 at 17:22
  • 3
    He also did not say he wants a solution for, e.g., a testing environment. A lot of people will not bother to read further and will dangerously go and use a solution that appears right to them, so it is only fair to mention the drawbacks rather than copy and paste something from the documentation while ignoring its notes and stripping away its recommendations. – webjunkie Aug 03 '16 at 10:34
8

You can do:

celery multi restart w1 -A your_project -l info  # restart workers


Zulu
cảnh nguyễn
6

If you're going the kill route, pgrep to the rescue:

kill -9 `pgrep -f celeryd`

Mind you, my tasks are not long-running and I don't care if they terminate brutally; I'm just reloading new code during dev. I'd go the restart-service route if it were more sensitive.
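
The two-step version (inspect first, then kill), using pkill as later suggested in the comments:

    pgrep -f celeryd     # step 1: list the PIDs that would be hit
    pkill -9 -f celeryd  # step 2: kill them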

JL Peyret
  • 10,917
  • 2
  • 54
  • 73
  • 4
    (pkill does this in a cleaner way) – orzel Jun 27 '16 at 20:27
  • Didn't know that. I still prefer seeing the list of processes that will be killed beforehand, however: step 1, tune your pgrep; step 2, weaponize it by feeding it to kill. – JL Peyret May 23 '18 at 04:34
2

What should happen to long-running tasks? I like it this way: long-running tasks should finish their job. Don't interrupt them; only new tasks should get the new code.

But this is not possible at the moment: https://groups.google.com/d/msg/celery-users/uTalKMszT2Q/-MHleIY7WaIJ
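
Until then, one workaround is to drain the old workers while new ones take over (a sketch; the node names, app name proj, and pidfile path are made up):

    # Start workers running the new code alongside the old ones
    celery multi start new1 -A proj --pidfile=/var/run/celery/%n.pid
    # TERM the old workers and wait: they stop taking new tasks,
    # finish what they're running, then exit
    celery multi stopwait old1 --pidfile=/var/run/celery/%n.pid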

guettli
  • 25,042
  • 81
  • 346
  • 663
2

I have repeatedly tested the -HUP solution using an automated script, but find that about 5% of the time, the worker stops picking up new jobs after being restarted.

A more reliable solution is:

stop <celery_service>
start <celery_service>

which I have used hundreds of times now without any issues.

From within Python, you can run:

import subprocess

service_name = 'celery_service'
for command in ['stop', 'start']:
    # Runs "stop celery_service", then "start celery_service"
    subprocess.check_call([command, service_name])
crunk_monad
2

You should look at Celery's autoreloading. Note that this was an experimental feature and was removed in Celery 4.0, so it only applies to Celery 3.x.
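
In Celery 3.x it was a worker flag, e.g. (the app name proj is a placeholder):

    celery worker -A proj -l info --autoreload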

sebasmagri
0

If you're using docker/docker-compose and running Celery in a separate container from the Django container, you can use

docker-compose kill -s HUP celery

where celery is the service name. The worker will be gracefully restarted and the ongoing task will not be brutally stopped.

I tried pkill, kill, celery multi stop, celery multi restart, and docker-compose restart; none of them worked. Either the container stopped abruptly or the code was not reloaded.

I just want to reload my code on the prod server manually with a one-liner, without playing with daemonization.

Benny Chan
-3

Might be late to the party. I use:

sudo systemctl stop celery

sudo systemctl start celery

sudo systemctl status celery
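
systemctl restart does the stop/start pair in one step:

    sudo systemctl restart celery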

Denis Kanygin