Understanding celery task prefetching

Question

I just found out about the configuration option CELERYD_PREFETCH_MULTIPLIER (docs). The default is 4, but (I believe) I want the prefetching off or as low as possible. I set it to 1 now, which is close enough to what I'm looking for, but there's still some things I don't understand:

Why is this prefetching a good idea? I don't really see a reason for it, unless there's a lot of latency between the message queue and the workers (in my case, they are currently running on the same host and at worst might eventually run on different hosts in the same data center). The documentation only mentions the disadvantages, but fails to explain what the advantages are.
Many people seem to set this to 0, expecting to be able to turn off prefetching that way (a reasonable assumption in my opinion). However, 0 means unlimited prefetching. Why would anyone ever want unlimited prefetching, doesn't that entirely eliminate the concurrency/asynchronicity you introduced a task queue for in the first place?
Why can prefetching not be turned off? It might not be a good idea for performance to turn it off in most cases, but is there a technical reason for this not to be possible? Or is it just not implemented?
Sometimes, this option is connected to CELERY_ACKS_LATE. For example. Roger Hu writes «[…] often what [users] really want is to have a worker only reserve as many tasks as there are child processes. But this is not possible without enabling late acknowledgements […]» I don't understand how these two options are connected and why one is not possible without the other. Another mention of the connection can be found here. Can someone explain why the two options are connected?

score 42 · Accepted Answer · edited May 23 '17 at 12:18

42

Prefetching can improve the performance. Workers don't need to wait for the next message from a broker to process. Communicating with a broker once and processing a lot of messages gives a performance gain. Getting a message from a broker (even from a local one) is expensive compared to the local memory access. Workers are also allowed to acknowledge messages in batches
Prefetching set to zero means "no specific limit" rather than unlimited
Setting prefetching to 1 is documented to be equivalent to turning it off, but this may not always be the case (see https://stackoverflow.com/a/33357180/71522)
Prefetching allows to ack messages in batches. CELERY_ACKS_LATE=True prevents acknowledging messages when they reach to a worker

edited May 23 '17 at 12:18

Community

1
1

answered Apr 17 '13 at 13:40

mher

10,508
2
35
27

Thanks. 2) Okay, but why would anyone want 'no specific limit'? 3) I'm pretty sure I still see "Got task from broker" messages before the current task has finished. – Henrik Heimbuerger Apr 17 '13 at 21:41
2

AFAIK, setting prefetching to 1 is **not** equivalent to turning it off. It's the lowest possible value (while not hacking the source code) for prefetch, which, in turn, is the number of CPU/cores in the current machine. – Ron Klein Jan 12 '14 at 10:32
2

@RonKlein It's not the number of CPUs/cores, it's the number of Celery workers you've defined (which can in some cases be the same, but usually not). Also, if each worker is prefetching a single task, then executing it, then prefetching another, this is equivalent to turning prefetching off, so IMHO your statement is incorrect. (The system must always prefetch as many tasks as there are workers if it wants to keep all workers fed with tasks.) – nitwit Sep 05 '14 at 19:38
@nitwit, my statement is based on my own experience with Celery + Redis as its backend. To elaborate, consider the following scenario: Using a single machine for workers, I start a Celery worker with `CELERYD_PREFETCH_MULTIPLIER` set to `1`. A task is handled. The **first** thing being executed is to unsubscribe from the current queue. Once the task is finished, I either get another task, or get an error claiming no handler found for current task. One way or another, this means that prefetching is not turned off. – Ron Klein Sep 06 '14 at 11:55
5

I think CELERYD_PREFETCH_MULTIPLIER set to 1 basically means 'fetching'. Pre-fetching is implied when the value is set to >1. So, in effect tasks beyond 1 are being 'pre-fetched'. – tigeronk2 Jan 22 '15 at 06:10
8

I've run some experiments, and (at least with the Redis broker) settings `CELERYD_PREFETCH_MULTIPLIER = 1` does *not* disable prefetching. It simply — as the name suggests — only prefetches one task at a time. – David Wolever Oct 26 '15 at 23:09
please take a look at [this link](https://stackoverflow.com/questions/41821614/celery-daemon-production-local-config-file-without-django) – Manish Jan 25 '17 at 11:17
@DavidWolever do you still have the code that you used to run this experiment? would you please share it here if you do. Thanks. – kapad Feb 05 '20 at 09:20

score 33 · Answer 2 · edited Jul 17 '18 at 05:25

Old question, but still adding my answer in case it helps someone. My understanding from some initial testing was same as that in David Wolever's answer. I just tested this more in celery 3.1.19 and -Ofair does work. Just that it is not meant to disable prefetch at the worker node level. That will continue to happen. Using -Ofair has a different effect which is at the pool worker level. In summary, to disable prefetch completely, do this:

Set CELERYD_PREFETCH_MULTIPLIER = 1
Set CELERY_ACKS_LATE = True at a global level or task level
Use -Ofair while starting the workers
If you set concurrency to 1, then step 3 is not needed. If you want a higher concurrency, then step 3 is essential to avoid tasks getting backed up in a node that could be run long running tasks.

Adding some more details:

I found that the worker node will always prefetch by default. You can only control how many tasks it prefetches by using CELERYD_PREFETCH_MULTIPLIER. If set to 1, it will only prefetch as many tasks as the number of pool workers (concurrency) in the node. So if you had concurrency = n, the max tasks prefetched by the node will be n.

Without the -Ofair option, what happened for me was that if one of the pool worker processes was executing a long running task, the other workers in the node would also stop processing the tasks already prefetched by the node. By using -Ofair, that changed. Even though one of the workers in the node was executing a long running tasks, others would not stop processing and would continue to process the tasks prefetched by the node. So I see two levels of prefetching. One at the worker node level. The other at the individual worker level. Using -Ofair for me seemed to disable it at the worker level.

How is ACKS_LATE related? ACKS_LATE = True means that the task will be acknowledged only when the task succeeds. If not, I suppose it would happen when it is received by a worker. In case of prefetch, the task is first received by the worker (confirmed from logs) but will be executed later. I just realized that prefetched messages show up under "unacknowledged messages" in rabbitmq. So I'm not sure if setting it to True is absolutely needed. We anyway had our tasks set that way (late ack) for other reasons.

Thanks for still contributing to this question! Could you add a bit more detail? For example, you're writing `-Ofair` has a "different effect", but not how the effect is different. Also, you're bringing up `CELERY_ACKS_LATE`, as others have before, but so far nobody has managed to explain to me what that attribute has to do with disabling prefetching. — Henrik Heimbuerger, Jun 08 '16 at 10:42
I had this same issue, running with a redis backend. I had 4 concurrent tasks running and when one started hanging the others would wait for that one to finish (it wouldn't) - Killing that worker would then allow others to resume. I already had `prefetch=1, celery_acks=True` and when I added `-Ofair` it fixed the issue where they were waiting for the hung worker. Unfortunately the hung worker issue still isn't fixed for me and so all the workers eventually hang, but at least they no longer do it at the exact same time. — JiminyCricket, Aug 12 '16 at 21:52
Not working for me. I am using celery 5.2, is there any settings which can impact? — Waqas Ali, Nov 24 '21 at 21:46
This works for me with celery 5.2, adding here, may be this is helpful for someone. https://stackoverflow.com/a/64869186/1900645 — Waqas Ali, Nov 24 '21 at 23:07

score 24 · Answer 3 · answered Oct 26 '15 at 23:07

Just a warning: as of my testing with the redis broker + Celery 3.1.15, all of the advice I've read pertaining to CELERYD_PREFETCH_MULTIPLIER = 1 disabling prefetching is demonstrably false.

To demonstrate this:

Set CELERYD_PREFETCH_MULTIPLIER = 1
Queue up 5 tasks that will each take a few seconds (ex, time.sleep(5))
Start watching the length of the task queue in Redis: watch redis-cli -c llen default
Start celery worker -c 1
Notice that the queue length in Redis will immediately drop from 5 to 3

CELERYD_PREFETCH_MULTIPLIER = 1 does not prevent prefetching, it simply limits the prefetching to 1 task per queue.

-Ofair, despite what the documentation says, also does not prevent prefetching.

Short of modifying the source code, I haven't found any method for entirely disabling prefetching.

As other answers have mentioned if you also set `CELERY_ACKS_LATE = 1` then you will effectively disable prefetching. — jodag, Jul 04 '17 at 17:04

score 14 · Answer 4 · answered Oct 29 '15 at 21:23

I cannot comment on David Wolever's answers, since my stackcred isn't high enough. So, I've framed my comment as an answer since I'd like to share my experience with Celery 3.1.18 and a Mongodb broker. I managed to stop prefetching with the following:

add CELERYD_PREFETCH_MULTIPLIER = 1 to the celery config
add CELERY_ACKS_LATE = True to the celery config
Start celery worker with options: --concurrency=1 -Ofair

Leaving CELERY_ACKS_LATE to the default, the worker still prefetches. Just like the OP I don't fully grasp the link between prefetching and late acks. I understand what David says "CELERY_ACKS_LATE=True prevents acknowledging messages when they reach to a worker", but I fail to understand why late acks would be incompatible with prefetch. In theory a prefetch would still allow to ack late right - even if not coded as such in celery ?

gCoh · Answer 5 · 2019-11-22T08:02:43.473

1

I experienced something a little bit different with SQS as broker.

The setup was:

CELERYD_PREFETCH_MULTIPLIER = 1
ACKS_ON_FAILURE_OR_TIMEOUT=False
CELERY_ACKS_LATE = True
CONCURRENCY=1

After task fail (exception raised), the worker became unavailable since the message was not acked, both local and remote queue.

The solution that made the workers continue consuming work was setting

CELERYD_PREFETCH_MULTIPLIER = 0

I can only speculate that acks_late was not taken in consideration when writing the SQS transport

edited Nov 22 '19 at 08:02

answered Nov 20 '19 at 16:13

gCoh

2,719
1
22
46

I created a PR handling this case, it will be available in celery==4.4.0 https://github.com/celery/celery/pull/5843 – gCoh Nov 22 '19 at 12:05

Understanding celery task prefetching

5 Answers5

Linked