I can't understand the difference between dag_concurrency and parallelism. The documentation and some of the related posts here seem to contradict my findings.
My understanding was that the parallelism parameter sets the maximum number of task runs allowed globally (across all DAGs) in Airflow, and that dag_concurrency sets the maximum number of task runs allowed for a single DAG.
So I set parallelism to 8 and dag_concurrency to 4 and ran a single DAG. I expected it to run 4 task instances (TIs) at a time, but it was running 8 at a time.
How is that possible?
Also, if it helps, I have set the pool size to 10 or so for these tasks. But that shouldn't matter, since the "config" parameters take priority over the pool's, right?
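
To make my expectation concrete, here is a small Python sketch of the mental model I had. It is not Airflow code: the function expected_max_running is a hypothetical helper, and it simply assumes that the tightest of the three limits wins. The values are the ones from my setup (with the pool size taken as 10).

```python
def expected_max_running(parallelism: int, dag_concurrency: int, pool_slots: int) -> int:
    """Hypothetical helper, not an Airflow API.

    Models my assumption that a single DAG can never run more tasks
    at once than the smallest of the limits that apply to it.
    """
    return min(parallelism, dag_concurrency, pool_slots)


# Values from my setup: parallelism=8, dag_concurrency=4, pool size of about 10.
expected = expected_max_running(parallelism=8, dag_concurrency=4, pool_slots=10)
print(expected)  # 4
```

Under this model I would expect 4 concurrent task instances, but I am observing 8.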