Converting small functions to coroutines

Question

I feel like there is a gap in my understanding of async IO: is there a benefit to wrapping small functions into coroutines, within the scope of larger coroutines? Is there a benefit to this in signaling the event loop correctly? Does the extent of this benefit depend on whether the wrapped function is IO or CPU-bound?

Example: I have a coroutine, download(), which:

Downloads JSON-serialized bytes from an HTTP endpoint via aiohttp.
Compresses those bytes via bz2.compress() - which is not in itself awaitable
Writes the compressed bytes to S3 via aioboto3

So parts 1 & 3 use predefined coroutines from those libraries; part 2 does not, by default.

Dumbed-down example:

import bz2
import io
import aiohttp
import aioboto3

async def download(endpoint, bucket_name, key):
    async with aiohttp.ClientSession() as session:
        async with session.request("GET", endpoint, raise_for_status=True) as resp:
            raw = await resp.read()  # payload (bytes)
            # Yikes - isn't it bad to throw a synchronous call into the middle
            # of a coroutine?
            comp = bz2.compress(raw)
            async with (
                aioboto3.session.Session()
                .resource('s3')
                .Bucket(bucket_name)
            ) as bucket:
                await bucket.upload_fileobj(io.BytesIO(comp), key)

As hinted by the comment above, my understanding has always been that throwing a synchronous function like bz2.compress() into a coroutine can mess with it. (Even if bz2.compress() is probably more IO-bound than CPU-bound.)

So, is there generally any benefit to this type of boilerplate?

async def compress(*args, **kwargs):
    return bz2.compress(*args, **kwargs)

(And now comp = await compress(raw) within download().)

Wa-la, this is now an awaitable coroutine, because a sole return is valid in a native coroutine. Is there a case to be made for using this?

Per this answer, I've heard justification for randomly throwing in asyncio.sleep(0) in a similar manner - just to single back up to the event loop that the calling coroutine wants a break. Is this right?

Your question about small coroutines is interesting but maybe you will have more benefit from running synchronous function [in executor](https://docs.python.org/3/library/asyncio-eventloop.html#asyncio.loop.run_in_executor)? — sanyassh, Apr 25 '19 at 21:36
Someone can probably make an answer out of this: just putting a function in a coroutine doesn't make it asynchronous: it will still block. As @sanyash mentions, placing it in an executor will help to run it in another thread if you have something else to do in the meantime. — Max, Apr 25 '19 at 21:48

user4815162342 · Accepted Answer · 2019-04-26T12:02:50.797

So, is there generally any benefit to this type of boilerplate?

async def compress(*args, **kwargs):
    return bz2.compress(*args, **kwargs)

There is no benefit to it whatsoever. Contrary to expectations, adding an await doesn't guarantee that the control will be passed to the event loop - that will happen only if the awaited coroutine actually suspends. Since compress doesn't await anything, it will never suspend, so it's a coroutine in name only.

Note that adding await asyncio.sleep(0) in coroutines does not solve the problem; see this answer for a more detailed discussion. If you need to run a blocking function, use run_in_executor:

async def compress(*args, **kwargs):
    loop = asyncio.get_event_loop()
    return await loop.run_in_executor(None, lambda: bz2.compress(*args, **kwargs))

9000 · Answer 2 · 2019-04-26T18:08:28.073

2

Coroutines allow you to run something concurrently, not in parallel. They allow for a single-threaded cooperative multitasking. This makes sense in two cases:

You need to produce results in lockstep, like two generators would.
You want something useful be done while another coroutine is waiting for I/O.

Things like http requests or disk I/O would allow other coroutines to run while they are waiting for completion of an operation.

bz2.compress() is synchronous ~~and, I suppose, does not release GIL~~ but does release GIL while it is running. ~~This means that no meaningful work can be done while it's running.~~ That is, other coroutines would not run during its invocation, though other threads would.

If you anticipate a large amount of data to compress, so large that the overhead of running a coroutine is small in comparison, you can use bz2.BZ2Compressor and feed it with data in reasonably small blocks (like 128KB), write the result to a stream (S3 supports streaming, or you can use StringIO), and await asyncio.sleep(0) between compressing blocks to yield control.

This will allow other coroutines to also run concurrently with your compression coroutine. Possibly async S3 upload will be occurring in parallel at the socket level, too, while your coroutine would be inactive.

BTW making your compressor explicitly an async generator can be a simpler way to express the same idea.

edited Apr 26 '19 at 18:08

answered Apr 25 '19 at 21:57

9000

39,899
9
66
104

The part about feeding smaller blocks to `bz2.BZ2Compressor` makes a lot of sense. Thanks. I also think that I can (possibly) feed the `resp` itself, seeing that it is buffer-lik, to `compress()`. – Brad Solomon Apr 25 '19 at 23:42
1

bz2 does release the gil, so could be effectively used in another thread [ref: https://github.com/python/cpython/blob/0353b4eaaf451ad463ce7eb3074f6b62d332f401/Modules/_bz2module.c#L180 ] – Max Apr 26 '19 at 17:33
1

@BradSolomon: I upvoted the other answer, which is likely a better solution. – 9000 Apr 26 '19 at 18:10
@Max would you say there is still some effectiveness to chunking the compress/decompress routine? https://pastebin.com/523W9zXU – Brad Solomon Apr 27 '19 at 01:13
@BradSolomon You'd have to measure it, but assuming you're on a multi core system, and your blocks are big enough that the thread coordination is worth it, the separate thread will be faster. – Max Apr 27 '19 at 13:58

Converting small functions to coroutines

2 Answers2