I have a desktop application where the majority of calculations (>90%) happen on the Rust side of it. But I want the user to be able to write scripts in Python that will operate on the df.
Can this be done without serializing the dataframe between runtimes to a file?
A simple invocation could be this:
Rust: agg -> Rust: calculate new column -> Python: groupby -> Rust: count results
The serializing approach works for small datasets. It doesn't really scale to larger ones. The optimal solution would somehow be to be able to tell the python side: Here is a lazy dataframe in-memory. You can manipulate it
.
I've read the documentation and the only solution I could see is to use Apache IPC.