I am trying to join multiple kafka streams & lookups using Apache Beam. Im using side inputs for handling lookup tables and everything worked out in direct runner. But, when i try to run it in spark mode or flink mode, i learnt that side inputs are not supported. These are few links of the Jira Bugs.
http://mail-archives.apache.org/mod_mbox/beam-user/201605.mbox/%3C573EFC2F.6000708@nanthrax.net%3E https://issues.apache.org/jira/browse/FLINK-6131
Is there a way to use side input or a work around for this ?
Can i use stateful processing for this? I know the state is available per window per key,but still is there a tweak to use it in a different way ?
Can i use a caching db like memcached and fetch data while processing every record?
Any suggestions are highly appreciated.
Thanks,