I'm new to building pipelines, and I'm using the Twitter API to stream data. My current approach is as follows:
- I send the streamed data to a Pub/Sub topic, which triggers a Cloud Function that processes each message and inserts it into BigQuery.
- The other approach that was recommended to me is Pub/Sub + Dataflow, but I'm not sure what difference that makes.
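
To make the first approach concrete, here is a minimal sketch of a Pub/Sub-triggered Cloud Function in Python. It assumes a background (1st gen) function, where the message arrives base64-encoded in `event["data"]`, and a JSON tweet payload. The table name, the selected fields, and the helper names (`decode_pubsub_event`, `to_row`, `handle_event`) are hypothetical and only for illustration.

```python
import base64
import json


def decode_pubsub_event(event):
    """Decode the base64 payload of a Pub/Sub-triggered event into a dict."""
    payload = base64.b64decode(event["data"]).decode("utf-8")
    return json.loads(payload)


def to_row(tweet):
    """Keep only the fields the (hypothetical) BigQuery table has columns for."""
    return {
        "id": str(tweet["id"]),
        "text": tweet.get("text", ""),
        "created_at": tweet.get("created_at"),
    }


def handle_event(event, insert_rows):
    """Process one message and hand the resulting row to `insert_rows`.

    `insert_rows` takes a list of dicts and returns a list of errors
    (empty on success), mirroring bigquery.Client.insert_rows_json.
    """
    row = to_row(decode_pubsub_event(event))
    errors = insert_rows([row])
    if errors:
        # Raising marks the invocation as failed, so the message can be
        # retried if retries are enabled for the function.
        raise RuntimeError(f"BigQuery insert failed: {errors}")
    return row


def main(event, context):
    """Entry point for the Pub/Sub-triggered Cloud Function."""
    # Imported lazily so the helpers above can run without the library.
    from google.cloud import bigquery

    client = bigquery.Client()
    table_id = "my-project.my_dataset.tweets"  # hypothetical table name
    handle_event(event, lambda rows: client.insert_rows_json(table_id, rows))
```

In this sketch every published message causes one function invocation and one streaming insert request of a single row.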
Can someone elaborate on the pros and cons of the two approaches?