2

I trying to develop a working example of Snowplow click tracking. I have to setup enrichment process to enrich raw data on Kinesis stream. But, when I am running JAR file, I am getting this error:

ERROR com.amazonaws.services.kinesis.leases.impl.LeaseManager - Failed to get table status for SnowplowEnrich-${enrich.streams.in.raw}

Is DynamoDB a necessity for enrichment process?

halfer
  • 19,824
  • 17
  • 99
  • 186
Prakhar Mishra
  • 1,586
  • 4
  • 28
  • 52
  • Did you find any solution for this question ? – abhijeet Feb 13 '18 at 07:25
  • 1
    Yes. Actually, I found out that underlying KCL library uses DynamoDB to maintain application state, like no. of records processed and all. So, if you are using Kinesis, then you have to use KCL, which in turn uses DynamoDB. There is no escaping it. – Prakhar Mishra Feb 14 '18 at 07:29
  • Cool. I found that too. Thanks for reply. – abhijeet Feb 14 '18 at 07:55
  • Exactly the question I am asking. Seems like we have to use DynamoDB no matter what to use Stream enrich. – addicted Sep 20 '19 at 02:10

1 Answers1

2

It depends, in batch mode DynamoDB is not necessary for enrichment process, DynamoDB is used in the RDB Shredder.

Which release are (were) you trying to install. For a PoC you can use Snowplow Mini

Snowplow community is active in discourse.snowplowanalytics.com

VCeron
  • 91
  • 1
  • 7
  • I think he's asking for Stream enrichment mode. For batch mode, we can skip even skip DB or use postgre/redshift – addicted Sep 20 '19 at 06:25