My time series data TTLs
after 1-7 days (depends on the use case).
The data is immutable and ordered by timestamp (cluster by timestamp
) - data is timestamped "on-write" (so new data timestamps should always be progressive)
The partition size should not exceed 10K items - usually much less ( and at most ~10MB
for a full 10k items).
I didn't find any good documentation on how the compaction strategy should be configured (what parameters to take into account) so I just decided to do it like this:
compaction = {'class': 'TimeWindowCompactionStrategy', 'compaction_window_size': '7', 'compaction_window_unit': 'DAYS'}
Definitely not sure that this is correct
What are the KPI
I should be taking into account?