According to brisk implementation [ Presentation in Cassandra SF ] - Cassandra, CFS, Job/Task Tracker and Hive Metastore run in a single JVM which is totally different from Configuring an independent hadoop cluster.
Is this an advantage?
What happens if Task Tracker or any of the individual process in the JVM fails? Will that affect the cassandra instance in the same JVM?
How does CFS get data from? Is it storing the SSTables as sub blocks or a copy of it? Where is that compression of sub blocks done?
Regards, Tamil