Folks,
Recently I was reading some of the blogs NOSQL column oriented storage. I am trying my hands on CASSANDRA and HBASE.
What I understood is data is stored in column oriented manner.
e.g. Employee Id , Employee Name, Last Name
100 , 'abc', 'xyz'
200 , 'ABC' , 'XYZ'
Then data will be stored in the following format on the disk (column oriented storage single column together)
First column Second column Third Column
100|200 , 'abc'|'ABC' , 'xyz'|'XYZ'
1 ) I was wondering if we have to retrive single raw with id = 100 how it is done ? Since data is not continuous it will be costly ? (Is there any index with raw key for all columns)
2 ) Why HBASE cassandra is not having proper aggregation function support as Column oriented storage is meant for that ?