Relative Content

Tag Archive for amazon-s3cassandradatastax-enterprisecassandra-3.0dse

Export data from large Cassandra table

I have a Cassandra table(with historic data) which is around 5TB or more. To optimize infra cost, I need to offload old data to S3. I am looking at dsbulk unload which is optimized for export but unsure if it will handle such large volume. There is another option to write custom application which queries data older than 3 years and creates CSV/Parquet and uploads to S3. Existing data model requires billions of queries in this case.