Relative Content

Tag Archive for apache-sparkpysparkapache-kafkabatchsize

tune kafka high throughput

Im producing my pyspark dataframe into kafka cluster and i have been trying to optimize my time performance of the records sending . Actually my tests are done with a pyspark dataframe that has 2300 rows , each row is sent to kafka .