populating 40 million records from parquet in S3 to oracle tables using JDBC connection via EMR cluster(code written in pyspark)
We have 40 million data in S3 and while inserting data from S3 to oracle table the sessions are stuck and showing library cache: mutex X wait events.
What all enhancements can be done to improve performance.