I have an AWS glue job which writes data to a hudi table. It is possible the job might have up to 5 concurrent runs. Sometimes some of the runs fail with –
Exception in User Class: org.apache.hudi.exception.HoodieUpsertException: Failed to upsert for commit time 123
Caused by: org.apache.hudi.exception.HoodieRollbackException: Failed to rollback commits
After getting this error, no subsequent writes succeeds.
After going through the concurrency control documentation of hudi, I want to know if there is a way to handle this situation without getting into external lock providers? Maybe some internal retries can resolve this?
There is no overlap in the data being written by concurrent runs.