How to Avoid Duplicated Records in PyFlink When Joining Kafka Stream with Static Data?
I’m working with PyFlink and have encountered an issue where my final output contains duplicated records despite my SQL queries being tested and working correctly in SQL. My setup includes a Kafka consumer that receives CalledNumber, and I am trying to join this stream with a static dataset read from a CSV file.
Here is my setup: