How to run a ML inference process on linux in real time Need to run a ML inference process on linux in real time to reduce latency.