Querying the maximum number of concurrent kernel launches in SYCL
In my application I perform the same computation on batches of problems, I do however require some intermediate data to be allocated for these computations and therefore I’ve resorted to function objects which allocate memory based on specific meta data of my batches. (Rather than static memory inside functions or the memory-handle idiom employed by other APIs)