Thiết kế website giá rẻ

Question

EDIT: Having figured it out (see my own answer), I get that the question text below is misleading, which caused a lot of misunderstanding. I still believe it might help somebody who has the same issue.

Justification

I am asking this question (at least in its essence) now the third time, since the first and second attempt drifted off into discussions about details I did not intend.

I am nonetheless convinced that an answer to the actual core question would be a valuable contribution to people (like myself) who are not experts in multi-thread programming, since it is currently not easily found on the internet and poses a valid problem.

Question

Say we have a function fct to be run in parallel using threads. Now, it is OS-dependent if a thread can be spawned at a given moment, depending on the available RAM and other things. If it’s not possible, and we still naively try to do so, the program will crash. So, how can we reliably decide if a thread can be spawned, and otherwise decide against it, such that the program doesn’t crash uncontrollably?

(As others pointed out, the possible number of concurrently running threads is “usually” “very high”. I have two different Windows machines, though, one with 48GB of RAM (Windows 10; VS 2019), the other with 64GB of RAM (Windows 11; VS 2022) for both of which this number is somewhere around <20 for the below (extremely simple!) example. For the actual, more complex code I try to improve, I am stuck with only 5 threads. The point is not to improve this number or find out why my machines are shitty. I rather want my code to run crash-free on any OS it encounters.)

To have a ground truth, consider this MWE:

<code>#include <iostream>

#include <thread>

#include <vector>

#include <string>

constexpr int NUM_JOBS{ 100 }; // Number of jobs to run.

constexpr int NUM_THREADS{ 15 }; // Number of threads run at most in parallel. Setting this to 20 crashes on my machine.

void fct(const std::string& str) // Some function to run in parallel.

{

std::cout << "Processing job: " + str + "n";

}

int main(int argc, char* argv[])

{

std::vector<std::string> job_descriptions{}; // Different "configs" to call fct.

std::vector<std::thread> threads{};

int numThreads{ 0 };

for (int i = 0; i < NUM_JOBS; i++) { // Create some "configs".

job_descriptions.push_back("J" + std::to_string(i));

}

for (const auto& job : job_descriptions) { // Run the jobs, in parallel if possible.

bool spawned{ false };

while (!spawned) {

if (numThreads < NUM_THREADS) { // Spawn new threads only if not more than NUM_THREADS are running simultaneously.

threads.emplace_back(std::thread(fct, job));

spawned = true;

numThreads++;

}

else if (!threads.empty()) { // Otherwise wait for the topmost thread to finish before spawning new ones.

threads.front().join();

threads.erase(threads.begin());

numThreads--;

}

for (auto& t : threads) t.join(); // Join the remaining threads in the end.

return 0;

}

</code>

<code>#include <iostream> #include <thread> #include <vector> #include <string> constexpr int NUM_JOBS{ 100 }; // Number of jobs to run. constexpr int NUM_THREADS{ 15 }; // Number of threads run at most in parallel. Setting this to 20 crashes on my machine. void fct(const std::string& str) // Some function to run in parallel. { std::cout << "Processing job: " + str + "n"; } int main(int argc, char* argv[]) { std::vector<std::string> job_descriptions{}; // Different "configs" to call fct. std::vector<std::thread> threads{}; int numThreads{ 0 }; for (int i = 0; i < NUM_JOBS; i++) { // Create some "configs". job_descriptions.push_back("J" + std::to_string(i)); } for (const auto& job : job_descriptions) { // Run the jobs, in parallel if possible. bool spawned{ false }; while (!spawned) { if (numThreads < NUM_THREADS) { // Spawn new threads only if not more than NUM_THREADS are running simultaneously. threads.emplace_back(std::thread(fct, job)); spawned = true; numThreads++; } else if (!threads.empty()) { // Otherwise wait for the topmost thread to finish before spawning new ones. threads.front().join(); threads.erase(threads.begin()); numThreads--; } } } for (auto& t : threads) t.join(); // Join the remaining threads in the end. return 0; } </code>

#include <iostream>
#include <thread>
#include <vector>
#include <string>

constexpr int NUM_JOBS{ 100 };   // Number of jobs to run.
constexpr int NUM_THREADS{ 15 }; // Number of threads run at most in parallel. Setting this to 20 crashes on my machine.

void fct(const std::string& str) // Some function to run in parallel.
{
   std::cout << "Processing job: " + str + "n";
}

int main(int argc, char* argv[])
{
   std::vector<std::string> job_descriptions{}; // Different "configs" to call fct.
   std::vector<std::thread> threads{};
   int numThreads{ 0 };

   for (int i = 0; i < NUM_JOBS; i++) { // Create some "configs".
      job_descriptions.push_back("J" + std::to_string(i));
   }
 
   for (const auto& job : job_descriptions) { // Run the jobs, in parallel if possible.
      bool spawned{ false };

      while (!spawned) {
         if (numThreads < NUM_THREADS) { // Spawn new threads only if not more than NUM_THREADS are running simultaneously.
            threads.emplace_back(std::thread(fct, job));
            spawned = true;
            numThreads++;
         }
         else if (!threads.empty()) { // Otherwise wait for the topmost thread to finish before spawning new ones.
            threads.front().join();
            threads.erase(threads.begin());
            numThreads--;
         }
      }
   }

   for (auto& t : threads) t.join(); // Join the remaining threads in the end.

   return 0;
}

The function fct to run in parallel is very simple, it only prints one line to cout. Using some constant NUM_THREADS, we can control how many jobs are run in parallel at most. Setting this number too high yields on my Windows machines:

<code>Exception thrown at 0x768BA892 in ***.exe: Microsoft C++ exception: std::system_error at memory location 0x06DCFA68.

</code>

<code>Exception thrown at 0x768BA892 in ***.exe: Microsoft C++ exception: std::system_error at memory location 0x06DCFA68. </code>

Exception thrown at 0x768BA892 in ***.exe: Microsoft C++ exception: std::system_error at memory location 0x06DCFA68.

(On Linux, I can set it very high, but that’s, as said, not the point.)

The question is basically, can we replace the condition numThreads < NUM_THREADS with something to the effect of: “If thread can be spawned”?

Thiết kế website giá rẻ

Danh mục

Reliably checking if a thread can be created [duplicate]

Justification

Question