diff --git a/sources/tech/20171004 Concurrent Servers Part 2 - Threads.md b/sources/tech/20171004 Concurrent Servers Part 2 - Threads.md index 8ac1e9f490..e24ef5e8dd 100644 --- a/sources/tech/20171004 Concurrent Servers Part 2 - Threads.md +++ b/sources/tech/20171004 Concurrent Servers Part 2 - Threads.md @@ -138,6 +138,8 @@ The idea of a [thread pool][15] is simple, yet powerful. The server creates a Here's a diagram showing a pool of 4 threads, each processing a task. Tasks (client connections in our case) are waiting until one of the threads in the pool is ready to accept new tasks. +![](https://raw.githubusercontent.com/LCTT/wiki-images/master/TranslateProject/ref_img/006.png) + It should be fairly obvious that the thread pool approach provides a rate-limiting mechanism in its very definition. We can decide ahead of time how many threads we want our server to have. Then, this is the maximal number of clients processed concurrently - the rest are waiting until one of the threads becomes free. If we have 8 threads in the pool, 8 is the maximal number of concurrent clients the server handles - even if thousands are attempting to connect simultaneously. How do we decide how many threads should be in the pool? By a careful analysis of the problem domain, benchmarking, experimentation and also by the HW we have. If we have a single-core cloud instance that's one answer, if we have a 100-core dual socket server available, the answer is different. Picking the thread pool size can also be done dynamically at runtime based on load - I'll touch upon this topic in future posts in this series.