pmap level of parallelism inconsistent for chunked vs non-chunked input

Question

pmap level of parallelism inconsistent for chunked vs non-chunked input

8 Answers

jira · Answer 1 · 2011-10-26T00:51:55+0000

Comment made by: stu

Next person to take a deep look at pmap should probably also think through fork/join.

jira · Answer 2 · 2012-05-19T06:31:24+0000

Comment made by: jim.blomo

fork/join is a Java 7 feature. Would a proposed patch need to be able to fall back to Java 5 features?

jira · Answer 3 · 2012-05-19T07:06:38+0000

Comment made by: jafingerhut

Clojure/core folks can say more authoritatively, but I believe with the recent reduce enhancements that rely on jsr166 code, Clojure 1.5 will most likely require Java 6 or later, and Java 5 will no longer be supported.

jira · Answer 4 · 2012-05-28T23:29:01+0000

Comment made by: jim.blomo

Spinning up more threads than CPU cores is not a good idea when the work is CPU bound. Currently (1.4) pmap uses an unbounded thread pool, so chunked sequences will create more threads than intended. The least invasive change is to use a fixed sized thread pool (ForkJoinPool being one example). pmap is differentiated from core.reducers by the fact that it is lazy. This implies a one-at-a-time ThreadPool.submit model, instead of the recursive fork/join model. Tradeoffs include:

Enforce look-ahead even on chunked sequences:
- + no threadPool changes
- - working against chunking, which is presumably being used for a reason

Move to a fixed size thread pool:
- + reduce contention for cpu-bound functions on chunked sequences
- - increase total realization time for io-bound functions

Use ForkJoinPool for fixed thread pool (instead of newFixedThreadPool):
- + automatic and dynamic parallelism
- - more complex setup (picking Java 6 vs 7 implementation, sharing pool with core.reducers)

I think using a traditional fixed size thread pool is the right option. Most of the time all of pmap's results will be realized, so I don't think it's worth saving work by being strict about the look-ahead size. This is also the decision map has made. Since we're not using ForkJoin's main strength (recursive work queuing), I don't think it is worth setting it up in clojure.core.

I'll use Agent/pooledExecutor as the fixed size thread.

Let me know if I forgot or misunderstood anything.

jira · Answer 5 · 2012-05-29T01:59:02+0000

Comment made by: jim.blomo

2012-05-28 pmap-chunking-862.diff uses a fixed size thread pool for pmap functions.

jira · Answer 6 · 2014-01-11T20:42:57+0000

Comment made by: jafingerhut

Patch pmap-chunking-862.diff dated May 28, 2012 no longer applies cleanly after latest commits to Clojure master on Jan 11, 2014. I think the only issue is that the tests added have changed context lines, so should be a quick fix if someone wants to update the patch.

jira · Answer 7 · 2014-01-17T01:48:22+0000

Comment made by: jim.blomo

Thanks for the update, Andy. I'll take a crack at it this month.

jira · Answer 8 · 2019-06-26T12:00:00+0000

Reference: https://clojure.atlassian.net/browse/CLJ-862 (reported by llasram)

pmap level of parallelism inconsistent for chunked vs non-chunked input

Please log in or register to add a comment.

Please log in or register to answer this question.

8 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Categories

pmap level of parallelism inconsistent for chunked vs non-chunked input

Please log in or register to add a comment.

Please log in or register to answer this question.

8 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Related questions

Categories