Comment Re:10 fold speed improvement - The Phd was idiot (Score 1) 280
There is another assumption that may not hold (but usually does).
The code assumes that a writes are atomic. This will almost always hold for 8 processes and usually form 32, but if the flags array is larger than a word, atomic writes go out the window.
The code assumes that a writes are atomic. This will almost always hold for 8 processes and usually form 32, but if the flags array is larger than a word, atomic writes go out the window.