I don't think I've ever seen a Phoronix report that showed error bars in their graphs and I've never done performance tests of the ones that they run without any jitter. They also have a history of making spectacularly bad decisions about what to benchmark. For example, they did benchmarks of FreeBSD against Ubuntu, but used a FreeBSD beta that had WITNESS and INVARIANTS options compiled into the kernel, both of which impose a fairly significant performance cost (but give better error reporting). Yesterday, they benchmarked GCC's OpenMP support against Intel's Clang-omp branch. Sounds easy, except that they didn't specify optimisation flags and GCC defaults to -O2, whereas Clang defaults to -O0. A sane evaluation would have shown the results for -O0 to -O3 for both compilers, along with error bars showing the standard deviation over ten or more runs.