What the T-series really needs is a boost to per-thread performance since it will otherwise remain a specialty processor only suitable for certain workloads.
The T2 core has more than enough parallelism for most apps out there. What isn't appreciated though is that it pushes the server *implementation details* all the way up to the app-developers, which causes them grief when they need to target different hardware or when they utilize "junior" developers. It also causes a lot longer performance tuning phases than on our previous platforms (SPARC and Intel).
This situation is fine for A-list developers but causes major grief for the multitude of companies whose developers don't have experience in massively parallelized systems. Companies in that situation unfortunately are most out there.
Our local SUN.. I mean Oracle drones always point out that our servers are able to handle so much in parallel, but that means squat if we can't meet our SLAs.
If I have a response time SLA of 1 second, then it does me little good if I can service 10 times the number of requests of a competitor's server if each request takes 3 seconds and the competitor's hardware actually allows me to reach the SLA!
Also, if there is any kind of locking going on, the server will more or less halt and whopty doo there goes the parallelism.
We won't be buying another T2 since event the PHBs can read the productivity charts and risk reports handed to them by external consultants. The cost of performance tuning of apps has climbed a lot and cost us a small fortune and continue to do so. Hardest to cope with in this space are legacy and third party apps where hardly anyone dare update the decayed code (or receive funding to do so) or in the latter case can.
However, we're seriously contemplating buying a bunch of Nehalem-EX servers and would perhaps have bought the Power7 if we were an IBM-shop since both those companies "Get It", get what customers as us need in contrast to Oracle.
Advice to Oracle: Add a bunch of cache and allow for higher clock speed in the T3 to really start competing with Intel and IBM. I don't care if you add a thousand more threads if each thread still incur a latency three to four times longer than your competition.
Finally, yes I realize I come across as a sour grape but the amount of time and cost *wasted* as a result of our PHB buying these servers (based on spec-mark figures) without contemplating the intended workload has really put a dent in my department's work atmosphere.