Burton Smith responsible for architecture of the Tera MTA series and, much earlier, the Denelcor HEP -- both of which were ahead of their times technically but complete failures commercially. (Indeed, Tera Computer had significant financial problems and some corporate governance issues in the years leading up to the Cray purchase. I don't know the financials of Cray today, however.)
Some thoughts, in no particular order:
* The MTA and the HEP, together with Multiflow, represent the commercial roots of the multithreading (MT) work still going on in academia today. Note, however, that the "real" MT work is different by an order of magnitude from what we see in the threaded commericial chips emerging now from Intel, etc.
* The rumor as of a year or so ago was that Burton and a few of the Tera old guard had been pretty much sidelined from the larger Cray operation into unfunded R&D projects being pitched to organizations like ARPA, etc. It would be nice to believe that someone in the commercial arena is going to fund traditional MT ideals, but I'm skpetical.
* What is Microsft doing hiring him? Is this a largely PR move, to improve their HPC image? I have a hard time believing Microsoft is going to spend any money doing parallel architecture work; the list of companies that have tried and failed is long and impressive. Supercomputing today is either custom stuff, or high-end-but-nonetheless-stock hardware running Linux clusters. What's their angle?
* Back in the day, Tera had one of the hottest compilers on the planet; indeed, their compiler IP was pretty much the only valuable stuff left from the MTA project. [Ditto for Multiflow, whose compiler served as the base for Intel's compiler, way back when.] It would be interesting to see who else from the original Tera team follows him over to Redmond -- compiler folk? Architecture folk? Surely not hardware folk?
* If Microsoft wanted Burton, did Google make a play for him too? Now that would have been interesting -- one could have a fun time speculating about masive parallelism and large-grained work tasks across Google's distributed network...
[disclaimer: I briefly worked at Tera in the late 90's.]