Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Tim Bray on Microsoft Office

Posted by CowboyNeal on Thu Oct 24, 2002 03:53 AM
from the good-things-where-you-least-expect-them dept.
jgeelan writes "The co-inventor of XML, Tim Bray, has been talking about the newly XML-enabled version of Microsoft Office, code-named 'Office 11' and tells XML-Journal that 'when the huge universe of MS Office documents becomes available for processing by any programmer with a Perl script and a bit of intelligence, all sorts of wonderful new things can be invented that you and I can't imagine.'"
This discussion has been archived. No new comments can be posted.
Tim Bray on Microsoft Office | Log In/Create an Account | Top | 589 comments (Spill at 50!) | Index Only | Search Discussion
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
(1) | 2
  • Yay Evil Monopoly Of Doom! (Score:3, Interesting)

    by Sneftel (15416) on Thursday October 24 2002, @03:57AM (#4520211)
    Wow, I was way off when I predicted that Microsoft would further obfuscate their Word format. This seems to be in all respects a Good Thing.

    StarOffice has used XML for their native file formats for some time now; I wonder if this means we'll see an even better-quality translator between the two formats?
  • However... (Score:4, Insightful)

    by kubrick (27291) on Thursday October 24 2002, @03:57AM (#4520217)
    Knowing Microsoft, they don't like to give up that format lock-in. They'll find some way to make MSXML difficult or impossible to access, assuming that they haven't already.... encrypted data or something like that.

    • An eXaMpLe of MSWord XML... (Score:5, Funny)

      by FyRE666 (263011) on Thursday October 24 2002, @04:33AM (#4520353) Homepage
      <?xml version="1.0"?>
      <Data>
      MSWORD$$g$%jk$%sxx"d$%^$% ^($%^m$k^a$%j^$%B$JJ"£"$M £K
      </Data>
      [ Parent ]
    • Re:However... (Score:5, Funny)

      by PhotoGuy (189467) on Thursday October 24 2002, @05:44AM (#4520532) Homepage
      They'll find some way to make MSXML difficult or impossible to access, assuming that they haven't already.... encrypted data or something like that.

      What?!?!?!? You mean they'll try and pass something off as a a "security feature," when it's really intended to protect them?

      Nah, that's not really their style.

      [ Parent ]
    • Read the article? (Score:5, Informative)

      by gazbo (517111) on Thursday October 24 2002, @05:44AM (#4520533)
      This bloke said he has had extensive access to the alphas and betas. He also said how great it was.

      So unless your mind has been slashdotted to the extent that you think that Microsoft is going to suddenly change the file-format completely between beta and release, then we know that it is perfectly easy to read.

      And if you do believe they will change the format, then you are a moron.

      [ Parent ]
      • Re:Read the article? by Neil Watson (Score:2) Thursday October 24 2002, @08:20AM
      • Re:Read the article? by InternalWave (Score:1) Thursday October 24 2002, @03:09PM
      • Re:Read the article? by kubrick (Score:1) Thursday October 24 2002, @08:43PM
      • Re:MOD PARENT UP by a_n_d_e_r_s (Score:3) Thursday October 24 2002, @06:25AM
        • Re:MOD PARENT UP (Score:5, Interesting)

          by danheskett (178529) <danheskett.gmail@com> on Thursday October 24 2002, @07:41AM (#4521042)
          They already done that with other products and wrote it in thier halloween-document. Where do you have any indication that they should change stratagy now ?

          Because they have competition now, and thats makes people do funny things.

          Did you ever think that maybe all thing things MS has done in the last 24+ months that show an increasingly high level of support for interoperability and standards compliance is more than just show?

          Did it ever occur to you that maybe, just maybe, that the head people at MS read the halloween documents and discarded them as garbage?

          The format is XML, its going to be open, and its going to be great. I am willing to make a cash bet on it. Yeah, sure, some data inside it will probably be a binary representation of an object - maybe graphs, charts links to external programs, images, etc. If you want to take that bet, e-mail me.

          Most people here on slashdot know that even with a level playing field of open file formats that Office will *still* dominate the marketplace for Office software.
          [ Parent ]
        • Re:MOD PARENT UP by dbrutus (Score:2) Thursday October 24 2002, @08:54AM
        • Re:MOD PARENT UP by Milican (Score:3) Thursday October 24 2002, @09:06AM
        • Re:MOD PARENT UP by Planesdragon (Score:2) Thursday October 24 2002, @10:27AM
        • You can't put a EULA somone else's document by bay43270 (Score:2) Thursday October 24 2002, @11:03AM
      • Re:MOD PARENT UP by greenrd (Score:2) Thursday October 24 2002, @06:25AM
        • Re:MOD PARENT UP (Score:5, Insightful)

          by gazbo (517111) on Thursday October 24 2002, @06:37AM (#4520676)
          Slight difference this time. Now, we actually have a beta release, seen by independent parties (such as Tim Bray) who have praised the file format. As I said, unless you seriously think they will completely change the file format between beta and release...

          Now, whether they can license the format so as to make it illegal for other apps to use it, I don't know. However, I suspect this is not the case as it more or less removes the advantage to having invested in XML in the first place. Well, sure there's good publicity, but how long would that last when people immediately discover it is worthless?

          And of course, the vast majority of people don't care about file formats. The only people to whom this news is of interest are those who will want to either access Office docs themselves, or use other apps (e.g. Open Office) to view Office docs. If this sector are banned from doing this, why did MS spend so much money on using XML in the first place?

          [ Parent ]
      • Re:MOD PARENT UP by FortKnox (Score:2) Thursday October 24 2002, @08:45AM
      • 2 replies beneath your current threshold.
    • I expect them to use legal means by Greyfox (Score:3) Thursday October 24 2002, @07:10AM
    • format lock in rebuttal by kbs (Score:1) Thursday October 24 2002, @08:30AM
    • Viruses? by Capt Dan (Score:2) Thursday October 24 2002, @08:46AM
    • What really is going to happen by spitzak (Score:3) Thursday October 24 2002, @12:04PM
    • Re:However... by iabervon (Score:3) Thursday October 24 2002, @01:42PM
    • Re:However... by tshak (Score:2) Thursday October 24 2002, @01:43PM
      • Re:However... by kubrick (Score:1) Thursday October 24 2002, @08:30PM
    • Re:However... by tabby (Score:1) Thursday October 24 2002, @09:15PM
      • Re:However... by kubrick (Score:1) Thursday October 24 2002, @10:49PM
        • Re:However... by tabby (Score:1) Thursday October 24 2002, @11:43PM
    • 2 replies beneath your current threshold.
  • Incompatibilities Once Again (Score:3, Insightful)

    by robbyjo (315601) on Thursday October 24 2002, @03:57AM (#4520218) Homepage Journal

    .... I guess it's just MSXML rather than THE standard XML. But we can figure it out with some "intelligent guesswork" now because the file would be human-readable.

    • Re:Incompatibilities Once Again (Score:5, Insightful)

      by JaredOfEuropa (526365) on Thursday October 24 2002, @04:18AM (#4520292) Journal
      It's just like the old SGML module for Word they used to have about 6 years ago. My guess is that there will be some significant drawback to saving documents in XML, such as loss of some formatting information. That would convince users not to save in the XML format... but that isn't the important thing to Microsoft.

      More significantly, there might be small incompatibilities, or ways that Word-created XML documents divert slightly from what is normal and proper in XML. Perhaps Word will make some (intentional) mistakes when reading back XML files generated in other applications, just like Word's old SGML module would choke on many proper SGML documents.

      Make no mistake: the fact that almost everybody is using Office and the associated file formats makes it very hard for a new contender to enter the office suite market. Microsoft must be aware of the power they have over the market with their Office file formats. Think of it: when you exchange files with other businesses, you have two realistic choices of file formats: Office or plaintext. And now Microsoft is introducing compatibility with an open and well-defined markup langauge, in favour of their proprietary language? I'll believe it when I see it.
      [ Parent ]
    • MS-standards (oxymoron?) by billybob2001 (Score:2) Thursday October 24 2002, @06:05AM
    • Re:Incompatibilities Once Again by surprise_audit (Score:1) Thursday October 24 2002, @01:20PM
  • wicked :) by oo7tushar (Score:2) Thursday October 24 2002, @03:58AM
    • Re:wicked :) by Mnemia (Score:3) Thursday October 24 2002, @04:40AM
  • Too good to be true by trustno_one (Score:1) Thursday October 24 2002, @03:59AM
    • Re:Too good to be true (Score:5, Insightful)

      by Masa (74401) on Thursday October 24 2002, @04:14AM (#4520277) Journal

      Because it doesn't matter if everyone is able to read, modify and generate Office-compatible files. People will us Office products in future. Opening the file formats doesn't change anything.

      XML makes it easy to create programs that will depend on MS Office. So this only makes it easier to create programs which depend on Microsoft products.

      [ Parent ]
      • Re:Too good to be true (Score:5, Insightful)

        by bokmann (323771) on Thursday October 24 2002, @05:28AM (#4520494) Homepage
        Except I will look to xml.openoffice.org to write some xslt transformations to take Microsoft office documents and liberate them once and for all.

        Once I can move my team of 20 people to open office with no real worries or complaints about 'interchanging' files with lusers still using Microsoft, I will.

        BUT, have you ever looked at an HTML file generated by Microsoft word? It is a GREAT example of how they can pollute a standard into something unreadable.

        I suspect that they will copyright or otherwise lock up their DTD/Schema, and try to lash out at anyone that uses them in other than 'approved' ways.

        [ Parent ]
      • Re:Too good to be true -- WRONG by Anonymous Coward (Score:1) Thursday October 24 2002, @06:11AM
      • Re:Too good to be true by RPoet (Score:1) Thursday October 24 2002, @06:59AM
      • Re:Too good to be true by SurfTheWorld (Score:1) Thursday October 24 2002, @09:25AM
    • Re:Too good to be true (Score:5, Interesting)

      by MrHanky (141717) on Thursday October 24 2002, @04:21AM (#4520303) Homepage Journal
      Maybe they need a migration path away from the win32-based format they use now. .NET also seems to follow that path. Remember that MS needs access to other platforms than the i386/desktop in the future - mobile devices for instance. Keeping a format that is basically a binary image from a PC is good for locking out competition, but not when you have to start competing with yourself.
      [ Parent ]
    • Re:Too good to be true by Petronius (Score:1) Thursday October 24 2002, @07:30AM
      • 1 reply beneath your current threshold.
    • 1 reply beneath your current threshold.
  • by leandrod (17766) <`leandro' `at' `dutra.fastmail.fm'> on Thursday October 24 2002, @03:59AM (#4520225) Homepage Journal

    The most important question, besides if the MS Word XML format will be well-documented enough, is if it will be the default saving format. Most MS Office users simply don't care enough to save MS Word documents in RTF, for example, even if it's more than good enough for the vast majority of the documents.

    Not the main issue on the article, but it is unfair to single someone as the inventor of XML, which is just a streamlined version of SGML which is an evolution from IBM's GML.

  • I doubt it. (Score:3, Insightful)

    by theLOUDroom (556455) on Thursday October 24 2002, @04:00AM (#4520226)
    I really have my doubts about wether Microsoft will allow "any programmer with a Perl script and a bit of intelligence" to muck around with Office documents.
    I'm guessing their XML document format will be just as hard to decyper and the current office formats.
    • Re:I doubt it. (Score:5, Insightful)

      by sql*kitten (1359) on Thursday October 24 2002, @04:59AM (#4520425)
      I really have my doubts about wether Microsoft will allow "any programmer with a Perl script and a bit of intelligence" to muck around with Office documents.

      Why not? After all, the high-quality ActiveState port of Perl to Win32 exists because Microsoft paid for it, and you can download it for free. Not only that, but if you want to write your own code to manipulate Office documents, you have been able to do that for years in VBA - all the Office programs expose rich APIs. In fact, they are composed of Objects that you can instantiate and use in your own programs if you want - all MS care about is that there is a licensed copy of Office on the user's machine. One of the easiest ways to do charting is to simply reuse a bit of Excel, for example. From there it's a short hop via COM to any program you want.

      I'm guessing their XML document format will be just as hard to decyper and the current office formats.

      The fact that Office documents have been in a proprietary format in the past is actually unimportant, since the interfaces to the applications (and hence their documents) are well documented (check MSDN or Barnes & Noble if you don't believe me). So the reason that Microsoft are doing this is that they lose nothing and gain from making the platform even more attractive to developers.
      [ Parent ]
      • Re:I doubt it. by jkramar (Score:2) Thursday October 24 2002, @05:14AM
      • Re:I doubt it. by Penguin (Score:2) Thursday October 24 2002, @05:39AM
      • Re:I doubt it. by Anonymous Coward (Score:2) Thursday October 24 2002, @05:50AM
      • Re:I doubt it. (Score:5, Insightful)

        by khuber (5664) on Thursday October 24 2002, @07:05AM (#4520791)
        The fact that Office documents have been in a proprietary format in the past is actually unimportant, since the interfaces to the applications (and hence their documents) are well documented

        So you can read Office documents with other programs as long as you have Office and MS dev tools?

        You do see the folly in that, right?

        -Kevin

        [ Parent ]
        • Re:I doubt it. by buzzcutbuddha (Score:2) Thursday October 24 2002, @09:06AM
          • Re:I doubt it. by khuber (Score:1) Thursday October 24 2002, @04:09PM
            • Re:I doubt it. by buzzcutbuddha (Score:1) Monday October 28 2002, @11:49AM
      • Re:I doubt it. by Hooya (Score:2) Thursday October 24 2002, @08:11AM
      • Re:I doubt it. by Avumede (Score:2) Thursday October 24 2002, @09:14AM
        • Re:I doubt it. by sql*kitten (Score:2) Thursday October 24 2002, @09:41AM
        • 1 reply beneath your current threshold.
      • 1 reply beneath your current threshold.
    • Re:I doubt it. by spongman (Score:3) Thursday October 24 2002, @05:08AM
      • Re:I doubt it. by greenrd (Score:2) Thursday October 24 2002, @06:31AM
      • 1 reply beneath your current threshold.
    • Re:I doubt it. (Score:5, Interesting)

      by ianezz (31449) on Thursday October 24 2002, @05:19AM (#4520471) Homepage
      I'm guessing their XML document format will be just as hard to decyper and the current office formats.

      There are 2 problems with the current format of Microsoft Office file:

      1. Give the correct interpretation to the bytes representing the document content, in order to import the Office document in some other office suite using a different representation.
        This is mostly solved (thanks to years of trials and errors).
      2. Give the correct interpretation to the bytes representing the document itself AND all the extra cruft having nothing to do with the document contents that the Microsoft Office suite puts in, in order to generate documents readable by the various versions of the Office suite.
        This is definitively more difficult, as nobody knows Office internals and how they expect such additional data to be. StarOffice guys managed to make an acceptable job, at the price of years of trials and errors. It's like watching at a dump of your computer's memory, guesssing what's code, what's data, what's padding and the meaning of every byte...

      Now, do an XML format simplifies things? Well, yes, just as an RTF text is easier to manage than a pure binary format, but nothing prevents putting extra cruft in an XML document, so it's just that instead of having to use a hex editor, you now may use a text editor, but giving a correct interpretation of tags and attributes is something that only Microsoft can do, unless it publishes the full specifications (present and future: after all, XML is eXtendible, right?)

      Personally, I think that:

      • Microsoft is realizing that the current Office formats are getting out of control, so it wants to get rid of them, because mantaining backwards compatibility is becoming too much painful.
      • An XML-based format may be the right answer for Microsoft, in that all the subtles of parsing binary data simply disappear, while it may still make difficult to everyone else to understand what's the real meaning of data. Let's say <obscuretag_42 foobarizer="xyzzy"/>
      • Microsoft was not giving out the specifications of the formats of its Office suite before: should we now suppose it's giving out the DTD/Schema AND a good explanation of how to interpret it? I'd hope the answer is yes, but giving the company's precedents...
      [ Parent ]
      • Re:I doubt it. by Bartmoss (Score:3) Thursday October 24 2002, @05:37AM
      • Re:I doubt it. by commodoresloat (Score:2) Thursday October 24 2002, @06:09AM
        • Re:I doubt it. by Reckless Visionary (Score:2) Thursday October 24 2002, @10:53AM
      • Re:I doubt it. by ipjohnson (Score:1) Thursday October 24 2002, @06:50AM
      • a middle road? by Simon (Score:1) Thursday October 24 2002, @07:01AM
      • Re:I doubt it. by tshak (Score:2) Thursday October 24 2002, @02:06PM
        • Re:I doubt it. by theLOUDroom (Score:1) Thursday October 24 2002, @02:58PM
      • 1 reply beneath your current threshold.
    • Re:I doubt it. by tburkhol (Score:1) Thursday October 24 2002, @08:21AM
    • Perl Marco Fun by jiminim (Score:1) Thursday October 24 2002, @08:27AM
  • Historical turningpoint? (Score:5, Interesting)

    by haeger (85819) on Thursday October 24 2002, @04:00AM (#4520229)
    I just thought about someone saying that somewere, when you look back in history, you can see some historical turningpoint where tings just went wrong or right.

    One small such point is when IBM gave out the specs to their hardware for PC allowing everyone to clone it, while Apple did not.

    This could be such a point. Maybe in 10 years we'll look back at this and ask ourselves "Why the heck did MS XML-enable their Office app, releasing the hold that they had"

    Only time will tell I guess.

    .haeger


    I Play Hattrick [hattrick.org]

  • Codename? by furchin (Score:2) Thursday October 24 2002, @04:01AM
    • Re:Codename? by Dausha (Score:3) Thursday October 24 2002, @04:05AM
    • Re:Codename? by Cainam (Score:1) Thursday October 24 2002, @04:30AM
      • Re:Codename? by spongman (Score:3) Thursday October 24 2002, @05:11AM
    • Re:Codename? by misterhaan (Score:1) Thursday October 24 2002, @08:10AM
    • 1 reply beneath your current threshold.
  • *when* ? (Score:4, Funny)

    by Monty Worm (7264) on Thursday October 24 2002, @04:01AM (#4520233) Journal
    when the huge universe of MS Office documents becomes available for processing by any programmer

    I beg you pardon? Smelly programmers can keep their hands off my documents. If I wanted you to have them, I'd have emailed them to you as plaintext. I wasn't aware the the Office license meant my documents were common property....

    • Re:*when* ? by SmlFreshwaterBuffalo (Score:1) Thursday October 24 2002, @04:20AM
    • 2 replies beneath your current threshold.
  • The right time for MS (Score:5, Insightful)

    by terminal.dk (102718) on Thursday October 24 2002, @04:03AM (#4520238) Homepage
    MS is trying to time this right.

    Right now they are seeing diminishing sales, possible shrinking market share. Most of the danish public sector is looking to save money using OpenOffice/StarOffice.

    MS needs to increase their compatibility with other options, as they would otherwise force customers to convert every single user away from MS at once, instead of OpenOffice coming in slowly.

    They can also hope, that their format is setting the standard, and the other companies will have to play catch-up rather than the other way around.
  • imagination (Score:5, Funny)

    by selderrr (523988) on Thursday October 24 2002, @04:05AM (#4520245) Journal
    ...all sorts of wonderful new things can be invented that you and I can't imagine...

    When will MS ever learn that we don't WANT to imagine how wonderfull the MS Office Universe is ?
    • Re:imagination by dbrutus (Score:2) Thursday October 24 2002, @11:38AM
    • 1 reply beneath your current threshold.
  • read through "EULA" in the XML? by GnomeKing (Score:1) Thursday October 24 2002, @04:05AM
  • WTF???? (Score:3, Informative)

    by jericho4.0 (565125) on Thursday October 24 2002, @04:06AM (#4520251)
    from the article;
    The most important question, besides if the MS Word XML format will be well-documented enough,...

    WTF!? XML shouldn't need to be documented. The whole point is to create a human readable file that is parseble by computer. If MS Word delivers an XML file that I can't figure out, it's not XML.

    • Re:WTF???? (Score:4, Insightful)

      by lovebyte (81275) <lovebyte2000@NOSPAM.gmail.com> on Thursday October 24 2002, @04:10AM (#4520262) Homepage
      Have you ever seen some complex XML file? Without documentation it could be as difficult as binary to reverse-engineer!
      [ Parent ]
      • Re:WTF???? by jericho4.0 (Score:2) Thursday October 24 2002, @04:30AM
        • Re:WTF???? by lovebyte (Score:1) Thursday October 24 2002, @05:00AM
        • Re:WTF???? (Score:5, Insightful)

          by vidarh (309115) <vidar@hokstad.com> on Thursday October 24 2002, @05:25AM (#4520483) Homepage Journal
          The point of XML is not for it to be human readable, but to allow easy automatic processing of various kinds.

          With XML Schema and DTD's, you can validate various aspects of the data without writing a custome validator.

          With XPath and XPointer you can refer to parts of an XML document without needing to understand what the document contains.

          With XSL you can translate all or parts of the document from one format to the other without your application needing to know the structure, and without needing to understand more of the format than the parts you are extracting.

          With SAX and the DOM you can programmatically traverse and extract information from an XML file without having to write a custom parser.

          With CSS an editor or viewer for instance can use a standard mechanism of applying styles to elements without hardcoding the style attributes for elements anywhere.

          With XML namespaces, you can intersperse data in various formats in the same file, and the components handling each of the vocabularies need not know anything about the other components - an example would be embedding SVG in HTML: The HTML renderer doesn't need to understand any of the SVG tags, only that it should delegate contents with other namespaces to another component. And the SVG renderer couldn't care less about the HTML.

          And this doesn't even touch on the benefits of all the various interchange formats that have been specified on top of these base technologies.

          The importance of XML is that it opens up the doors for building interchangable components that operate on data without needing any hardcoded application specific knowledge of the data.

          Most of the time, you still have to write some code to tie it all together, but you don't have to build your own parsers, your own document object model, your own styling system, your own way of handling contained data of other types, your own way of transforming data between formats, etc.

          For me as a software developer XML delivered years ago. I use XML technologies daily, and it saves me work.

          [ Parent ]
      • Re:WTF???? by WhaDaYaKnow (Score:2) Thursday October 24 2002, @04:33AM
        • Re:WTF???? by SlamMan (Score:2) Thursday October 24 2002, @05:51AM
    • Re:WTF???? by Anonymous Coward (Score:3) Thursday October 24 2002, @04:28AM
      • Re:WTF???? by richie2000 (Score:2) Thursday October 24 2002, @04:51AM
      • Re:WTF???? by spongman (Score:2) Thursday October 24 2002, @05:13AM
      • Re:WTF???? by Nailer (Score:2) Friday October 25 2002, @03:52AM
      • 1 reply beneath your current threshold.
    • Re:WTF???? by anshil (Score:2) Thursday October 24 2002, @04:31AM
    • Syntax vs. Semantics (Score:5, Insightful)

      by mindriot (96208) on Thursday October 24 2002, @04:31AM (#4520343)
      Yes, the point of XML files is that their _syntax_ is simple and easily parseable by computers. But that doesn't tell you anything about the _semantics_ of a document. And as long as there is no proper documentation on what the mess of tags in your XML file means, there's hardly any way for you to hack together a Perl script to, say, extract plain text, or convert the Word XML file to an OpenOffice.org XML file, or whatever else comes to mind.
      [ Parent ]
    • Re:WTF???? by Krach42 (Score:1) Thursday October 24 2002, @04:44AM
      • Re:WTF???? by anshil (Score:2) Thursday October 24 2002, @04:54AM
        • Re:WTF???? by Krach42 (Score:1) Thursday October 24 2002, @05:19AM
          • Re:WTF???? by anshil (Score:1) Thursday October 24 2002, @05:54AM
          • 1 reply beneath your current threshold.
      • Re:WTF???? (Score:4, Insightful)

        by DGolden (17848) on Thursday October 24 2002, @04:59AM (#4520424) Homepage Journal
        Here's my pet rant:

        I would say that XML isn't a markup language - a markup language would allow the "bad nesting", since a markup language should be "layers of virtual highlighter pen" applied to an underlying data stream. XML, since it requires "proper nesting", is just Lisp sexps reimplemented, but with terrible syntax. It's yet-another-tree-structured-data-format. Big Wow. A true markup language environment would facilitate part-structured data, like HTML used to be, rather than shoehorning everything into trees.

        Lisp sexps would just say (stuff (things "text"))

        In fact, that's pretty much all there is to lisp syntax right there. The above is (a) a potentially valid lisp program and (b) a valid lisp data structure.

        XML is a data format designed mainly to allow C and Java programmers to use vaguely Lisp-like processing techniques without realising it and/or admitting it to themselves.

        [ Parent ]
        • Re:WTF???? by Krach42 (Score:1) Thursday October 24 2002, @05:11AM
        • Re:WTF???? by statusbar (Score:2) Thursday October 24 2002, @07:13PM
        • 1 reply beneath your current threshold.
      • Re:WTF???? by spongman (Score:2) Thursday October 24 2002, @05:27AM
    • Re:WTF???? by ianezz (Score:3) Thursday October 24 2002, @05:43AM
    • 1 reply beneath your current threshold.
  • What is the format? by pubjames (Score:2) Thursday October 24 2002, @04:08AM
  • interesting! by garglblaster (Score:1) Thursday October 24 2002, @04:09AM
  • by Zeddicus_Z (214454) <Zeddicus_Z@@@hotmail...com> on Thursday October 24 2002, @04:10AM (#4520263) Homepage
    As far as I can tell, one of the major reasons many businesses refuse to change over from Microsoft Office to cheaper options is due to file compatability. As our company's IT admin put it recently on the suggestion of using OpenOffice, "I get sent hundreds of Microsoft Word, Excel and Access documents a week. I need to know that I can open and access every single one of those without problems". An example of proprietry file formats helping Microsoft keep the monopoly.

    However, if Microsoft Office documents become "built around an open, internationalized standard", i.e. XML, would this not enable the people behind OpenOffice, StarOffice etc to acheive total 100% file compatability and thus negate Microsoft's largest advantage with Office?

    Of course, this could be yet another Microsoft "embrace and extend" tactic, a la` kerberos. Incorporate the standard in a bastardised form, claim standards compatability, then pollute it so you must be using Microsoft technology to properly interact with it.

  • HTML from Word (Score:5, Interesting)

    by e8johan (605347) on Thursday October 24 2002, @04:11AM (#4520266) Homepage Journal
    Just look at an HTML file exported form Word2k. I would not call that compatible with any HTML I've ever learned. Most probably the XML file exported from Office 11 will be a Microsoft specific file, specifying lots of Office specific ActiveX (aka OLE) info that cannot be emulated. And, hey, they can probably store binary data in XML. The only change is that most competing products will emit files that Word can easily read, i.e. M$ will get the biggest benefits.
    • Re:HTML from Word (Score:5, Insightful)

      by pubjames (468013) on Thursday October 24 2002, @04:51AM (#4520403)
      Just look at an HTML file exported form Word2k.

      An excellent point sir. That's a great illustration of how Microsoft approaches 'open' file formats.

      People that think that MS Office is going to move to open, well documented file formats are just plain nuts. But look at many of the comments in this forum - it seems MS has even managed to persuade many Slashdotters that they are going to use open formats. Poor fools.
      [ Parent ]
    • Re:HTML from Word (Score:5, Informative)

      by superyooser (100462) on Thursday October 24 2002, @05:24AM (#4520481) Homepage Journal
      True. Just a couple days ago, I saved a doc as Web Page in Word (Office XP) hoping that some clipart would be saved in a web-friendly format. (This was originally made in Publisher, NOT by me.) It didn't work; it saved the images as .wmz! For the web?!

      Anyway, there was tons of gibberish in the file, but it displayed fine in IE6. It was a completely blank page in Mozilla! Nothing at all! We always knew the XP didn't stand for cross-platform, but I didn't know it was this bad.

      [ Parent ]
      • 1 reply beneath your current threshold.
    • Re:HTML from Word by jonbrewer (Score:2) Thursday October 24 2002, @07:50AM
    • Re:HTML from Word by Anonymous Custard (Score:1) Thursday October 24 2002, @08:05AM
    • Re:HTML from Word by tshak (Score:2) Thursday October 24 2002, @01:50PM
  • MS Office and ML's by FireMotion (Score:1) Thursday October 24 2002, @04:13AM
  • Palladium by tsa (Score:1) Thursday October 24 2002, @04:13AM
  • Typical XML-proponent mistake (Score:5, Insightful)

    by Baki (72515) on Thursday October 24 2002, @04:14AM (#4520276)
    Just because the file format, instead of binary, is "human readable", does not make it more open.

    For "any programmer with a Perl script and a bit of intelligence" it doesn't make a difference if you read bytes (binary) or XML structures.

    As long as you don't get a DTD with extensive comments on how to interpret the elements, along with some promise/guarantee that the DTD won't change every minor release, there is no real improvement at all.

    The fact that XML is human readable is irrelevant, since no human shall read the files, but programs such as perl scripts shall. For them it makes hardly any difference; it is only marginally easier since you can use an existent XML parser instead of rolling your own (which is no big deal using the right tools such as YACC).

    This 'openness' comes at a good time for Microsoft. They suggest openness in a time that they are criticized and attacked because of file-format lock in. Many 'advisors' shall be mislead, blinded by buzzwords such as XML as they are, and actually believe that this solves the issue.
    • Re:Typical XML-proponent mistake (Score:5, Interesting)

      by smallpaul (65919) <paul&prescod,net> on Thursday October 24 2002, @04:36AM (#4520365)

      As long as you don't get a DTD with extensive comments on how to interpret the elements, along with some promise/guarantee that the DTD won't change every minor release, there is no real improvement at all.

      Have you ever tried to reverse engineer a binary file format? And have you ever tried to do the same thing with an XML file format? I learned huge chunks SVG yesterday _without_ opening an SVG book, just by mucking around in an existing SVG file and with an SVG viewer. Of course, Microsoft could do something clearly in violation of the spirit of XML, by making the whole thing one tag full of base64ed text or something. But as long as they use tags in a semi-sane way (which is the whole point, for integration with corporate systems), XML will be a big step forward.

      [ Parent ]
    • Re:Typical XML-proponent mistake by TechnoVooDooDaddy (Score:1) Thursday October 24 2002, @07:44AM
    • 1 reply beneath your current threshold.
  • The world will be a better place... by slantyyz (Score:1) Thursday October 24 2002, @04:15AM
  • Proprietary file formats... by lanalyst (Score:2) Thursday October 24 2002, @04:16AM
  • Dogbert the Evil software consultant by choka (Score:1) Thursday October 24 2002, @04:17AM
  • To me it looks like classic MS embrace and extend! by patrixx (Score:2) Thursday October 24 2002, @04:17AM
  • Stalling tactics?... (Score:5, Insightful)

    by pubjames (468013) on Thursday October 24 2002, @04:22AM (#4520306)
    Perhaps these announcements of XML compatible office file formats are just stalling tactics? MS has done it before.

    MS now has a serious competitor in StarOffice/OpenOffice.org. And that competitor has two compelling advantages - it's cheaper/free, and open XML file formats. So when clued-up IT people say to their Pointy-Haired Bosses that they should use StarOffice/OpenOffice.org, PHBs can respond "but MS is doing that next year. We can avoid all the disruption of changing office suites just by waiting a bit and upgrading to the next version of MS Office. Besides, we're already paying for it." Then when MS actually releases Office 11, they will have used all sorts of devious and subtle devices to keep their lock-in of the file format, and MS and PHBs will be happy.
  • Well Excel in Perl is pretty easy now (Score:5, Informative)

    by twoshortplanks (124523) on Thursday October 24 2002, @04:22AM (#4520309) Homepage
    I've used the excel reading and writing modules for Perl with great success. They're easy to use and do the job. (there are also simpler [cpan.org] interfaces [cpan.org] if you want them too.)

    Or you could go the whole hog and use a SAX writer like XML::SAXDriver::Excel [cpan.org] to create the documents from XML yourself.

    (This is not to say I don't think XML native formats arn't cool and will have many uses, I'm just pointing out what you can do now.)

  • Been there.... done that... by Kindaian (Score:1) Thursday October 24 2002, @04:23AM
    • 1 reply beneath your current threshold.
  • Too good to be true? by varslot (Score:2) Thursday October 24 2002, @04:24AM
    • 1 reply beneath your current threshold.
  • What I heard.... (Score:3, Interesting)

    by LarsBT (580206) on Thursday October 24 2002, @04:27AM (#4520322)
    I can't remember the reference, but I heard that they will embed binary code for different word-objects within XML tags e.g.

    <equation> 0100100100111101010011010101101010010 </equaition>
    which is allowed in XML (if I understand XML correctly). So not much gain if everything is still in propriety closed binary format.

    I think maybe it was the CEO of Microsoft Denmark. I'm NOT sure though

  • Skeptical by PizzaFace (Score:2) Thursday October 24 2002, @04:27AM
  • by Bazman (4849) on Thursday October 24 2002, @04:29AM (#4520333) Journal
    <?xml version="1.0"?>
    <uueWord2kDocument>
    M"@D)("!'3E4 @3$E"4D%262!'14Y%4D%,(%!50DQ)0R!,24-%3 E-%"@D)("`@
    M("`@(%9E7)I9VAT("A#*2`Q.3DQ
    M($9R96 4@4V]F='=A6]N92!I2!A;F0@9&ES=')I8G5T
    M92!V97)B871 I;2!C;W!I97,*(&]F('1H:7,@;&EC96YS92!D; V-U;65N="P@
    </uueWord2kDocument>

  • XML Based? by hpavc (Score:1) Thursday October 24 2002, @04:30AM
  • Other MS products using XML by Anonymous Coward (Score:2) Thursday October 24 2002, @04:34AM
  • C'mon People (Score:3, Insightful)

    by BurritoWarrior (90481) on Thursday October 24 2002, @04:34AM (#4520358)
    Office's MS-XML will be even less compatible with sthe spec than MS-Kerberos or MS-Java/J++. Office is their cash cow. It brings in 30-40% of their revenues all by itself.

    If you think there is even a remote chance in he-double L that MS will loosen their grip on this revenue stream, I have a bridge to sell you.

    You can call this flamebait if you want, but what in MS's history would lead me to believe they are suddenly going to change their historic behavior pattern AND risk a huge amount of revenue at the same time?
    • 1 reply beneath your current threshold.
  • I can see it now... (Score:3, Funny)

    by dimator (71399) on Thursday October 24 2002, @04:35AM (#4520362) Homepage Journal
    <?xml version="1.0"?>
    <document type="word">
    <![CDATA[
    @%MYD<V@Q4VEA8^`!AX0>DN6UIJE=^1J;1F\ @! (P@@<$Y(@OL%AS`0B=$<S*
    4&A399HT2*S-@*+U&1)+KCS>J4 HJTZ=^F534G%_S8\6=YS7?#59_.U!YI[_^
    AU$`HOG^5/N3A9 9'<\V/YP`(T*'MZ6)3UVSCDYF&+B;0H?7I3'O7'(2/H(Z>
    U= ;N1:`!4*"4U/ATNK5GOO+^B\O?/\QK^3KE>KVYL"PN-3O2'/^9 3/U)I.PP
    FXG3%*.RR)0.R'/&N!?>U'*;4FK6B,U:B<4@-6O% 1D!!%Z/31&E(R*MCU,HH
    15RT`H`P2H$,O5FB!R,`"*`!J5FJ -4@TNEB5)E:'"D;AO4.?>-Z1FGVQN"3O
    VN6RANM76P&((F=# 3GYM05%C;E%C1F[MQ>P:*".O*3VW,<-9`T:D.^O2BE@*
    4N25 U@$0X#X!(B8*+H-1(3'!Y9'%ZF1B%7P9E#"^90&U72560M1E`R F$1;4$
    :%/(I$JY3"67*"&E5,4&X.2>R]!F@"#7VLH>;5`>@( "`!IX4A`FK)LG*7O%D
    P^$G)10Y"^L:FO_^\,GTP-"V:_R/GL %-,**[?^UIWRK2YT.;70-KW8.LG;)[
    ]]>
    </document>
  • by pvera (250260) <pedro.vera@gmail.com> on Thursday October 24 2002, @04:39AM (#4520372) Homepage Journal
    SQL Server has had an XML web gateway since version 2000. You can run any query and output it as xml or have an xml template pull the query and transform the results with XSL, all without one line of server side script.

    ASP.net uses XML for all the human-readable files, and the IIS in windows.net server finally uses Apache-style configuration files which are also XML.
  • Yeah, right (Score:3, Insightful)

    by Alex Belits (437) on Thursday October 24 2002, @04:41AM (#4520377) Homepage
    XML is a format with nearly infinite possibilities for obfuscation, convolutedness and poorly defined standards. The most we can expect is the possibility to validate a file to absolutely certainly determine if it is compliant with the new Word format or not.
    • 1 reply beneath your current threshold.
  • M$ only cooks with water too. (Score:3, Insightful)

    by Qbertino (265505) on Thursday October 24 2002, @04:45AM (#4520392)
    I'm working with that weedy Word 2k at the office. And we use Outlook as a standard communication Platform. Believe me, that their Software often is such a pain isn't that much of a greater plan to rule the world, but more the flat-out ineptitude of delivering products with a conceptual consitency.
    Looking at Frontpain and Word HTML and extrapolating XML from that, tells me they're gonna do just a crappy job as usual and really think they've done a great thing.
    Just like the people sending me source code additions and DB content as Wordfiles. Nothing but simple inemptitude, I say.
    Not that my System of choice, Linux, is that much more consistent. Mind you. With a bazillion Font methods, every single one of them looking crappier than the next and QT, GTK+, Motif, Lesstif, Inbetweentif, Swing, TK and whatnot and none of them following the same Clipboard behaviour it's just as weedy. Only it is under *my* control to change it.
    That way, the bottom line is: With OSS if it doesn't work, there's another way. With M$ it's 'Game Over' with the first "Error in module [fill in random hexcode here]".
    That's the simple difference.
  • what a cool codename (Score:4, Funny)

    by Anonymous Coward on Thursday October 24 2002, @05:01AM (#4520431)
    code-named 'Office 11'

    awesome. Apparently the next version of the linux kernel is code named 2.6! Wow!
  • How to convert Word to XML (Score:5, Informative)

    by Korth (50341) on Thursday October 24 2002, @05:05AM (#4520433)
    I've recently been reviewing a dozen of different software to convert from Word to XML.

    So far the best tool I found is upCast (free for personal use) from http://www.infinity-loop.de/ .

    To convert a Word file:
    * Use Word's AutoFormat feature to convert visual formatting to Word styles
    * Redefine all the text as Word styles
    * Run upCast to convert to XML using the "XML (content, no DTD)" filter
    * Run HTML Tidy from http://tidy.sourceforge.net/ with the parameters -xml -utf8 -clean -bare .

    Other tools that might be worth a second look:
    * Majix (Open Source) - http://www.tetrasix.com/
    * WorX SE - http://www.xyvision.com/
    * XML MarkupKit (in German) - http://www.eds.schema.de/download/MarkupKit/
    * DocSoft LLC Word-to-XML - http://www.docsoft.com/w2xml.htm
  • Hype! Hype! Hype! (Score:5, Interesting)

    by RobotWisdom (25776) on Thursday October 24 2002, @05:05AM (#4520435) Homepage
    This article is pure PR, with no new content. The XML-cult will keep waving their hands and promising great payoffs 'RSN' (real soon now) until people actually start trying to implement uniform semantic tags in their data and documents... at which point universal disillusionment will set in because the problem is way too hard even for trained AI-PhDs. [more] [robotwisdom.com]

    The thread a couple of weeks ago about the death of META headers will apply 1000 times worse for semantic tags-- if the semantic web is going to work at all it needs to start from headers describing the webpage as a whole.

    (Also, what's with XML-Journal's claim the article has three pages when it only has two?)

  • Why do people get so excited about XML? by Viol8 (Score:2) Thursday October 24 2002, @05:05AM
  • Bigger picture (Score:3, Insightful)

    by Cheese Cracker (615402) on Thursday October 24 2002, @05:05AM (#4520438)
    Look at the bigger picture of where Microsoft is heading. They're diversifying their line of business.
    In the past, MS Office was the cash cow at Microsoft, but the market for office packages is rather
    saturated... companies and governments are looking for cheaper alternatives etc. Not much room to
    grow. Now they can afford playing the good guys by opening up their file formats, since they got
    new markets to capture... mobile phones, handheld computers, home entertainment etc.
  • Information wants to be free by djupedal (Score:1) Thursday October 24 2002, @05:06AM
  • What we need is a ISO standard (Score:5, Interesting)

    by javilon (99157) on Thursday October 24 2002, @05:08AM (#4520448) Homepage
    The open office group should get together with the rest of the guys (abyword, koffice and maybe wordperfect) and work out a format that can be submitted to the ISO. Possibly based on the open office format.
    Then goverments and corporation will adopt it for official documents so they can read their own documents in ten years.
  • XML co-inventor- pleeease by Anonymous Coward (Score:1) Thursday October 24 2002, @05:12AM
    • 1 reply beneath your current threshold.
  • by cwernli (18353) on Thursday October 24 2002, @05:21AM (#4520476) Homepage

    any programmer with a Perl script and a bit of intelligence

    and I thought intelligence was a prerequisite to be able to handle perl ? :)

  • Incompatible XML? by Dexter77 (Score:2) Thursday October 24 2002, @05:26AM
  • Is he crazy? by Lumpy (Score:2) Thursday October 24 2002, @05:33AM
  • Political XML by Wheelie_boy (Score:2) Thursday October 24 2002, @05:38AM
  • by frleong (241095) on Thursday October 24 2002, @05:46AM (#4520535)
    Here at MSDN [microsoft.com]

    It is simply not what others is claiming: <?xml version="1.0"><data>blahblah</data>

  • Imagining Wonderful Things by superyooser (Score:1) Thursday October 24 2002, @05:56AM
  • Here is my idea of what they are trying to do by rikkards (Score:1) Thursday October 24 2002, @06:11AM
  • What are you all complaining about? (Score:5, Insightful)

    by nmg196 (184961) on Thursday October 24 2002, @06:50AM (#4520720)
    Microsoft is switching from a proprietary file format, to XML, and the first 100 comments are all flaming MS. WTF does it take to make you people happy?

    They've already shown with .NET that they can make an entire programming framework (and at least 3 assocated languages) into an open standard and even have them ratified by the ECMA and maybe even ISO. Because of this people have already managed to port Perl, Python and many other languages to this framework before it even came out of beta! The guys at Ximian [ximian.com] have even managed to port quite a bit of the framework itself as part of the Mono Project [go-mono.com].

    So perhaps instead of perpetually slating Microsoft, you could get off your arse and do something useful instead.

    Nick...
  • Well, that's "Embrace" taken care of... by Queuetue (Score:2) Thursday October 24 2002, @07:17AM
  • perversion of standards by crm114 (Score:1) Thursday October 24 2002, @07:22AM
  • arg! by photon317 (Score:1) Thursday October 24 2002, @07:26AM
    • Re:arg! by photon317 (Score:1) Thursday October 24 2002, @07:36AM
  • Word 11 by jasonditz (Score:1) Thursday October 24 2002, @07:42AM
  • Export to HTML... by Anonymous Coward (Score:2) Thursday October 24 2002, @07:46AM
  • by g4dget (579145) on Thursday October 24 2002, @07:48AM (#4521103)
    The most wonderful thing that would happen would be that people can finally dump that messy piece of software and move to a better toolset.

    Unfortunately, Microsoft won't let it happen. The data may be "in XML", but that doesn't mean you can read it or generate it well. Instead, Microsoft will give you just enough to serve their business interests and nobody else's.

    How? Office will probably stick undocumented base64 encoded binary stuff into the output, containing formatting information. You can use the document content, for example, with a database, but you can't load it into another word processor and preserve all the formatting. And in the other direction, sure, you can generate simple documents that Office will import, but you can't generate arbitrary Word documents--they will, again, have weird, undocumented tags and binary stuff.

    In short: don't hold your breath. Microsoft isn't stupid.

  • XML file on Windows Messenger by Elementalor (Score:1) Thursday October 24 2002, @08:13AM
  • Late to the party, but... by mmcshane (Score:1) Thursday October 24 2002, @08:15AM
  • XML not transparent by Anonymous Coward (Score:1) Thursday October 24 2002, @08:23AM
  • Hello World.MSXML by Tsali (Score:1) Thursday October 24 2002, @08:47AM
  • XML = gigabyte documents by edxwelch (Score:1) Thursday October 24 2002, @08:54AM
  • Bye bye excel by plopez (Score:2) Thursday October 24 2002, @08:55AM
  • OpenOffice is XML, now! (Score:5, Informative)

    by magi (91730) on Thursday October 24 2002, @08:59AM (#4521661) Homepage Journal
    Doing XML stuff with OpenOffice is supergreat. It took me half-an-hour to study the format enough to write a XSLT parser that extracts all strings from an OO document.

    Now I wrote, just for demonstration, the following XSLT example in just a few minutes, useable directly with xsltproc in Linux.

    The example prints all the Heading paragraphs in a OO Writer document, indented according to the header level.

    <?xml version='1.0'?>
    <xsl:stylesheet
    xmlns:xsl="http: //www.w3.org/1999/XSL/Transform"
    xmlns:office="ht tp://openoffice.org/2000/office"
    xmlns:style="htt p://openoffice.org/2000/style"
    xmlns:text="http:/ /openoffice.org/2000/text"
    xmlns:table="http://op enoffice.org/2000/table"
    xmlns:draw="http://openo ffice.org/2000/drawing"
    xmlns:fo="http://www.w3.o rg/1999/XSL/Format"
    xmlns:xlink="http://www.w3.or g/1999/xlink"
    xmlns:number="http://openoffice.org /2000/datastyle "
    xmlns:svg="http://www.w3.org/2000/svg"
    xmlns:c hart="http://openoffice.org/2000/chart"
    xmlns:dr3 d="http://openoffice.org/2000/dr3d"
    xmlns:math="h ttp://www.w3.org/1998/Math/MathML"
    xmlns:form="ht tp://openoffice.org/2000/form"
    xmlns:script="http ://openoffice.org/2000/script"
    version='1.0'>

    <xsl:output method="text" encoding="ISO-8859-1"/>

    <!-- Print all headings, indented. -->
    <xsl:template match="text:h">
    <xsl:value-of select="substring(' ', 1, (@text:level - 1) * 2)"/>
    <xsl:text>* </xsl:text>
    <xsl:value-of select="text()"/>
    <xsl:text>&#xa;</xsl:text>
    </xsl:template>

    <!-- Don't output any other text. -->
    <xsl:template match="text()">
    </xsl:template>
    </xsl:stylesheet>

    The result would be something like:

    * Top-level heading such as a chapter
    * Second-level heading (section)
    * Another section
    * Subsection
    * Subsubsection
    * Yet another section
  • Microsoft invented open formats! by cyba (Score:1) Thursday October 24 2002, @09:00AM
  • I'm not so pessimistic by phasm42 (Score:1) Thursday October 24 2002, @09:10AM
  • office HTML by avandesande (Score:2) Thursday October 24 2002, @09:14AM
  • Hmmm.... by BuffJoe (Score:2) Thursday October 24 2002, @09:19AM
  • Microsoft's intentions by affenmann (Score:2) Thursday October 24 2002, @09:21AM
  • Tim here with a bit more background (Score:5, Informative)

    by tbray (95102) on Thursday October 24 2002, @09:22AM (#4521858) Homepage Journal
    I've seen the native Word XML format (alpha mind you, so it might get changed). It isn't exactly pretty, and if I had to write code to extract all the paragraphs that contained the word "foo" in bold it would give me a bit of a headache, but I could do it.

    The word "foo" in bold single-underline looks something like

    <r>
    <rf>
    <rp class="bold" />
    <rp class="underline" lines="1" />
    </rf>
    foo</r>

    Yeah, it's pretty verbose.

    Near as I can tell, it is 100% round-trip-able, i.e. you save as that file format, you read it in again, you hit ctl-S and it saves again; about as good as a native format. Now someone needs to write some script-ware to run Word in batch mode to xml-ify server directories with zillions of office docsl

    I think the reason MS is doing this is obvious. Look at their financials - they *really* need people to upgrade to the new version of Office. End-users don't buy Office any more, CIOs and the like do. These people are just not gonna be impressed by another new word-processing feature, but they might be motivated to upgrade if they thought that they were opening up all their data to re-use by other programs.

    I expect that with any luck we'll get a secondary industry built around doing cool unexpected stuff to Office docs. Don't want to sound over-excited here, but a huge amount of all the intellectual capital in the world is sitting around in Office docs, and this makes it noticeably more re-usable. Has to be a good thing.

    Cheers, Tim
  • Adopt and extend by drxenos (Score:1) Thursday October 24 2002, @09:36AM
  • "Codename Office 11" by verloren (Score:1) Thursday October 24 2002, @09:45AM
  • Structure vs Presentation? (Score:3, Insightful)

    by 4of12 (97621) on Thursday October 24 2002, @09:47AM (#4522069) Homepage Journal

    MS Office saving its data in XML format is a great start.

    But will this really be enough?

    Previous complaints about how versions of Office didn't disclose the format were often referred to a specification that Microsoft made available to describe what was in a Word document.

    The key problem, IIRC, was the the description was not sufficient for one to predict how the Word document was actually formatted and rendered on the page.

    Because XML is very much like SGML or TeX, it has the potential for much more exhaustively describing document structure. But whether the new Word XML format (or OpenOffice format, for that matter) contains sufficient information for developers to reproduce the "right" format is a different issue.

    I hope I'm wrong and that the format is specified comparably to the level you'd find in say PostScript or PDF.

    Maybe MS is willing to let rendered Office douments change, just as HTML rendered documents change whenever one resizes the browser window.

    But I doubt it.

  • M$ XML by thesfinx (Score:1) Thursday October 24 2002, @09:57AM
    • Re:M$ XML by JDBrechtel (Score:1) Thursday October 24 2002, @10:42AM
  • I need to drink my coffee before reading slashdot by Servo (Score:1) Thursday October 24 2002, @09:59AM
  • Wonderful new things? by SnarfQuest (Score:1) Thursday October 24 2002, @10:01AM
  • XML had an inventor? by blair1q (Score:2) Thursday October 24 2002, @10:37AM
  • Wow! by grub (Score:2) Thursday October 24 2002, @10:45AM
  • Scripts, not GUI by captaineo (Score:2) Thursday October 24 2002, @10:49AM
  • 4 Words by M$ Mole (Score:1) Thursday October 24 2002, @10:51AM
  • Totally meaningless by ChaosDiscord (Score:2) Thursday October 24 2002, @11:17AM
  • MS should be more careful... by jonadab (Score:1) Thursday October 24 2002, @11:56AM
  • by bobaferret (513897) on Thursday October 24 2002, @11:59AM (#4523249)
    I think the reason that they are switching over is probably due to the trend in emerging foriegn markets. Peru being a prime example. Countries are starting to enact legislation that requires any government procurments of software to only be for software that uses an open file format. Due to the long term storage problems.
    This tied to the fact that US sales are going to slow down or are already, due to the complete inundation of PC, they need new markets, and unless they use an open format they won't be able to get them. I'd be panicked Linux and Java eroding their server market. Governments are eroding their Office market. They only way they can grow is add value.

  • Wow. by mindstrm (Score:1) Thursday October 24 2002, @12:01PM
  • Genuine XML? (Score:4, Interesting)

    by J. Random Software (11097) <ems@jrandom.com> on Thursday October 24 2002, @12:05PM (#4523305) Homepage
    Good in theory, but HTML support in Office 2000 was such a debacle that there are third-party tools designed just to unmangle the markup. They compltely ignored Processing Instruction syntax, which is intended to do just what they wanted, and
    <![if !supportEmpty Paras]>
    wasn't even well-formed SGML.
  • Great and wonderous things, indeed. by Eneff (Score:1) Thursday October 24 2002, @05:16PM
  • Microsoft reasons for doing this by kune (Score:2) Thursday October 24 2002, @05:32PM
  • A new "use" for XSLT by Brent_DS (Score:1) Thursday October 24 2002, @05:36PM
  • How come slashdot comments are duplicated in link? by crush (Score:2) Thursday October 24 2002, @05:53PM
  • Why Microsoft will do the right thing with XML by marhar (Score:2) Thursday October 24 2002, @06:16PM
  • Why is MS doing this...featuresets (Score:3, Insightful)

    by InnovATIONS (588225) on Thursday October 24 2002, @07:25PM (#4526346)
    By taking the initiative in this MS can create an XML schema that neatly includes ALL of the featureset and terminology of MS Word/Excel/etc.

    Which then by virtue of market share becomes standard. It is actually in their best interest to publish it clearly. Then the other potential competitors will feel strong pressure to fit their software to match MS and have no real excuse why they can't. If MS waited there would be some other standard emerging and MS would be pressured by customers to adopt it. Then it would be MS having to shoehorn its document logic into some other form and not the other way around.

    While other potential competitors are playing catch-up with making their documents fit into the MS schema MS can be busy thinking about the next thing to do.

    So frankly I expect the word document xml (and excel and the rest) to actually be quite clear and documented but very aligned to how MS Word sees a document, which will likely impress others as obtuse.

  • Yeah. Right. by rice_burners_suck (Score:2) Thursday October 24 2002, @09:10PM
  • Dreamy by abdulla (Score:1) Thursday October 24 2002, @10:08PM
  • Microsoft documents well-formed?? by BeforeCoffee (Score:1) Thursday October 24 2002, @10:56PM
  • Re:What?! by sirius_bbr (Score:1) Thursday October 24 2002, @04:14AM
  • Re:Whats wrong with html/css2 ? (Score:3, Informative)

    by Jugalator (259273) on Thursday October 24 2002, @05:34AM (#4520503) Journal
    Whats wrong with HTML and CSS2 for all your word processing?

    I don't think the new XML format is meant for documents you wish to publish on the web. Office already support the HTML format pretty well (with some extensions.. ahem) since Office 2000. HTML support works even better in Office XP since it allow you to save the document as "filtered HTML", where Office filters most of the Office-specific tags and attributes at the cost of loosing some information in the document.

    I think the XML format is being added since XML represent the document with a much more meaningful structure that's easier to parse by third party software for use in electronic commerce and other automated systems, something that's inappropriate to use HTML code for, as it was designed to make pretty layouts, not to describe content for easy parsing.

    I think it's pretty obvious why MS would want to add XML support - to spread their Office document format and make Office useful in places such as web services where it wouldn't be as useful before.
    [ Parent ]
  • Re:Whats wrong with html/css2 ? by captainclever (Score:1) Thursday October 24 2002, @07:03AM
  • Re:umm by Arimus (Score:2) Thursday October 24 2002, @07:34AM
  • 20 replies beneath your current threshold.
(1) | 2