Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×

Microsoft Claims OpenDocument is Too Slow 553

SirClicksalot writes "Microsoft claims that the OpenDocument Format (ODF) is too slow for easy use. They cite a study carried out by ZDNet.com that compared OpenOffice.org 2.0 with the XML formats in Microsoft Office 2003. This comes after the international standards body ISO approved ODF earlier this month." From the ZDNet article: "'The use of OpenDocument documents is slower to the point of not really being satisfactory,' Alan Yates, the general manager of Microsoft's information worker strategy, told ZDNet UK on Wednesday. 'The Open XML format is designed for performance. XML is fundamentally slower than binary formats so we have made sure that customers won't notice a big difference in performance.'"
This discussion has been archived. No new comments can be posted.

Microsoft Claims OpenDocument is Too Slow

Comments Filter:
  • by NeoTron ( 6020 ) <kevin.scarygliders@net> on Friday May 26, 2006 @09:20AM (#15409073) Homepage
    I for one care not, especially since the study has been done by none other than the ever so neutral ZDNet. I mean, they're not Microsoft biased one little bit.

    *cough*

  • by ronanbear ( 924575 ) on Friday May 26, 2006 @09:24AM (#15409101)
    Article is pure FUD. They run a series of articles attacking OOo OOo is much slower than Office at the moment. It has nothing to do with ODF versus Open XML. In any case the comparisons with Office's current implementation of XML and ODF (under OOo) were much closer.

    When I used OOo I didn't think it was fast but it was nowhere near as slow or as much of a memory hog as this test found.

  • by albalbo ( 33890 ) on Friday May 26, 2006 @09:27AM (#15409124) Homepage
    It's actually likely they're slightly faster for spreadsheets. For example:

      * they use single-letter tag names, for the most part, to reduce parsing time
      * they remove all strings and put them in a look-up table

    I'm not sure how much difference these things actually make in practice, but there's probably a little speed there.

    What's not fair is to compare OOo to Microsoft Office, and determine the speed of OpenDocument versus OXML based on that...
  • by ceeam ( 39911 ) on Friday May 26, 2006 @09:34AM (#15409174)
    ODT format is basically a set of XML files packed into a ZIP archive. One of them is "the biggie" (content.xml) and others are for supporting it. Images etc are saved/packed in subdirs. Now - to open it, OOo apps should unpack the whole package and parse XML keeping all its contents in memory (presumably, but highly likely). Maybe not a big deal if all you handle is two page memo but keep in mind that OOo's spreadsheet and database(!) programs work the same way. And for something like 20-30 page specs sheet on a sub-1GHz machine OOo works noticeably faster when handling DOC format documents than handling its "native" ODT documents. Saving/autosaving can be a pain too (as you should dump all you document to XML and pack it. Unlike MSOffice where storage formats work as database).

    All in all - OOo's file formats are a nice and simple solution for exchanging reasonably sized documents (if you don't mind usual XML-namespace-hell structure) but for editing/working on larger documents/spreadsheets you may find yourself using MSOffice document formats (from within OOo). Pity they don't provide their own "scratch-pad/database-in-a-file" formats.

    So - for once, Microsoft is kinda right here.
  • Re:False dichotomies (Score:4, Informative)

    by LiquidCoooled ( 634315 ) on Friday May 26, 2006 @09:42AM (#15409235) Homepage Journal
    Actually the problem is not binary versus none binary, its fixed length versus variable length fields and records.
    With old style formats, you knew that the header was 512bytes followed by 600 bytes of meta data, followed by the document sections which all indicate their size (or have some way of calculating it based upon the block type)

    With XML, you get a tag opening and have to parse until the closure, this adds a lot to the complexity of reading.

    Writing is slightly different, and should infact be simpler with XML even though it may be more verbose, you don't need to buffer the entire block or rewrite the section header to indicate the length, you just happily do a sequential write.
  • by 'nother poster ( 700681 ) on Friday May 26, 2006 @09:44AM (#15409252)
    Ok, maybe I'm missing it. What does the OP comment have to do with the fox and the grapes? He is not lamenting or deriding something he can not have.
  • Re:False dichotomies (Score:5, Informative)

    by nstlgc ( 945418 ) on Friday May 26, 2006 @09:48AM (#15409280)
    +5 Insightful? Oh PLEASE!

    ODT XML files are binary files. So are old Word 2003 .doc files. So are Microsoft's new XML files. So it's pointless to claim that a "binary" file format is faster than an XML file format.
    When people say "binary files" they mean this as opposed to "text files", a seperation that stems from the ability to open a file for in "binary" or "textfile" modus in several APIs. Has to do with, amongst others, interpretation of control codes such as ^Z.

    The other big mistake: file formats aren't fast or slow. The algorithms for reading and writing them are (or aren't) slow.
    *slaps cheek* NO WAI!

    You fail to see the point of what they're saying. They're saying a binary file, with a header and fixed data structures, are alot easier to read & parse than an XML file, which consists of structures of variable length, needs to be interpreted, etc etc etc. This is a problem with XML.
  • Re:MS App Tweaks (Score:3, Informative)

    by bucky0 ( 229117 ) on Friday May 26, 2006 @09:52AM (#15409312)
    Hey, I keep hearing about this but can't find some actual proof. Do you have a link?
  • I've noticed that Word will stream open a large DOC file, so that you can start to work on it before it's been entirely loaded -- similar to a web page.

    DOC files don't so much as stream as open for Random Access. They're structured in such a way that the information is stored as an object heirarchy scattered across the file. This makes saving faster because only the changes are saved to the file. It also make opening faster, because Office only needs to pull up the information that's on the screen at the moment. (Even if it's at the end of the document.) PDFs work in a similar, but more structured, fashion.

    The unfortunate fact about ODF is that it requires a complete decoding of the file when loading, and a complete reencoding of the file when saving. However, I don't see any reason why Microsoft can't just add ODF support and make it an optional format. Computers are fast these days, and it should be up to the user to decide whether he needs the performance provided by the MS DOC *cough* "standard".

    Or in other words, Microsoft is grasping at straws, trying to find a reason why they shouldn't support opening and saving of ODF files. I feel so sorry for them. (Not.)
  • Everything that OpenOffice needs must be included with the application and loaded when the application loads. The opposite is true with Office. The majority of the application is already in the operating system. This is why OpenOffice is cross platform and Office is not.

    OpenOffice has its own fonts and font engine, though it can utilize others. Office uses the OS's font engine but adds fonts to the OS during installation.
    OpenOffice has its own engine to place, draw, clip... windows/forms. Office uses the OS's.
    OpenOffice has its own database engine, though it can use several others. Office uses jet which is part of the OS.

    The list goes on...

    If the file format was supposed to be tested for perfomance then they should have used the two different formats with the same application.
  • by gatzke ( 2977 ) on Friday May 26, 2006 @10:03AM (#15409381) Homepage Journal

    MS has a known history of paying people for grassroots positive PR online. That is where we got the term astroturfer from.

    So go ahead and defent MS on the merits. Convince me a .doc file is a better format. Tell me why we should not have open standards. Defend the indefensible. Most of us will poke holes in the silly arguments and some will call you names.

  • by The OPTiCIAN ( 8190 ) on Friday May 26, 2006 @10:22AM (#15409530)
    This reminded me of this paper [labri.fr], "The Psychology of Learning". In it the writer describes the act of people who don't want to learn new things: "As long as everybody around them use tools, techniques, and methods that they themselves know, they can count on outperforming these other people. But when the people around them start learning different, perhaps better, ways, they must defend themselves. Other people having other knowledge might require learning to keep up with performance, and learning, as we pointed out, increases the risk of failure. One possibility for these people is to discredit other people's knowledge. If done well, it would eliminate the need for the extra effort to learn, which would fit very well with their objectives."

    This issue is about Microsoft defending their turf rather than not wanting to learn something new. But it's basically the same motive at work: find ways to undermine the new to benefit the old.

    It goes on, "This model of learning also explains other surprising behavior that I frequently observe. I have seen novices in software development with knowledge of a single programming language explain to experienced expert developers why their choice of programming language was a particularly bad one. In one case, I talked to a student of computer science who told me why a particular programming language was bad. In fact he told me it was so bad that he had moved to a different university in order to avoid courses that used that particular language. When asked, he admitted he had never written a single program in that language. He simply did not know what he was talking about. And he was willing to fight for it. With respect to programming languages, negative opinions about a language that a person does not know, are usually based on very superficial aspects of it. To people obsessed with performance lack of such in a programming language is a favorite reason to advocate its eradication (even though performance is not a quality of a language, but of a particular implementation)."

    The positive lesson to take away from this is the MS is undoing itself. It's turning to cheap, nasty, suit-driven mentalities to defend its turf rather than the old days when it would just go out and write something new and nasty. It's become an unwieldy beast. I read about the Vista delays yesterday and briefly thought "Will anyone notice - who uses Windows these days". To an extent it shows what a bubble I live in. But it's true - *all* of my regular contacts use linux, freebsd or mac os x. As they should. After all - friends don't let friends use Windows.
  • by jackjeff ( 955699 ) on Friday May 26, 2006 @10:36AM (#15409605)
    I read this article a couple of months ago that compare M$ XML and OpenDocument

    http://www.groklaw.net/article.php?story=200511251 44611543 [groklaw.net]

    For those who are too lazy to read, here's a brief summary of the main differences between the two formats:
    - m$ tags are 2-3 letters long and not readble
    - m$ format looks more like a dump of the binary structure, and makes no attempt to separate content and style

    The author was already feeling the size argument coming for m$ format, which is nonsense because both formats are compressed anyway and a XML should be readable.. but somehow, he was not expecting the "speed" issue.

    Come on. If you wish something "efficient", use a binary format. If you start having a textual XML + compression, then obviously speed is not your concern. What's your concern then? Readability, processing by third party tools. In that case separation of content and style is more important. Who cares that "stuff" is written in Helvetica 12 black. I personally prefer to know it's a "title". And so on..

    As for the speed, on today's computer which are virtually 1000x faster than required for typesetting document, this is laughafable. In addition, for large documents, I know many "word" addicts who separate documents in 100pages portions or so, because it become impossible to handle...

    What I think about m$ XML, is that. well. it's not that bad. Even though not really "open", it's still better than before. But comon. This was done in a "rush", to fight back open document initiative. And in that case, dumping dummily the "internal binary structure" into a XML document was making more sense for them. There's nearly no development cost involved (no reasearch whatsoever) and it could be implemented very quckly.

    Then Yates come and talk about "customer experience" (cf ZDNET article).. This is laughfable.

    Regarding "customer experience", when will word support a real vector image format (no WMF crap please). like let's say EPS/PS/PDF... ? I personally hate having to make a raster of my images and make the word document explode in size (when i'm FORCED to use word).

    2030?
  • Re:MS App Tweaks (Score:3, Informative)

    by shotgunefx ( 239460 ) on Friday May 26, 2006 @11:34AM (#15410029) Journal
    They did a bunch of stuff like this. They'd not document certain features, under the guise of "If we document it, we'd need to support it". Version after version of DOS, they'd still be in there. They'd make use of them for an edge.

    Hence, me having to buy books like Undocument DOS" [amazon.com]

  • Re:MS App Tweaks (Score:3, Informative)

    by Gallenod ( 84385 ) on Friday May 26, 2006 @11:40AM (#15410082)
    Much of this happened before the Web was mature enough to capture and document the discussion. There's good link in one of the other replies, but here are a few more:

    http://www.newsfactor.com/story.xhtml?story_id=283 96 [newsfactor.com]
    Link to a new item about a lawsuit Novell filed in 2004 alleging OS-level sabotage. It does point out that WordPerfect's main problem was lack of a Windows version, but it also alleges Microsoft indulged in some software sabotage.

    http://www3.gripe2ed.com/scoop/comments/2005/10/24 /9814/8315/20?mode=alone;showrate=1 [gripe2ed.com]
    An anonymous posting to Ed Foster's Gripelog by someone who claims his wife was a WP beta tester. Mentions the undocumented API issue but does point out it has never been proven sufficiently to allow companies to sue MS for damages. Blames a lot of the troubles with both MS Word and Wordperfect on memory management issues, which is a valid shot.

    But the most interesting is this analysis of the MS anti-trust trial written by Ralph Nader (admittedly no friend of any monopolist, but a guy who does his homework): http://www.cptech.org/ms/harm.html [cptech.org]. When you get far enough down in the article, you'll find this quote:

    But, as Judge Jackson points out, and as most computer experts know, not all of the quality problems are innocent. In its internal emails and by countless examples, Microsoft has demonstrated that it believes it benefits when consumers cannot make competitor's products work correctly. Microsoft has a range of methods to undermine its competitor's products. When it does not use deliberate sabotage, it can withhold important technical information or refuse to license technology to its competitors, such as when it refused to permit Netscape to distribute a utility to log-on to Internet Service Providers, or when it withholds or unexpectedly changes applications programming interfaces and data file formats.

    The reason Novell included intentional sabotage in their suit was becuase of evidence submitted from the anti-trust trial. Again, there are only indirect references to the practice in the trial evidence, not explicit evidence from the OS code itself, but when has anyone who hasn't signed a non-disclosure agreement really gotten a good look at what's under Windows' hood?

    Does it pass the test for "beyond resonable doubt" -- probably not. However, "preponderance of evidence" only requires 51% certainty. There are quite a few people who will look at the trial evidence and Microsoft's behavior in other areas and pass that 51% mark.

  • but for editing/working on larger documents/spreadsheets you may find yourself using MSOffice document formats

    TeX consists of long streams of ASCII bytes and offer no random-access abilities whatsoever except those implemented by a text editor and the underlying filesystem. And yet, LyX, which can easily handle thousand-page documents, loads and saves nearly instantaneously.

    Your complaint is really over the relative brokenness of two major office suites, not the inherent advantages of their document formats.

  • Re:Uhmm... (Score:2, Informative)

    by pembo13 ( 770295 ) on Friday May 26, 2006 @11:57AM (#15410187) Homepage
    Maybe a little less instant messaging time for the employees then.
  • by symbolic ( 11752 ) on Friday May 26, 2006 @01:59PM (#15411103)

    I seem to remember a rather depressing benchmark with respect to how fast OOo was able to save and re-open a large spreadsheet- and how much memory was required to do so. The results were not pretty, and would have definitely qualified as something that goes into the "must improve asap" category. I use primarily open source apps, but I have to admit that this performance benchmark was a little disappointing. Here's a to a related ZDNet article: http://blogs.zdnet.com/Ou/?p=119 [zdnet.com]
  • by illumin8 ( 148082 ) on Friday May 26, 2006 @02:23PM (#15411266) Journal
    Don't forget you have automatic saves every 10-15 mins and when that takes more then just a second or two it is a really major pain and interruption to the job.

    Dude, I hate to break it to you, but this is 2006. We've had multi-threaded applications for how many years now? Spin off another thread for the auto-save process. Word already does this.
  • by zenhkim ( 962487 ) on Friday May 26, 2006 @02:52PM (#15411490) Journal
    > Honest question - why do you, like so many others, assume that anyone who defends MS (or $otherHatedCompany) is a shill?

    Uh... why did the villagers assume that the little shepherd boy was lying the third time he cried "wolf"?

    Because by then he had lost credibility with them. It took only two times for that to happen. How many times has Micro$oft deceived the public? Let us count the ways....

    http://www.inlumineconsulting.com:8080/website/msf t.shilling.html [inlumineconsulting.com]

    More to the point, the author closes his article with an answer to your (rhetorical?) question:

    > CONCLUSION
    > The alert reader cannot believe any pro-Microsoft opinion presented in any forum.
    >
    > I remain morally certain that some people hold legitimate pro-Microsoft opinions, with better or worse justification. Microsoft, or its public relations company(s), have so muddied the water with all the shilling and astroturfing that a neutral observer cannot determine whether a paid shill produced an arbitrary pro-Microsoft opinion as propaganda, or a random person produced it as his or her own opinion.

    The little shepherd boy lost his flock because of his dishonesty. The Micro$oft corporation lost its credibility with the tech-savvy because of its dishonesty. I have sympathy for neither.
  • by antiMStroll ( 664213 ) on Friday May 26, 2006 @03:21PM (#15411733)
    For the majority that probably means using Word at work, which further means documents are probably on a server. By far the biggest headache is the way Windows locks the app, and often every other network aware software running concurrently, until the network transaction in focus is complete. Exchange does it, as does IE. These user lock-out so far exceed the delay opening an OpenOffice document it takes cajones the size of moons for Microsoft to point fingers at ODT. Come back when you've fixed those issues, by then ODT will certainly be faster.
  • by ShieldW0lf ( 601553 ) on Friday May 26, 2006 @04:17PM (#15412182) Journal
    This stuff doesn't even make sense.

    OpenOffice uses ODF. Office uses binary formats. The performance analysis quoted doesn't compare ODF and OpenXML. It states right in the article:

    Here is a comparison with the standard 16-sheet SXC and XML sample file I've been using. The sample is in compressed XML format because it is smaller and easier for you to download. You'll have to convert the XML file to XLS and the SXC file to ODS to run the following test yourself.

    XLS is a binary format. This study is irrelevant to the statements made. And it's the only data given to substantiate the claims made. So there is no data given at all.

    All you can conclude from this is that OpenOffice 2.0, retrofitted recently for ODF, is much slower in a windows environment than Office 2003 using binary file formats. A far cry from any statements made either by Yates or by the summary.

    What a pile of crap journalism.

Real Programmers don't eat quiche. They eat Twinkies and Szechwan food.

Working...