ODF/OOXML technical white paper

A white paper based on a technical comparison between the ODF and OOXML formats

Download the whole article as PDF

Write a full post in response to this!


I was asked by the UK Action Group of the Open Document Format Alliance to write a white paper on the technical differences between ODF and OOXML. After much agonizing, correcting, having others correct my mistakes, suggestions, changes and drafts I still have got something that may be alright to be previewed by all. The actual documents are in ftp://officeboxsystems.com/odfa_ukag both as a “PDF” and an “ODT” (Open Document Format).

The following is a transcribed version of the white paper. Although it has all the Free Software Magazine formatting constraints which means that the information is not as clearly presented, so therefore I recommend you to download the document from the above URL. It is here primary for reference purposes.

Enjoy.

Technical Distinctions of ODF and OOXML

ODF Alliance UK Action Group

A Consultation Document

by

Edward Macnaghten

Introduction

For many years various computer software vendors produced office software. However, it was difficult for a user of one application to reliably manipulate documents that were created by another. The file format specifications were closely guarded secrets, and for interoperability to exist developers would spend large amounts of resources trying to decipher complex “file dumps”.

Obviously a standard was needed, and ODF came to the fore. Soon after, Microsoft suggested that its own OOXML should perform that role instead.

An important point of any standard is the desirability and ease with which it can be implemented. A standard that cannot be implemented by all interested parties within the realm it applies to makes it virtually useless, and only serves to hinder interoperability rather than promote it.

The technical philosophy of the design of a standard is therefore important.

This document reviews the ODF and OOXML formats. It examines some of the nuts and bolts as to how the files are actually formed and the technical merits (or otherwise) of what is created.

This only examines the technical issues; those concerning intellectual property and legal issues are dealt with elsewhere.

ODF Background

History

ODF is a format developed as a vendor neutral standard. However, it did not originate from anywhere inparticular but rather through a process of evolution [1]…

  • In 1999 StarDivision began work on an XML interchangeable file format for their StarOffice product.
  • In August that year StarDivision was acquired by Sun Microsystems.
  • In October 2000, Sun Microsystems released large amounts of the source code to the community driven OpenOffice.org project under an open licence. At the same time an XML community project was created with the goal of defining an XML specification for the interchangeable file format.
  • In May 2002 OpenOffice.org version 1.0 and StarOffice version 6 were released using this XML format (SXW).
  • Also in 2002 collaboration began with other office suites, notably the KOffice project, to further refine the interoperability of the format.
  • In December 2002, OASIS had a conference call announcing the creation of what is now the ODF standard.
  • Then, from 2002 to 2004 the format was overhauled based on experiences to date and examination of what is required in an office format.
  • In December 2004, a second draft of the XML file format was approved by OASIS for review.
  • In February 2005, a third draft was published for public feedback—six years after commencement of the project and five years after public consultation began.
  • In May 2005, the ODF format was approved as an OASIS standard [2].
  • Soon after that, many office suites adopted the standard as a means to store the documents [3].
  • In September 2005, ODF was submitted as ISO standard.
  • In May 2006, ODF achieved ISO certification (ISO/IED 26300).
  • In February 2007, OpenDocument Version 1.1 had been approved by OASIS [4]. It particularly addresses accessibility issues.
  • ODF throughout continued and continues to grow in popularity and support [5].

Philosophy

The phiosophy behind the format was to design a mechanism in a vendor neutral manner from the ground up using existing standards wherever possible. Although this meant that software vendors would need to tweak their individual packages more than if they continued down their original routes, the benefits of interoperability were understood by the participants to be important enough to justify this [6].

This is reflected in the specification in many ways, but specifically:

  • XSL:FO—Formatting
  • SVG—Scaleable Vector Graphics
  • MathML—Mathematical formulas
  • XLink—Embedded links
  • SMIL—Synchronized Multimedia Integration Language
  • Xforms—Forms definitions

OOXML Background

History

OOXML is the format Microsoft developed for its Office 2007 suite. The development of OOXML was not performed with visible public consultation. It is difficult to determine when development of the format started. However, the following is known:

  • Microsoft used an XML format as an option in their Office 2003 suite, though it was not the default format. It was not compressed and all data was stored in the single XML file, binary data, such as pictures, being represented as BASE64 strings in special binData tags. A strict licence governing use of this format was included. This was released during 2003.
  • Microsoft started development of Microsoft Office 12, which would be known as “Microsoft Office 2007”. They developed a new XML schema (OOXML), probably using the Office 2003 XML files as a base. Like ODF, the format stores the data in a number of files embedded in a .zip file. No visible consultation with the public occurred during its creation.
  • Under pressure, Microsoft relaxed its licensing terms in January 2005. However, they were not relaxed enough to permit free software to use it [7].
  • Shortly after the Commonwealth of Massachusetts’ Information Technology Division chose OASIS ODF as a standard for its documents Microsoft announced that it would be submitting its format to ECMA for standardisation [8]. Later that month (November 2005), Microsoft relaxed the licence further as well as placing a mutual Non-Assertive-Clause for the patents they held on it [9].
  • In December 2006, ECMA approved OOXML as a standard (ECMA 376) [10]. This was done after no visible public consultation, and approximately two and a half years of development since releasing their previous (Microsoft Office 2003) closed XML format(10). It was almost immediately submitted to ISO to be certified as an international standard later that month.
  • In January 2007, a draft of over 6000 pages was released to members for 30 days to analyse and submit “contradictions”.
  • Microsoft released Microsoft Office 2007 in January 2007.
  • In February 2007, an unprecedented twenty countries submitted comments and contradictions to the standard.

Philosophy

We are of the view that the format appears to be designed by Microsoft for Microsoft products, and to inter-operate with the Microsoft environment. Little thought appears to have been exercised regarding interoperability with non-Microsoft environments or compliance with established vendor-neutral standards [11].

Don't miss out on the other pages!
123456789next ›last »

Write a full post in response to this!

Similar articles

0

Do you like this post?
Vote for it!

Copyright information

Permission is granted to copy, distribute and/or modify this document under the terms of the GNU General Public License, Version 2 or any later version published by the Free Software Foundation. A copy of the license is available at http://www.gnu.org/copyleft/gpl.html.

Biography

Edward Macnaghten: Edward Macnaghten has been a professional programmer, analyst and consultant for in excess of 20 years. His experiences include manufacturing commercially based software for a number of industries in a variety of different technical environments in Europe, Asia and the USA. He is currently running an IT consultancy specialising in free software solutions based in Cambridge UK. He also maintains his own web site.

Anonymous visitor's picture

Informative

Submitted by Anonymous visitor (not verified) on Thu, 2007-04-12 01:45.

Vote!
0

The works that you publish here are very informative. I thank you very much that you choose to make them available to us. Thanks to your work, I am enlightened to the differences between ODF and OOXML and their suitability to function as a standard general purpose electronic document format. Judging from what you've presented, ODF is more suitable for this purpose and OOXML seems to be designed to be backwards compatible with older Microsoft Office formats rather than be suitable as a general purpose document format.

Anonymous visitor's picture

Tssss

Submitted by Anonymous visitor (not verified) on Sun, 2007-04-22 08:57.

Vote!
0

Stopped reading after the first page.
Already in the history part you seem to falsify the info you are giving.

You should be well aware if you did any specialisation on the subject that Micrsoft had their XML format out in the open already in 2001 when it appeared in an Office beta which suggest they also started with that format in 1999 or 2000 at a simular time as StarOffice.

Also you forget to mention that when OASIS started on the format and for the first two years their format was known as the Open Office XML format and only end of 2004 after being suggested by the EU they considered it for submission to ISO and changed the name to OpenDocument.

The whole first page of this "white paper" is so dripping with bias that it is truly not worth reading any more. I read some of your stuff before but I had not expected you to smei-falisfy the info just to put your por-ODF or what now looks like an anti-OOXML view forward !!!

Anonymous visitor's picture

Tongue in cheek

Submitted by Anonymous visitor (not verified) on Sun, 2007-04-22 17:49.

Vote!
0

I think that the term "white paper" was being used tongue-in-cheek by the author to dig at the way there are so many Microsoft funded "white papers" out there which are completely biased and contain lies (whether it be by falsified or misrepresented statistics, omissions of important details, or just outright lies).

The details that you have complained about are at best picky and at worst pedantry. The point is Microsoft is out for number 1 = Microsoft. It serves their interests to keep everyone using their products and they will do anything to destroy the competition whether through marketing/misinformation (like their "white papers"), buy-outs, legal action, or changing the laws of the land by bribing politicians.

Mitch Meyran's picture

Tssss yourself!

Submitted by Mitch Meyran on Wed, 2007-05-02 14:03.

Vote!
0

Considering that OOXML is very far from the XML used in Office 2003, frankly, I'd say that this early format was a dud. Moreover, at that time this 'XML' format wasn't XML-conformant - as in, it probably worked with MS's XML v3 parser, which isn't even XML compliant.
Second: of course, since the base format for ODF was OpenOffice.org's document format, which WAS (in 1999) XML-based - so it was Open Office XML, as in OpenOffice's XML. Said format was already parsed by several applications, and OASIS' goal was to extend it so as to cater to as many people as possible. Note that Microsoft was part of OASIS at the time. Strangely, they never mentioned anything during ODF's design that hinted at incompatibilities with their suites. Now they have the gall to say that ODF can't reproduce their documents faithfully, and that they weren't asked for advice when drafting ODF.

Had you read the whole article, you'd see that said bias has a source: OOXML is awkward, designed for Microsoft products to the exclusion of all others, and would be, as such, unworthy of being called a standard. This has been described in details with practical examples. As such, I'd respectfully recommend you get your head out of that place where the sun doesn't shine and read the whole of it - you might learn something that isn't mentioned in a MS Office advertisement for once.
---
A computer is like air conditioning: it becomes useless when you open windows.

Laurie Langham's picture

OO(?)XML

Submitted by Laurie Langham on Mon, 2007-05-07 08:34.

Vote!
0

I have to wonder if the very selection of the name OOXML wasn't a deliberate attempt by Microsoft to try to create confusion in the public mind about the OOo of the Open Office.org. Imagine the reaction if Open Office had tried to call their word processor M$ Terd, or something similar.

John Drinkwater's picture

A few mistakes...

Submitted by John Drinkwater (not verified) on Thu, 2007-05-03 12:32.

Vote!
0

Looks like a reasonable article, but I've already seen a few minor issues that bother me (typos mostly).

page 4: table:default-cell-style-name="Defaul" <- should be Default?

page 4/page 5: "Dates are UK representations (DD/MM/YYYY)"
If so, why are the dates "3/5/2007" ? for a date of
office:date-value="2007-03-05", that would be 05/03/2007 in the UK system.

Edmundo Carmona's picture

Expect a response...

Submitted by Edmundo Carmona on Sat, 2007-05-12 18:31.

Vote!
0

If I were you, I would expect a fuming response from George Ou, Paul Thurrot, Paul Murphy, Rob Master-of-the-IT-Universe Enderle or any other MS shill that will tell you how perfect OOXML is and how there's no need for another standard (specially a Free standard life ODF). :-)

Anonymous visitor's picture

evden eve nakliyat

Submitted by Anonymous visitor (not verified) on Mon, 2007-07-09 20:10.

Vote!
0

yes all right... thanks...

hAl's picture

This 'white paper' is just for critisising OOXML

Submitted by hAl (not verified) on Fri, 2007-07-27 13:15.

Vote!
0

This white paper is written very poorly.

Firstly this document is indeed serieusly misrepresenting history history on OOXML suggesting ODF has a muc longer pedigree where both XML office formats started about 1999-2000.

Secondly it is putting examples up from OOo and MS Office which allthough using each format are not per se good implementations of each spec and also with OOo at least two years longer to implement an implementation of ODF.

Thirdly allthough ODF is supposed to support SVG the example show that pasted SVG in ODF (which is supposed to support SVG) is embedded as a seperate file which is probably also allowed in ODF but makes the whole reusing of SVG in the spec sort of redundant where the pasted SVG is actually converted to VML in MS Office 2007 and becomes a native part of the document. So even allthough ODF has native SVG support the ODF implementation forgets that whilst the OOXML implementation of MS Office is the one to recognise the SVG correctly and convert it to native VML. This example only shows why it is sucky to use a comparison in ready made implementations in stead of comparing the actual spec.

Fourthly this document supports reuse of standards like SVG and MathML which is basicly a good thing but does not notice that these are not document standards but webstandards that for instance do not support the rest of the ODF specs tags for revisions, bookmarks, footnote or anthing else that might combine into the markup. when revising a single digit of a math formula in an ODF doucment that would therefore require the entire math to be marked as a revision or (if possible) the math to be split up in multiple parts.

Fifthly this document suggests that using shorter tags in OOXML requires more tags ? However this is not substantiated. Also it seems the OOXML spec mainly uses short tags for those item that are uses most like the main namespaces w=wordprocessingml and command tags for line, rows, colums and cells and uses longer descriptive tags for less frequent items. Also OOXML tagging use shows the relation between certain parent and child elements actually making implementation easier rather than harder as this document suggests.

Sixtly the document does mention the redundancy of data in the spreadsheet cell as an advantage because the formatted version of the data is easily retrieved. That is correct but also it requires any implementation to make certain that the data and it's formatted version are always synchronised. The ODF spec does not state how to deal with inconsistensies in the cells value and representation or whether a document should verify on opening that these are correct. An application not verifying this could potentially be at risk from hidden data in spreadsheet value item. Especially at risk when this that value for instance could contains (references to) script or embedded executable elements.

Sevendly the document suggests that OOXML cannot handle ODF Office settings whilst ODF could handle OOXML compatiblity settings. However the author seems to be unaware that OOXML actually contains a feature that is called custom settings. By creating a namespace and XML with your implemtations seetings you can add custom application settings to OOXML as well and even make them validate using a schema. Particularly bad in this white paer is that it suggest its wrongly perceived lack of compatibilty with ODf settings as a reason not to choose for OOXML. Very poor thinking as OOXML actually has built in extensibility even beyond it's custom setting that could also manage any compatibility issue with ODF.

Eigthly this document suggest an inconsistency with ISO 216 (paper sizes). OOXML actually support non ISO paper formats which is as it should be because those iso formats are not covering all common office paper formats in use. ODF uses a flexible paper format allowing for even more variation of paperssize but also possibly requiring more implementation for validating whether a given paper size size is actually available and or how to handle near exact sizes. With ISO 216 not being the only papers in use in for Office documents inconsistensies seem normal.

Nigntly The documents objects to OOXML referencing windows meta files and clipboard objects as they are propriety windows related formats. However these are only mentioned as optional enumeration values for embedded items. So these values can even be used on a linux platform to easily identify that the document contains embedded objects it cannot handle without having to open the embedded file. If an ODF file contains these windows related items (which it also can) it might have to verify the embedded filetype purely on it's extention or even open the embedded file. On the other hand the dismisses the references to java in ODF's java applets because the java standard is supposedly well documented. However it fails to mention that any implementation of native embedding of java applets actually requires a java virtual machine which for instance Microsoft isn't allowed to make themselves after it's legal issues with Sun (who were in part responsible for the addition of java appplets to the spec) making full implementations dependant on 3rd party Java VM licensing.

This document presents itself as an independant white paper and suggests to look at technical merits as well as flaws but actually is a document looking for technical flaws in OOXML comparing those unfavorably to ODF whilst ingnoring flaws in ODF (for instance the lack of spreadsheet formula's !!). Also on the merit side it is generally only ODF merits that are mentioned.

hAl

Anonymous visitor's picture

I have seen so many MS

Submitted by Anonymous visitor (not verified) on Tue, 2007-07-31 12:08.

Vote!
0

I have seen so many MS evanaglists going around tom tomming about the spreadsheet formulae... but what use are they if they cannot map into the legacy binaries. Secondly what ODF compatibility is being referred to. Anyone making such statements is nothing but a copyprinter of MS's baseless claims and not worthy of being called a technologist. Where is the demonstrable evidence of OOXML supporting ODF formats.. and please do not refer to MS sponsored Novell's ODF converter project....

Edward Macnaghten's picture

OOXML deserves the criticism

Submitted by Edward Macnaghten on Wed, 2007-08-01 19:31.

Vote!
0

Dear hAl
Thank you very much for the commenting on this article. All feedback is greatly welcome, and you have obviously taken a lot of time and effort in your comment. I appreciate this greatly. However, you have raised some interesting points that I feel I must address forethwith.

"Firstly this document is indeed serieusly misrepresenting history history on OOXML suggesting ODF has a much longer pedigree where both XML office formats started about 1999-2000."

The XML that Office 2003 uses is greatly different to the MS-OOXML submitted to ECMA. It was neither the default format used, nor was it adopted to any great degree, and nor could it be used to save and restore documents reliably. It was also developed behind closed doors with little community interaction. This greatly differed from the StarOffice then OpenOffice formats where they developed their XML standard from the ground up to be interoperable, and refined it with close co-operation with the community.


"Secondly it is putting examples up from OOo and MS Office which although using each format are not per se good implementations of each spec and also with OOo at least two years longer to implement an implementation of ODF."

The examples given are not contrived. It is the ODF and MS-OOXML XML representations of the meat of the said documents. I am afraid I neither understand the point not relevance of your OOo reference. What two years? Longer than what? At time of writing of the article NO application supported MS-OOXML, However, the white paper compares the formats, not applications, so this is digressing anyway.


"Thirdly although ODF is supposed to support SVG the example show that pasted SVG in ODF (which is supposed to support SVG) is embedded as a separate file which is probably also allowed in ODF but makes the whole reusing of SVG in the spec sort of redundant where the pasted SVG is actually converted to VML in MS Office 2007 and becomes a native part of the document. So even although ODF has native SVG support the ODF implementation forgets that whilst the OOXML implementation of MS Office is the one to recognise the SVG correctly and convert it to native VML. This example only shows why it is sucky to use a comparison in ready made implementations in stead of comparing the actual spec."

The example I guess you are referring to is the "Dalek" picture embedded in the "Word Processing File". This is a "Jpeg" picture embedded in the document. Both formats permit this, but the manner ODF does this, using a "draw:frame" tag and a "draw:image" tag is considerably more elegant and interoperable than MS-OOXML's "v:shapetype", "v:stroke", "v formulas", eleven "v:f" (with the cryptic "eqn" attributes), "v:path", "o:lock" and "v:shape" tags in MS-OOXML.

You also say that: "... So even although ODF has native SVG support the ODF implementation forgets that whilst the OOXML implementation of MS Office is the one to recognise the SVG correctly and convert it to native VML". This sentence is somewhat nonsense. What do you mean the "ODF" implementation? ODF is a format. Also I am very happy for MS-Office in that it can convert SVG formats to VML, though the functionality of the applications are not relevant to the formats themselves. I feel I need to say though that I would be a lot happier if MS-Office actually saved files in the more interoperable SVG, that has been accepted as a standard by W3C, whereas VML has not.


"Fourthly this document supports reuse of standards like SVG and MathML which is basicly a good thing but does not notice that these are not document standards but webstandards that for instance do not support the rest of the ODF specs tags for revisions, bookmarks, footnote or anthing else that might combine into the markup. when revising a single digit of a math formula in an ODF doucment that would therefore require the entire math to be marked as a revision or (if possible) the math to be split up in multiple parts."

When you say " ...not document standards but web standards" I assume you mean the standard is more suited to web pages than editable documents. I admit, I have done limited research, but what research I have done would suggest you may well be correct. However, as you point out in your comments there are working solutions applications using ODF can implement to get around this limitation, and I would suggest the way forward is in the next version of the standard to enhance the SVG and MathML specifications, or create a subset of them, to facilitate the required functionality easier rather than using a proprietary single-vendor method of storing the information.


"Fifthly this document suggests that using shorter tags in OOXML requires more tags ? However this is not substantiated. Also it seems the OOXML spec mainly uses short tags for those item that are uses most like the main namespaces w=wordprocessingml and command tags for line, rows, colums and cells and uses longer descriptive tags for less frequent items. Also OOXML tagging use shows the relation between certain parent and child elements actually making implementation easier rather than harder as this document suggests."

I do not suggest shorter tags requires more tags. I simply point out the the OOXML uses more tags than ODF. This is substantiated in both the examples I gave, and seems largely due to the fact OOXML does not support the "SPAN" functionality of XML. I have not managed to figure out what "parent" and "child" relationship you are referring to that MS-OOXML shows that ODF does not.


"Sixthly the document does mention the redundancy of data in the spreadsheet cell as an advantage because the formatted version of the data is easily retrieved. That is correct but also it requires any implementation to make certain that the data and it's formatted version are always synchronised. The ODF spec does not state how to deal with inconsistencies in the cells value and representation or whether a document should verify on opening that these are correct. An application not verifying this could potentially be at risk from hidden data in spreadsheet value item. Especially at risk when this that value for instance could contains (references to) script or embedded executable elements."

ODF and MS-OOXML are formats. It is up to the applications to ensure consistency in both of these. It has no place in the format specification itself. Implementing security checks while loading data in the application is plain common sense, and one which Microsoft has a very poor track record in. Also, I would have thought that the MS-OOXML method of storing spreadsheet data is at far more at risk to security problems than ODF due to the inconsistent manner which formatting can be included, the cross referencing of strings to an index in a separate part of the spec and the peculiar way dates are stored as numbers, and that number being inconsistent depending on a global setting.


"Seventhly the document suggests that OOXML cannot handle ODF Office settings whilst ODF could handle OOXML compatibility settings. However the author seems to be unaware that OOXML actually contains a feature that is called custom settings. By creating a namespace and XML with your implementations settings you can add custom application settings to OOXML as well and even make them validate using a schema. Particularly bad in this white paper is that it suggest its wrongly perceived lack of compatibility with ODF settings as a reason not to choose for OOXML. Very poor thinking as OOXML actually has built in extensibility even beyond it's custom setting that could also manage any compatibility issue with ODF."

This is news to me, I obviously missed it buried in the 6000 or so pages of documentation. Thanks for pointing this out. You however epitomized the philosophy of the difference between MS-OOXML and ODF. MS-OOXML handles Microsoft's own applications natively, while everyone else needs to use the "custom" settings their, while ODF places everyone on an even footing.


"Eigthly this document suggest an inconsistency with ISO 216 (paper sizes). OOXML actually support non ISO paper formats which is as it should be because those ISO formats are not covering all common office paper formats in use. ODF uses a flexible paper format allowing for even more variation of paperssize but also possibly requiring more implementation for validating whether a given paper size size is actually available and or how to handle near exact sizes. With ISO 216 not being the only papers in use in for Office documents inconsistensies seem normal."

If an ISO standard exists then it makes sense to use it. There is no justification for using "1", "9", and "30" instead of "Letter", "A4" and "C4". Your point of not covering all paper sizes is valid, but simply unilaterally making more numbers up is not a valid solution. Storing precise paper dimensions is valid in this case, though, I agree, not perfect. If more codes are required for paper sizes it makes sense to extend ISO 216 properly rather than take a "go-it-alone" attitude.


"Nigntly The documents objects to OOXML referencing windows meta files and clipboard objects as they are propriety windows related formats. However these are only mentioned as optional enumeration values for embedded items. So these values can even be used on a linux platform to easily identify that the document contains embedded objects it cannot handle without having to open the embedded file. If an ODF file contains these windows related items (which it also can) it might have to verify the embedded filetype purely on it's extention or even open the embedded file. On the other hand the dismisses the references to java in ODF's java applets because the java standard is supposedly well documented. However it fails to mention that any implementation of native embedding of java applets actually requires a java virtual machine which for instance Microsoft isn't allowed to make themselves after it's legal issues with Sun (who were in part responsible for the addition of java appplets to the spec) making full implementations dependant on 3rd party Java VM licensing."

The point I was making about the META files is that MS-OOXML was obviously not designed for interoperability (which is backed up by statements by ECMA and Microsoft – see Note 11 at the foot of the article itself). Why are Microsoft's Windows META files singled out? Why cannot they be treated as any other format? It implies that the only applications that can support MS-OOXML have to support Microsoft's proprietary META files too.

Your point about JAVA I in essence agree with you, though I am not sure about the "Microsoft is not allowed to make a Java Virtual Machine" bit. If I remember correctly all what the legal ruling stated was that Microsoft were not allowed to employ their "Embrace, Extend and Extinguish" strategy with the JVM and still call it "Java". It did not limit them as to what functionality their programmers could and could not include in programs.

To re-iterate the point of the article, Java is now (on the verge of being) open source completely license un-encumbered. Windows META files are not. I believe it is far more of a sin to specify the MS-Meta files than it is the Java ones.

In conclusion, your accusation that I may be "biased" for ODF and against MS-OOXML. This bias is based on the research I did into both formats. The examples I gave were not contrived, and epitomize the differences between them. ODF is not perfect. You pointed the lack of spreadsheet formula definitions. Although not totally missing from ODF they are grossly lacking. This, I believe, was an oversight by OASIS and one they are correcting with the "OpenFormula" specification. However, it is an understandable oversight. Neither specification specify "Macro" syntax, and nor should they as that is very much in the realm of the application itself, and the application's functionality should not be limited by the interoperable document specification.

You have not addressed my biggest criticism of MS-OOXML, and the crux of the reason why I think it should not be an ISO standard. Standards exist for interoperability. An office file format is no exception. It is good that that one person should send me a document created in onje application, and me to be able to open and manipulate it in OpenOffice, AbiWord, Koffice, Lotus Notes or azny compliant application I choose. ODF sets out to achieve this. After examining MS-OOXML it is obvious that that does not. It simply sets out to store in an XML format data that is currently stored in the Microsoft binary "doc", "xls" files. Recently Microsoft claimed that ODF was simply an XML "dump" of the4 OpenOffice internals. This is both wrong and hypocritical. Wrong in the sense that ODF is most certainly NOT that, but was created in conjunction with the community, and hypocritical is obvious that MS-OOXML IS an XML "dump" of MS-Office's legacy formats.

They also claim that MS-OOXML is neccessary for rendering MS-Office legacy documents precisely. Again, this is just silly as it is the application that does the rendering, not the format. The fact the Microsoft claim that only MS-Office can do this shows the closed nature of them, and their lack of respect for interoperability. Also, we are talking about writeable office formats here. The standard for precisely rendered portable document formats is, you guessed it, PDF.

Please forgive me. This comment has grown much longer than I anticipated. And thank you once again, hAL, for contributing to the discussion.

Eddy macnaghten

Anonymous visitor's picture

.pdf

Submitted by Anonymous visitor (not verified) on Wed, 2007-08-29 10:07.

Vote!
0

"Also, we are talking about writeable office formats here. The standard for precisely rendered portable document formats is, you guessed it, PDF."

This might be the case for now, but I have to say that the goal of having a format that is so well
defined so as to be able to render the data accuratly must be present also for .odf
it might be a long way away....but still, nothing to stop it.

Since paper size is not analog in nature(printers rule), i would agree that the way forward should be to lift out the "dimension stating" and rely on a standard. odf need to change in this area, but the current solution is guess is the most open.....

Though i do dis like the container approach since it really nullify a standard in a lot of ways. The only external objects that should be allowed should be other standardised objects.

/Dan

Young Kun Kim(3K CEO)'s picture

pureXML vs Mixed XML

Submitted by Young Kun Kim(3K CEO) (not verified) on Sun, 2007-09-02 03:52.

Vote!
0

If we classify XML, we can do it as the PureXML by W3C and the MixedXML.

When the XML announced as the next generation of standard internet document in 1998, we had been expecting that XML would be widely distributed instead of HTML. Currently, a lot of enterprises are researching and developing XML technology and the soultions and applications based on XML are being adopted as a standard in various industry areas like trade, steal, laws, e-commerce, health care, automobile, etc and e-business industry like B2B, B2C, G2G, G4C and G2B and so forth.

But, in effect, the standard XML technologies (e.g. XML, CSS, XSL, DTD, Schema, XLink, Xpointer, Xpath, etc) by W3C are not being used in wide spread for making various kind of contents and applying various kind of soultions because of lack of researchs of core technology of XML, XML experts and wrong perception of XML. After all, lots of IT solution providers are trying to solve these problems by application technology and the legacy document technology. The various kind of technologies and applications like Web Service, XHTML, RSS, InfoPath, ODF, OOXML, etc are evolving with the PureXML by W3C to make up for the weak points of current XML.

The design goals of XML by W3C are being diluted with MixedXML.
A lot of IT solution providers are trying to develop and advance XML technology combine with current IT technologies (e.g. COM technology, legacy document technology, Print technology, etc).
http://www.xmlidc.com/baseXML/xmldoc/portal/xml_portal/xml_contents_keyw...

Gianni Giaccaglini's picture

OOXML vs. ODF spreadsheet representation

Submitted by Gianni Giaccaglini (not verified) on Fri, 2007-09-14 08:11.

Vote!
0

OK, ODF is simpler (although somebody could say it "simplicistic"). But I dont' agree with you when you say it is so "obscure": understandig it is easy and Microsoft specific documentation is very clear.
And you do not point out the avantages OOXML syntax offers, not only in terms of compactness. In particular (to do only a few exampoles):
- the r attribute in tags give an useful, immediate information to developers; on the contrary, ODF compels them to deduce it from the row/cells structure (with extra code and some delay at run-time);
- sharedStrings is a powerful tool when somebody want modify all of part of textual information (in corporate template e.g.); in ODF you have to reconstruct them navigating on the content file;
- by the way, the fact that ODF collects ALL spresheets in a UNIQUE (!!!) content xlm file vs. OOXML rational separation in different sheet1.xml, sheet2.xml etc. under a general spreadsheets.xml and with a separated workbook.xml tag etc. is a matter of serious criticism; non only in my opinion but with respect to basic STRUCTURED principles of XML recommedations (dont' you agree?)
- idem for the separation of styles from data; isnt' it a canon of XML?

Finally, please note that ODF was born OLD, because in fact it is mapped on StarOffice, an MS Office clone VERSION 2000. As a result, for instance it do not support ListObjects, very usefull things Excel 2007 upgrades well. So, please, look at some table1.xlm file in OOXML: you will find how clear and substancially self-documented it is.

Thank for paying attention