Re: Word 2003 XML



On 2006-06-16 17:52:00 +0100, Jim Gordon <goldkey74@xxxxxxxxxxxxxxxxxxxxxx> said:

Office 2004 is unable to understand the new XML file format for Word and PowerPoint at this moment. Excel 2004 is able to understand the new format (I think - haven't had a chance to try it yet).

Thanks. I think it sounds as though it is best avoided for the moment. I basically work in a Windows world so it is useful to be able to move doc files over to the Mac for homework.


The advantage of XML is that it is human readable, at least to those humans who read XML code. XML is an open format meaning that anyone can create XML code and it will be interchangeable with other documents. It is not proprietary.

The appearance of file sizes being smaller is sleight of hand. The old .doc format is a binary format. Binary formats are more efficient than text, but are not open standards based. XML is text, and hence requires large file sizes. The sleight of hand is because the XML files are automatically zipped (compressed) before they are saved they seem to be smaller. For a fair comparison, zip a .doc format document and compare that size to the same XML document.

I have to say it didn't strike me as looking like a zip file when I opened it in a text editor. When I opened an ODF file in this way the PKZIP reference at the top of the file was immediately striking.

ODF does seem to be a very much more /open/ format. Having played with it in the past I really like that one can open the zip file, edit the XML that is the text, change the PNG images then open the file in OpenOffice with all those changes in place.
--
Cheers,

Steve

The reply-to email address is a spam trap.
Email steve 'at' shodgson 'dot' org 'dot' uk

.



Relevant Pages

  • Re: Sane Syntax
    ... vital role in the future of TeX but we need some more human friendly ... Generating well formed LaTeX2e documents from XML ... Another approach is to convert existing documents to XML format and go ... TEI, together with DocBook, are the two ...
    (comp.text.tex)
  • Re: XHTML vs HTML
    ... to be the predominant type of HTML used on the web for many years yet. ... First, it is XML. ... XHTML is also ... transformed using XSL from and into virtually *any* other data format. ...
    (microsoft.public.frontpage.programming)
  • Re: text to bibliography?
    ... to xml: you can store binary data in an xml file. ... including your well-formattedbibliography(no longer in xml format). ... It is in annotated bibliographies (something Word 2007 does not ... that %I is actually the field representing the publisher. ...
    (microsoft.public.word.docmanagement)
  • Re: Future of LISP. Alternative to XML. Web 3.0?
    ... using s-expressions instead of XML, nobody is going to use it, ... because it's cheaper to keep the existing XML software and continue ... XML-MAIDEN format or to HTML format and next displayed via standard ... or a CanonML or LISP browser. ...
    (comp.lang.lisp)
  • Re: Data table text I/O package?
    ... It has bracketing: rows and columns. ... As a medium XML is as awful as readable. ... > tell you not to use an internal type when it suits your application. ... is irrelevant to the data format used. ...
    (comp.lang.ada)

Loading