New Office File Format Conversions: A Heads-Up

Tech Tip: Click here to run a free scan for Windows Errors and optimize PC performance



All:

We're going to get a lot of questions about the new Office File Formats
coming down the pike, so let's get the ball rolling.

* Microsoft Office is changing to new file formats in the next release.

* The new formats are nothing like the old formats. Not even "related".

* Yes, this is a very good idea. They held off as long as they could to
avoid disruption, but it's a move that has been long overdue.

* Word, Excel and PowerPoint will move first, in Office 2007/Mac Office 12,
the coming iteration. Everything else will change over next time.

The old formats are NOT text, they are binary structures in which
"containers" of data of each type are linked by binary pointers that
indicate the off-set of each container from the end of the file.

The new formats are Open (published) compressed XML, very similar to HTML.
Each document is a mini-web site, compressed with the standard Zip
compression algorithm. It may contain binary objects for things such as
graphics, but usually it won't.

Just a tip for young players: when Word changes file formats then don't
up-convert HEAVILY customised templates, re-make them.

While up-conversion works fine for documents, templates have a more complex
file structure and may have more complex widgets in them. Whenever you
convert from one file format to another, you may be asking the converter to
re-express relational binary objects (List Templates and Style Tables come
to mind).

If you are, what you get depends upon whether the new file format has a
direct equivalent of the object. In a document, you can usually take it as
read that up-conversion will be totally safe. In a template, not
necessarily so: you need to test it.

DOWN conversion, on the other hand, is another name for suicide. If you
convert a document UP, keep the old one if you need the old format. If you
need the old version of the application to read the new content, you can
save back, but you may lose data doing so, so keep the new version.

Don't even THINK about round-tripping between file formats. You WILL lose
data, and almost certainly gain corruption.

In the forthcoming conversion, up-conversion from .doc to .docx is
guaranteed to work. Up conversion from .html to .docx will work.

Up conversion from .dot to .doct is probably OK. You'll get a file that
should work without problems.

Up-conversion from .dot to .docm, or down-conversion from .docm to .doc
simply won't work. According to my reading of what is supposed to happen, I
suspect the application will actually disallow it.

You have been warned :-)

Cheers
--

Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.

John McGhie <john@xxxxxxxxxxx>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410

.



Relevant Pages

  • Re: Elegant bash solution for testing existence of any files?
    ... file formats to EPS-formatted files. ... "fed" into the directory the script resides in automatically. ... Once it's ascertained that files exist, the conversion ... echo "files exist, loading conversion process" ...
    (comp.unix.shell)
  • Re: Elegant bash solution for testing existence of any files?
    ... file formats to EPS-formatted files. ... "fed" into the directory the script resides in automatically. ... Once it's ascertained that files exist, the conversion ... echo "files exist, loading conversion process" ...
    (comp.unix.shell)
  • Re: Elegant bash solution for testing existence of any files?
    ... file formats to EPS-formatted files. ... "fed" into the directory the script resides in automatically. ... Once it's ascertained that files exist, the conversion ... echo "files exist, loading conversion process" ...
    (comp.unix.shell)
  • Re: time for a real guru
    ... > to do conversion on file formats not real time. ... ACM doesn't deal with files at all, ...
    (microsoft.public.win32.programmer.mmedia)
  • Re: Force Office 2003 to save in Office 2007 file formats
    ... The ADM file for Office 2003, to set the Office 2007 file formats as defaults (files only, not templates) when using the ... compatibility pack with the pre 2007 versions isn't available yet, but you should be able to use the information in this article ...
    (microsoft.public.office.misc)