Re: internal format of a word document



The format isn't really deep, dark, or secret, although I won't argue about
the shameful part. :-) The documentation is on MSDN, starting at
http://msdn.microsoft.com/library/default.asp?url=/library/en-us/stg/stg/structured_storage_start_page.asp.
There are other resources on the Internet; search Google for "OLE structured
storage".

If you want to manipulate a Word file without involving Word, you have a LOT
to learn about OLE, COM, and other three-letter acronyms. I figure a year of
study ought to get you started. Also be aware that most of the available
sample code is in C++, not VB6.
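
To give a feel for what "structured storage" means at the byte level, here is a minimal sketch in Python (not VB6 or C++) that only reads the fixed header of a compound file and checks its signature. The function name read_cfb_header is made up for illustration, and the field offsets are taken from the published compound file layout as I understand it. It does not parse the sector tables or the Word streams, which is where the real work starts.

```python
import struct

# 8-byte signature that opens every OLE structured storage (compound) file.
CFB_SIGNATURE = bytes.fromhex("D0CF11E0A1B11AE1")


def read_cfb_header(data: bytes) -> dict:
    """Hypothetical helper: decode a few fixed header fields.

    Assumed layout: signature at offset 0, a 16-byte CLSID, then four
    little-endian 16-bit fields starting at offset 24 (minor version,
    major version, byte-order mark, sector shift).
    """
    if len(data) < 32 or data[:8] != CFB_SIGNATURE:
        raise ValueError("not an OLE compound file")
    minor, major, byte_order, sector_shift = struct.unpack_from("<HHHH", data, 24)
    return {
        "minor_version": minor,
        "major_version": major,
        "byte_order": byte_order,
        # Sector size is stored as a power of two.
        "sector_size": 1 << sector_shift,
    }


def read_cfb_header_from_file(path: str) -> dict:
    """Read only the first 32 bytes of the file at `path` and decode them."""
    with open(path, "rb") as f:
        return read_cfb_header(f.read(32))
```

Calling read_cfb_header_from_file on a .doc file should return the header fields, and a file that is not a compound file raises ValueError. Everything after the header (allocation tables, directory entries, and Word's own streams) is what makes editing the bytes by hand so fragile.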

--
Regards,
Jay Freedman
Microsoft Word MVP FAQ: http://word.mvps.org

Margaret Aldis wrote:
> The internal format is a deep, dark (and probably shameful) secret ;-)
>
> You can't directly manipulate the file format - certainly any change
> to number of characters etc. will render the file completely
> unreadable (because the pointers Word uses for all its formatting
> etc. won't point to the right place any more.) Instead, you need to
> use VBA to manipulate the document object structure. There are
> several Word VBA newsgroups where you can get more help on specifics.
>
>
> "yarders" <yarders@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote in message
> news:F14CCE65-FD8B-477F-9A57-98C1B9A2EFD7@xxxxxxxxxxxxxxxx
>> How do you find out the internal format? The reason for this is that
>> I'm trying to manipulate a Word document from VB6 code, and after it's
>> manipulated the size of the file could change.
>>
>> After saving the changes I can't open the file; a message box is shown
>> stating that the name or path is not valid. I believe it's because the
>> file size has changed.
>>
>> Any help with this will be greatly appreciated, thanks.

