Re: convert FROM pdf
- From: "Graham Mayor" <gmayor@xxxxxxxxxxxxxxxxxx>
- Date: Sat, 27 Oct 2007 08:42:49 +0300
PDF was never intended to be backword convertible. If it doesn't work
(saving PDF in Acrobat to DOC or TXT is somewhat hit and miss) then you will
have to consider OCR conversion options. Abbyy Finereader is as good as you
can get from this type of software.
--
<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
Graham Mayor - Word MVP
My web site www.gmayor.com
Word MVP web site http://word.mvps.org
<>>< ><<> ><<> <>>< ><<> <>>< <>><<>
Teresa wrote:
Lene, thank you so much for this detailed and quick answer. I will
try your method.
Again, thanks.
"Lene Fredborg" wrote:
In addition to text, I suppose the manuals include illustrations
(drawings, screen dumps, etc.). If you save the PDF files as .txt
files, you will lose all such content.
I think I would do the following for each manual:
1. Save the PDF document as a .doc file.
2. Create a new Word document based on Word template that contains
all the company's publishing styles (I suppose you have a Word
template with the relevant styles, page setup, etc.).
3. Select Insert > File and insert the file you converted from PDF
to .doc (or copy and paste the content).
4. Reformat the entire document. To a great extent, you should be
able to use Find and Replace to search for a style and replace it
with the appropriate company style (via More > Format > Style). When
working, turn on formatting marks (Ctrl+Shift+8) to make sure that
you can see everything found in the document.
You will also need to carefully correct numbering, headers/footers,
etc. It may be helpful to check the use of styles in Normal view
with Tools > Options
Style area width set to an appropriate amount so that you can seenames applied to all paragraphs (however, in case of tables, the
the style
style names are not shown).
You can use Ctrl+Spacebar to remove any direct character formatting
from selected text. You can use Ctrl+Q to remove any direct
paragraph formatting from selected text.
5. When finished, delete any irrelevant styles left over from the
converted PDF file.
Make backup copies regularly so that you have a version to revert to
just in case.
Note that you can copy styles from one Word document/template to
another via Tools > Templates and Add-Ins > Organizer if needed.
Maybe you could make a test with a few pages first in order to find
out whether the result in ePublisher Pro is OK.
---
Regards
Lene Fredborg
DocTools - Denmark
www.thedoctools.com
Document automation - add-ins, macros and templates for Microsoft
Word
"Teresa" wrote:
I need to convert four 100-page user manuals into my company's
publication style. The manuals are only available in PDF (no source
files), and I need to make them into Word files in order to make
changes/additions, and add my company's publication styles. I will
then convert them back to PDFs, and eventually might have to
convert them into online Help with WebWorks ePublisher Pro.
Believe me when I say that any little glitch in the Word documents
will create oodles of trouble for me if it is picked up in
ePublisher Pro. So I want to make this conversion to the company
styles as CLEAN as possible.
Please give me your opinion for a clean conversion from PDF to Word.
Here is what I am thinking:
Use Adobe Acrobat 8 and Save As a Text (Plain) .txt file.
Open a blank Word document and import the company styles.
Copy/paste from the .txt file into the new Word document, and then
apply styles.
I talk, but really I don't know how to "import the company styles."
How do I do this?
I'm nervous to use Acrobat 8 and Save As a Word .doc file because
there might be hidden junk that, even though I will fix the
formatting, might trip me up in the conversion to online Help.
Thanks for any and all suggestions.
.
- References:
- RE: convert FROM pdf
- From: Lene Fredborg
- RE: convert FROM pdf
- From: Teresa
- RE: convert FROM pdf
- Prev by Date: Re: Word 2003 listing files out of alphabetical order!
- Next by Date: Re: Replacing bracketed text
- Previous by thread: RE: convert FROM pdf
- Next by thread: RE: convert FROM pdf
- Index(es):
Relevant Pages
|