Re: Convert .rtf or .doc or .pdf or .htm to plain txt

From: Matt Berther (mberther_at_hotmail.com)
Date: 01/28/05


Date: Fri, 28 Jan 2005 11:49:13 -0800

Hello Beringer,

I'm not completely sure about this, but wvWare[1] may do what you need.

[1] http://wvware.sourceforge.net/

--
Matt Berther
http://www.mattberther.com
> As a related topic:
> Does anybody know of code examples on how to convert RTF to HTML, XML
> etc?
> Thanks in advance,
> Eric
> "David Browne" <davidbaxterbrowne no potted meat@hotmail.com> wrote in
> message news:eJNMQQUBFHA.2180@TK2MSFTNGP12.phx.gbl...
> 
>> "Dave" <nospam@yahoo.com> wrote in message
>> news:uEOED%23TBFHA.2624@TK2MSFTNGP11.phx.gbl...
>> 
>>> Greetings,
>>> 
>>> Is anybody aware of any code that will allow me to read .rtf or .doc
>>> or .pdf or .htm as plain text (so I can open a StreamReader on them)?
>>> Thanks,
>>> 
>> Each format would require a different tool.  Microsoft Word can do
>> .rtf and, of course, .doc.
>> 
>> But for PDF, check out pdftotext.exe from the Xpdf project:
>> 
>> http://www.foolabs.com/xpdf/download.html
>> 
>> from their web site:
>> 
>> "Xpdf is an open source viewer for Portable Document Format (PDF)
>> files. (These are also sometimes also called 'Acrobat' files, from
>> the name of Adobe's PDF software.) The Xpdf project also includes a
>> PDF text extractor, PDF-to-PostScript converter, and various other
>> utilities.
>> 
>> Xpdf runs under the X Window System on UNIX, VMS, and OS/2. The non-X
>> components (pdftops, pdftotext, etc.) also run on Win32 systems and
>> should run on pretty much any system with a decent C++ compiler. "
>> 
>> It's a command-line tool, so you would need to shell out to it and
>> then open a StreamReader against the output file.
>> 
>> David
>> 
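For anyone following David's suggestion, here is a rough sketch of the "shell out, then read the output file" approach. It is written in Python only to keep it short; the same two steps apply from .NET with a process launch followed by a StreamReader. The function names, the default "pdftotext" executable name (assumed to be on the PATH), and the choice of writing the .txt file next to the PDF are all my assumptions, not anything from the Xpdf documentation. The `-enc UTF-8` option asks pdftotext for UTF-8 output so the file can be decoded predictably.

```python
import subprocess
import sys
from pathlib import Path


def build_pdftotext_command(pdf_path, txt_path, exe="pdftotext"):
    """Build the argument list: pdftotext [options] <PDF-file> <text-file>.

    'exe' is an assumed executable name; point it at pdftotext.exe if the
    tool is not on the PATH.
    """
    return [exe, "-enc", "UTF-8", str(pdf_path), str(txt_path)]


def pdf_to_text(pdf_path, exe="pdftotext"):
    """Shell out to pdftotext, then read the text file it produced."""
    pdf_path = Path(pdf_path)
    # Assumption: write the output next to the PDF with a .txt suffix.
    txt_path = pdf_path.with_suffix(".txt")

    # check=True raises CalledProcessError if pdftotext exits with an error.
    subprocess.run(build_pdftotext_command(pdf_path, txt_path, exe), check=True)

    # The equivalent of opening a StreamReader against the output file.
    with open(txt_path, encoding="utf-8", errors="replace") as reader:
        return reader.read()


if __name__ == "__main__":
    # Usage: python pdf_to_text.py some-document.pdf
    print(pdf_to_text(sys.argv[1]))
```

The other formats in the thread (.rtf, .doc, .htm) would each need their own converter in place of pdftotext; only the launch-then-read pattern carries over.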

