Re: Convert .rtf or .doc or .pdf or .htm to plain txt

From: Matt Berther (mberther_at_hotmail.com)
Date: 01/28/05


Date: Fri, 28 Jan 2005 11:49:13 -0800

Hello Beringer,

Im not completely sure about this, but vwWare[1] may do what you need.

[1] http://wvware.sourceforge.net/

--
Matt Berther
http://www.mattberther.com
> As a related topic:
> Does anybody know of code examples on how to convert RTF to HTML, XML
> etc?
> Thanks in advance,
> Eric
> "David Browne" <davidbaxterbrowne no potted meat@hotmail.com> wrote in
> message news:eJNMQQUBFHA.2180@TK2MSFTNGP12.phx.gbl...
> 
>> "Dave" <nospam@yahoo.com> wrote in message
>> news:uEOED%23TBFHA.2624@TK2MSFTNGP11.phx.gbl...
>> 
>>> Greetings,
>>> 
>>> Is anybody aware of any code that will allow me to read .rtf or .doc
>>> or .pdf or .htm as plain text (so I can do a streamreader off them).
>>> Thanks,
>>> 
>> Each format would require a different tool.  Microsoft Word can do
>> .rtf and, of course, .doc.
>> 
>> But for PDF check out the pdftotext.exe from the XPDF library
>> 
>> http://www.foolabs.com/xpdf/download.html
>> 
>> from their web site:
>> 
>> "Xpdf is an open source viewer for Portable Document Format (PDF)
>> files. (These are also sometimes also called 'Acrobat' files, from
>> the name of Adobe's PDF software.) The Xpdf project also includes a
>> PDF text extractor, PDF-to-PostScript converter, and various other
>> utilities.
>> 
>> Xpdf runs under the X Window System on UNIX, VMS, and OS/2. The non-X
>> components (pdftops, pdftotext, etc.) also run on Win32 systems and
>> should run on pretty much any system with a decent C++ compiler. "
>> 
>> It's a commandline tool so you would need to shell out to it, and
>> then open a streamreader against the output file.
>> 
>> David
>> 


Relevant Pages

  • Re: Convert .rtf or .doc or .pdf or .htm to plain txt
    ... "Xpdf is an open source viewer for Portable Document Format files. ...
    (microsoft.public.dotnet.framework)
  • Re: HP 4MP printer driver current?
    ... I can print the RTF file from the G4, ... When you print to a PDF, ... When Preview is involved in the print process, the original format of ... Preview to examine a document in a print queue. ...
    (comp.sys.mac.system)
  • Re: HP 4MP printer driver current?
    ... So the built-in PDF rendering can cope with the document, ... If I save it (Print>Save As PDF) as a PDF in Word, ... I can print the RTF file from the G4, ... If the computer is spitting up an "Unexpected EOF" error, ...
    (comp.sys.mac.system)
  • Re: HP 4MP printer driver current?
    ... I can print the RTF file from the G4, ... When you print to a PDF, ... When Preview is involved in the print process, the original format of ... Preview to examine a document in a print queue. ...
    (comp.sys.mac.system)
  • Re: Aus Webanwendung PDF, RTF, ... generien
    ... anders als PDF und RTF). ... diese Ausgabe in PDF hinbekommt. ... Sollte ich vielleicht auf ein Zwischenformat wie DocBook umsteigen? ... von simplen Tabellen wohl eher eine Atombombe, ...
    (de.comp.lang.perl.misc)

Loading