Re: Convert .rtf or .doc or .pdf or .htm to plain txt

From: Beringer (borden_eric_at_invalid.com)
Date: 01/28/05


Date: Fri, 28 Jan 2005 06:59:43 -0800

As a related topic:
Does anybody know of code examples on how to convert RTF to HTML, XML etc?

Thanks in advance,
Eric

"David Browne" <davidbaxterbrowne no potted meat@hotmail.com> wrote in
message news:eJNMQQUBFHA.2180@TK2MSFTNGP12.phx.gbl...
>
> "Dave" <nospam@yahoo.com> wrote in message
> news:uEOED%23TBFHA.2624@TK2MSFTNGP11.phx.gbl...
>> Greetings,
>>
>> Is anybody aware of any code that will allow me to read .rtf or .doc or
>> .pdf or .htm as plain text (so I can do a streamreader off them).
>> Thanks,
>>
>
> Each format would require a different tool. Microsoft Word can do .rtf
> and, of course, .doc.
>
> But for PDF check out the pdftotext.exe from the XPDF library
>
> http://www.foolabs.com/xpdf/download.html
>
> from their web site:
>
> "Xpdf is an open source viewer for Portable Document Format (PDF) files.
> (These are also sometimes also called 'Acrobat' files, from the name of
> Adobe's PDF software.) The Xpdf project also includes a PDF text
> extractor, PDF-to-PostScript converter, and various other utilities.
>
> Xpdf runs under the X Window System on UNIX, VMS, and OS/2. The non-X
> components (pdftops, pdftotext, etc.) also run on Win32 systems and should
> run on pretty much any system with a decent C++ compiler. "
>
>
> It's a commandline tool so you would need to shell out to it, and then
> open a streamreader against the output file.
>
> David
>
>
>



Relevant Pages

  • Creating a printable report simply, rtf or PDF
    ... Over Christmas I produced, using PHP and MySQL, a database and front end to ... Better ways I assume are to export it in PDF or RTF. ... get my script to produce XML and then use a template ...
    (comp.lang.php)
  • Re: Creating a printable report simply, rtf or PDF
    ... There are a few RTF generators available there, ... > Better ways I assume are to export it in PDF or RTF. ... > and something like sablotron to generate XML. ...
    (comp.lang.php)
  • Re: XSL pagination control
    ... supported as of XSLT v1.1, and nearly nobody implements it just yet). ... this potential reporting solution is to be used ... XSLT for an XML everytime a new report has to be emitted, ... end product in either .pdf or .rtf. ...
    (comp.text.xml)
  • Re: MS Word objects list
    ... If you had said at the beginning that you're translating between RTF ... and XML, I would have suggested right away: ... >>> ranges of text can contain comments and so on. ...
    (microsoft.public.word.vba.general)
  • Re: HP 4MP printer driver current?
    ... I can print the RTF file from the G4, ... When you print to a PDF, ... When Preview is involved in the print process, the original format of ... Preview to examine a document in a print queue. ...
    (comp.sys.mac.system)

Quantcast