Re: Convert .rtf or .doc or .pdf or .htm to plain txt
From: Beringer (borden_eric_at_invalid.com)
Date: 01/28/05
- Next message: Klayman: "COM+ stream"
- Previous message: Edward: "How to write plain ascii to a textfile"
- In reply to: David Browne: "Re: Convert .rtf or .doc or .pdf or .htm to plain txt"
- Next in thread: Matt Berther: "Re: Convert .rtf or .doc or .pdf or .htm to plain txt"
- Reply: Matt Berther: "Re: Convert .rtf or .doc or .pdf or .htm to plain txt"
- Messages sorted by: [ date ] [ thread ]
Date: Fri, 28 Jan 2005 06:59:43 -0800
As a related topic:
Does anybody know of any code examples showing how to convert RTF to HTML, XML, etc.?
Thanks in advance,
Eric
"David Browne" <davidbaxterbrowne no potted meat@hotmail.com> wrote in
message news:eJNMQQUBFHA.2180@TK2MSFTNGP12.phx.gbl...
>
> "Dave" <nospam@yahoo.com> wrote in message
> news:uEOED%23TBFHA.2624@TK2MSFTNGP11.phx.gbl...
>> Greetings,
>>
>> Is anybody aware of any code that will let me read .rtf, .doc, .pdf, or
>> .htm files as plain text (so I can open a StreamReader on them)?
>> Thanks,
>>
>
> Each format requires a different tool. Microsoft Word can open .rtf
> and, of course, .doc, and save either one as plain text.
>
> For PDF, check out pdftotext.exe from the Xpdf project:
>
> http://www.foolabs.com/xpdf/download.html
>
> From their web site:
>
> "Xpdf is an open source viewer for Portable Document Format (PDF) files.
> (These are sometimes also called 'Acrobat' files, from the name of
> Adobe's PDF software.) The Xpdf project also includes a PDF text
> extractor, PDF-to-PostScript converter, and various other utilities.
>
> Xpdf runs under the X Window System on UNIX, VMS, and OS/2. The non-X
> components (pdftops, pdftotext, etc.) also run on Win32 systems and should
> run on pretty much any system with a decent C++ compiler."
>
>
> It's a command-line tool, so you would need to shell out to it and then
> open a StreamReader against the output file.
>
> David
>
>
>
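The "shell out, then read the output file" approach David describes could look roughly like the sketch below. It is written in Python rather than .NET and is only an illustration under a few assumptions: that pdftotext is on the PATH and is invoked as `pdftotext <PDF-file> <text-file>`, and that its output can be read as UTF-8 (the actual encoding depends on how the tool is configured). The names build_command and pdf_to_text are made up for this example.

```python
import subprocess
import tempfile
from pathlib import Path


def build_command(pdf_path, txt_path, tool=("pdftotext",)):
    """Return the argument list: <tool> <PDF-file> <text-file>."""
    return [*tool, str(pdf_path), str(txt_path)]


def pdf_to_text(pdf_path, tool=("pdftotext",), encoding="utf-8"):
    """Shell out to the converter, then read the text file it wrote.

    `tool` is a hypothetical parameter so a different executable (or a
    full path to pdftotext.exe) can be substituted. `encoding` is an
    assumption; undecodable bytes are replaced rather than raising.
    """
    with tempfile.TemporaryDirectory() as tmp:
        txt_path = Path(tmp) / "out.txt"
        # check=True raises CalledProcessError if the tool exits non-zero.
        subprocess.run(build_command(pdf_path, txt_path, tool), check=True)
        # The counterpart of opening a StreamReader against the output file.
        with open(txt_path, encoding=encoding, errors="replace") as reader:
            return reader.read()
```

Usage would then be something like `text = pdf_to_text("report.pdf")`, where report.pdf is a placeholder file name. Writing to a temporary directory keeps the intermediate text file from being left behind once it has been read.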