Re: Replacing html tags
- From: "Mark Fitzpatrick" <markfitz@xxxxxxxxxx>
- Date: Wed, 4 Oct 2006 10:27:09 -0500
Woohoo! This is a great control library. Glad you posted it here as it saved
me from writing a lot of code using the WebBrowser control to do some
similar HTML manipulation.
--
Thanks again,
Mark Fitzpatrick
Former Microsoft FrontPage MVP 199?-2006
"Chris Fulstow" <chrisfulstow@xxxxxxxxxxx> wrote in message
news:1159974982.558465.258060@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
You could do this with the HTML Agility Pack:
http://www.codeplex.com/Wiki/View.aspx?ProjectName=htmlagilitypack
I think it comes with an example that strips HTML tags, which you could
probably adapt quite quickly to keep <a> tags.
jumblesale wrote:
Hello all,
I'm not all that bad at Regex, but i'm stumped on how to approach my
problem.
I need to parse a string and remove all html tags except hyperlinks.
I can remove all the html tags using: Regex.Replace(inputText,
@"<(/?[^\>]+)>", "");
But this also removes any hyperlinks, which i need to keep.
I've also written a regex for finding hyperlinks:
<a[\s]href=["'][^"]+[.\s]*["'][^<]+[.\s]*</a>
but my problem is trying to put all this together.
I've thought of using Regex.Matches and checking each instance but
can't get that to work.
Any ideas and/ or code would be great - i'm used to C# but VB's cool as
well.
Cheers in advance,
max
.
- References:
- Replacing html tags
- From: jumblesale
- Re: Replacing html tags
- From: Chris Fulstow
- Replacing html tags
- Prev by Date: Re: Replacing html tags
- Next by Date: asp.net 1.1 error
- Previous by thread: Re: Replacing html tags
- Next by thread: Error Creating aplicacion ASP.NET
- Index(es):
Relevant Pages
|