Re: Replacing html tags
- From: "Chris Fulstow" <chrisfulstow@xxxxxxxxxxx>
- Date: 4 Oct 2006 08:16:22 -0700
You could do this with the HTML Agility Pack:
http://www.codeplex.com/Wiki/View.aspx?ProjectName=htmlagilitypack
I think it comes with an example that strips HTML tags, which you could
probably adapt quite quickly to keep <a> tags.
jumblesale wrote:
Hello all,
I'm not all that bad at Regex, but i'm stumped on how to approach my
problem.
I need to parse a string and remove all html tags except hyperlinks.
I can remove all the html tags using: Regex.Replace(inputText,
@"<(/?[^\>]+)>", "");
But this also removes any hyperlinks, which i need to keep.
I've also written a regex for finding hyperlinks:
<a[\s]href=["'][^"]+[.\s]*["'][^<]+[.\s]*</a>
but my problem is trying to put all this together.
I've thought of using Regex.Matches and checking each instance but
can't get that to work.
Any ideas and/ or code would be great - i'm used to C# but VB's cool as
well.
Cheers in advance,
max
.
- Follow-Ups:
- Re: Replacing html tags
- From: Mark Fitzpatrick
- Re: Replacing html tags
- From: jumblesale
- Re: Replacing html tags
- References:
- Replacing html tags
- From: jumblesale
- Replacing html tags
- Prev by Date: Error Creating aplicacion ASP.NET
- Next by Date: Re: Replacing html tags
- Previous by thread: Replacing html tags
- Next by thread: Re: Replacing html tags
- Index(es):
Relevant Pages
|