Re: Replace character
- From: Jesse Houwing <jesse.houwing@xxxxxxxxxxxxxxxx>
- Date: Thu, 20 Nov 2008 11:12:48 +0000 (UTC)
Hello shapper,
On Nov 20, 2:33 am, Arne Vajhøj <a...@xxxxxxxxxx> wrote:
shapper wrote:Hi,
I have a text as follows:Regex.Replace with a reasonable regex expression for valid email
"My email is someth...@xxxxxxxxxxxxx and I posted this @ 2 am"
I need to replace the @ by (AT) bu only the ones that are in email
addresses.
All other @ shouldn't be replaced.
I know how to replace all @ but I am having problems in replacing
only the @'s in the email addresses.
How can I do this?
addresses should give a reasonable correct replacement.
Arne
I was trying that but I have two problems:
return Regex.Replace(text, "\w+([-+.]\w+)*@\w+([-.]\w+)*\.\w+([-.]\w+)
*", "(at)");
First I get a lot of errors on the pattern in each \ character:
Unrecognized escape sequence
The second problem is that this identifies all emails on my string but
I don't want to replace the emails but the @ in the emails.
There are two options here: Either capture the first and second part of the email in a group and put them back in.
Second, match only the @, but use lookahead and lookbehind constructions to make sure you only replace the @ in an emailaddress
So here we go:
capture the first and last parts and replace:
(?<name>\w+([-+.]\w+)+)@(?<domain>\w+([-.]\w+)*\.\w+([-.]\w+)*))
and replace that with:
${name}(at)${domain}
This should be quite easy to understand. Capture the parts you want to keep in a named group (?<name>..) and put them back into the replacement pattern ${name}.
The second option using look arounds:
(?<=\w+([-+.]\w+)+)(at)(?>\w+([-.]\w+)*\.\w+([-.]\w+)*))
and replace that with:
(at)
This one is usually harder to understand. (?<=...) looks back in your string and tries to find the pattern that is defined at the '...'. If it cannot find this exact pattern the whole expression fails.
(?>...) essentially does the same, but looks ahead, instead of backwards. The funny thing is, that even though you search for something in your regex, it doesn't become part of the actual Match and therefore doesn't get replaced.
Now to touch on your issue with th escape sequences. Any special character sequence in regex starts with a \. And any escape sequence in C# code starts with a \ as well, so if you put a regex in a string you again have two options: First, escape all your \'s in your regex by putting an additional \ in front of it. eg. \w becomes \\w. Second option is to use verbatim strings in C# by placing an @ in front of the string definition like this: @"\w...more regex goes here".
Hope this helps
--
Jesse Houwing
jesse.houwing at sogeti.nl
.
- Follow-Ups:
- Re: Replace character
- From: shapper
- Re: Replace character
- References:
- Re: Replace character
- From: shapper
- Re: Replace character
- Prev by Date: Refreshing modeless dialogs?
- Next by Date: Re: automatic use of cores for different threads, was: Stupid question?
- Previous by thread: Re: Replace character
- Next by thread: Re: Replace character
- Index(es):
Relevant Pages
|