in reply to Re: [OT] - Mutilated email addresses (not Perl)
in thread [OT] - Mutilated email addresses (not Perl)

I'd be curious to see what happens to underscores, hyphens, or other email address TLD's like in yahoo.com.tw

From the small sample dataset, the TLD appears to make no difference. But being in the UK means I am only likely to see .com .co.uk and .net. I shall keep an eye out for new domains appearing and check the behaviour.

I've had another look now there's a bit more data. The truncation happens at both underscores and hyphens...

chirl-loydayls@hotmail.co.uk -> chirl-loydayls@hotmail.co.ukchirl t_gilbro@hotmail.com -> t_gilbro@hotmail.comt

The above have been modified from the actual email addresses!

However, I don't have any access to the code on either server.

Replies are listed 'Best First'.
Re^3: [OT] - Mutilated email addresses (not Perl)
by jdporter (Paladin) on Jan 16, 2024 at 14:19 UTC

    I'd suspect that the bit being duplicated/appended is /^([a-z.]+)/i. That is, alpha and dot. Which kinda looks like someone's naive take on what domain names — and user names — can look like.

    If so, then you can strip off that extraneous bit with something like: s/^(([a-z.]+).*)\2$/\1/