Respect case in substitution

b4swine has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Respect case in substitution by ysth (Canon) on Feb 27, 2008 at 09:19 UTC
Something like: `s/\b(find)\b/$1 eq uc($1) ? "REPL" : $1 eq ucfirst(lc($1)) ? "Repl" : +"repl"/gie` [download] or more flexibly: `my %repl = ( find => 'repl', FIND => 'REPL', Find => 'Repl', ); s/\b(find)\b/$repl{$1}\|\|$repl{lc $1}/gie` [download] -- Online Fortune Cookie Search	[reply] [d/l] [select]
Re: Respect case in substitution by moritz (Cardinal) on Feb 27, 2008 at 09:30 UTC
I know that won't help you very much, but Perl 6 knows the `:ii` modifier, which can transport case information on a char by char basis, or it detects if the matched text has a "simple" case (like all upper, all lower, ucfirst, lcfirst, captilized), and applies that informaion to the substitution string. You can implement something like that in perl 5, but not with such a nice syntax: `s/\b(find)\b/transport_case($1, 'repl')/eig; sub transport_case { ... }` [download] The complexity of `transport_case` strongly depends on what you want to achieve. The first described behaviour is a simple matter of iterating over all the chars, and testing/applying the case.	[reply] [d/l] [select]
Re: Respect case in substitution by ikegami (Patriarch) on Feb 27, 2008 at 10:42 UTC
`s/\b(find)\b/ uc('repl') \| ( $1 ^ uc($1) ) /eig;` [download] Only guaranteed to work for search strings and replacement strings consisting entirely of ASCII letters. Non-letter and accented characters won't work. EBCDIC won't work. It also only works if $1 and $find are the same length. Update: Added clarification (by adding "guaranteed") and an extra failure mode.	[reply] [d/l]
Re^2: Respect case in substitution by graff (Chancellor) on Feb 27, 2008 at 18:33 UTC
What is your definition of "won't work" here? `#!/usr/bin/perl -l use strict; my $find="find=1"; my $repl="repl:2"; for my $trial (qw/find=1 Find=1 FIND=1 fInD=1/) { $_ = "here is >$trial< data"; s/\b($find)\b/ uc($repl) \| ( $1 ^ uc($1) ) /eig; print }` [download] For me, that produces: `here is >repl:2< data here is >Repl:2< data here is >REPL:2< data here is >rEpL:2< data` [download] Do you see something wrong with that? I'll grant that many accented characters won't work properly, in the sense that you'll get an incorrect character as the result, but actually, there are a fair number of them where the case distinction is a matter of a single bit being on or off (just like in the ASCII letters), and I'd expect those to work. (But I don't have time to test that just now -- and I'm sure not going to argue about EBCDIC...)	[reply] [d/l] [select]
Re^3: Respect case in substitution by ikegami (Patriarch) on Feb 27, 2008 at 20:47 UTC
It will happen to work for some invalid inputs. Here are cases that support what I said: Non-letters in $repl: my $find= "find"; my $repl= "@@@@"; $_ = "Find"; print; s/\b($find)\b/ uc($repl) \| ( $1 ^ uc($1) ) /eig; print; # @``` # Should be @@@@ [download] Non-letters in $find: `my $find= chr(1234) . 'find'; my $repl= "aaaaa"; $_ = chr(1234) . 'FiNd'; print; s/\b($find)\b/ uc($repl) \| ( $1 ^ uc($1) ) /eig; print; # AAaAa # Should be aAaAa` [download]	[reply] [d/l] [select]