in reply to SOLVED Removing characters

The split in

my @word = split " ", $_;
could be rewritten as
my @words = map { lc } # convert to lower case split /[\s.]+/, $_; # split on white space and "."
this will have the effect of removing all white space and periods and giving a list of lower case words. You need to read it backwards, the split happens before the map.

Replies are listed 'Best First'.
Re^2: Removing characters
by Slug (Acolyte) on Jan 22, 2008 at 15:05 UTC
    No I still can't get it to work. I just need to combine the contents of the file removing all white spaces, tabs, commas, return characters and periods.

    E.g

    Hey, diddle, diddle,
    The cat and the fiddle,
    heydiddlediddlethecatandthefiddle

      You hadn't mentioned commas in the original post. Anyway to show that the approach works a complete program

      #!/usr/bin/perl use 5.010_000; use warnings; use strict; my %number_of; my @words; while ( my $line = <DATA> ) { $number_of{lines}++; $number_of{chars} += length $line; push @words, map { lc } split /[\s.,]+/, $line; } $number_of{words} = scalar @words; foreach my $item (qw(chars words lines)) { say "Number of $item: $number_of{$item}"; } say @words; __DATA__ Hey, diddle, diddle, The cat and the fiddle, The cow jumped over the moon.
      and the output
      Number of chars: 75 Number of words: 14 Number of lines: 3 heydiddlediddlethecatandthefiddlethecowjumpedoverthemoon

      If other punctuation is to be removed then you add it to the split. The difficulty would be with an apostrophe, you probably do not want to count don't as two words so the split line becomes

      push @words, map { lc } # change to lower case map { tr/'//d; $_ } # remove apostrophes split /[\s.,?!"]+/, # split on space and punctuation $line;

      Update: removed a useless use List::Util; from the example.