Beefy Boxes and Bandwidth Generously Provided by pair Networks
We don't bite newbies here... much
 
PerlMonks  

Re: Extracting keywords from HTML

by fizbin (Chaplain)
on Aug 21, 2005 at 18:11 UTC ( #485560=note: print w/replies, xml ) Need Help??

Help for this page

Select Code to Download


  1. or download this
    sub get_stop{
      # sample
    ...
        qw(’ ‘ ” “);
      \%punc;
    }
    
  2. or download this
      s/(&#?\w+;)/$punc->{$1}||$1/eg;
    
  3. or download this
      next if $stop->{$_};
    
  4. or download this
    ''words in double single quotes''
    
  5. or download this
      for (@words){
        s/^['-]+//;
    ...
        #  next if /--/;  # not needed anymore
        $words_all->{$_}->{$file_key} += 1;    
      }
    
  6. or download this
    --
    @/=map{[/./g]}qw/.h_nJ Xapou cets krht ele_ r_ra/;
    map{y/X_/\n /;print}map{pop@$_}@/for@/
    

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://485560]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others contemplating the Monastery: (4)
As of 2023-01-30 18:20 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found

    Notices?