You should have a look at the various nodes about XML::Parser on the site (use the site search), starting with XML::Parser Tutorial and the review.

Here is a somewhat clean (except that it uses global variables) way to use XML::Parser for your purpose:

#!/usr/bin/perl -w use strict; use XML::Parser; my( $in_item, $item); my $parser= XML::Parser->new(); $parser->setHandlers( Start => \&start_handler, Char => \&char_handler, End => \&end_handler, ); $parser->parse( \*DATA); sub start_handler() # raises a flag when getting to Item { my ($p, $elmt ) = @_; if ( $elmt eq 'Item' ) { $in_item=1; $item=''; # reset the item text } } sub char_handler # stores the item text in $item { my( $p, $text)= @_; if( $in_item) { $item .= $text; } } sub end_handler # processes the item and lowers the flag { my ($p, $elmt ) = @_; if ( $elmt eq 'Item' ) { $in_item=0; # Toto, we are not in Item anymore # this is where you call the Item processing procedure print "item: $item\n"; } } __DATA__ <Orders> <Order ID="0008" Date="11/14/1999"><Item>A Book</Item></Order> </Orders>

Now as for your second question, XML::Parser's Subs style would help, or you could just use XML::Twig:

#!/usr/bin/perl -w use strict; use XML::Twig; my $twig= XML::Twig->new( twig_handlers => { Item => sub { print $_->t +ext, "\n"; } }); $twig->parse( \*DATA); __DATA__ <Orders> <Order ID="0008" Date="11/14/1999"><Item>A Book</Item></Order> </Orders>

BTW, I was confused by the use of Entity in your question. Entity already has a meaning in XML: &lt; is an entity.


In reply to Re: XML::Parser start handler by mirod
in thread XML::Parser start handler by Anonymous Monk

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.