In my rules, do I refer to this tag as "ns2:Height" or as "Height" somehow accounting for the namespace?

You can assign namespace prefixes using the namespaces option, for example namespaces => { 'http://www.example.com/foo' => 'ns2' }, and then use the ns2: prefix in your rules. Or, you can use the same option to map different namespaces to the same prefix, if that is acceptable in your application (!), in the way I showed below (in that case the empty prefix).

Can I have rules that just deal with Height, Length, Width, Weight, etc?

Yes, you can specify multiple tag names for a rule with |. You can also use regexes to specify rules, as in '/foo/' => ... (although I don't think the two can be combined). Here's an example:

use warnings; use strict; use XML::Rules; use Data::Dump; my $parser = XML::Rules->new( stripspaces => 3|4, warnoverwrite => 1, namespaces => { 'http://www.example.com/foo' => '', 'http://www.example.com/bar' => '', '*' => 'die' }, # don't allow other namespaces rules => [ 'Height|Length|Width|Weight' => sub { lc $_[0] => $_[1]->{_content} }, 'root' => 'pass', _default => sub { die "Unknown tag $_[0]" }, ] ); dd $parser->parse(<<'END_XML'); <root xmlns="http://www.example.com/bar" xmlns:foo="http://www.example.com/foo" > <foo:Height Units="inches">1.00</foo:Height> <foo:Length Units="inches">1.00</foo:Length> <foo:Width Units="inches">1.00</foo:Width> <foo:Weight Units="pounds">0.01</foo:Weight> </root> END_XML __END__ { height => "1.00", length => "1.00", weight => 0.01, width => "1.00" }

In this code, I've made the _default rule be to die, that way I can make sure during development that I haven't overlooked a tag in the input XML. But you can of course also specify any _default rule you like.

By the way, since these questions are going a bit off topic from the original post, it might be best to start a new thread on your questions about XML::Rules, showing your code, enough example input to reproduce, etc. (Short, Self-Contained, Correct Example).


In reply to Re^2: hashref with(out) arrays by haukex
in thread hashref with(out) arrays by bfdi533

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.