in reply to Reverse engineering HTML pages to RDF/XML Schema Using ARC2 or Perl ?
I think you need to be more specific about what you are trying to do. Can you post a link to a page you are trying to parse, or paste in a short fragment, along with what you are trying to extract?
Having said that, if you are trying to parse HTML, then you probably want to use a module such as HTML::TreeBuilder (from CPAN). There have been two threads on this recently: "how to quickly parse 50000 html documents?" and "Parsing HTML files".
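As a rough illustration of the HTML::TreeBuilder approach, here is a minimal sketch. The HTML fragment, the example.com URLs, and the choice to extract links are all made up for the example, since I don't know what your pages look like or what you want out of them; adjust the look_down criteria to match your own markup.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical fragment standing in for the page you want to parse.
my $html = <<'HTML';
<html><body>
<h1>Example</h1>
<p><a href="http://example.com/a">First</a> and
<a href="http://example.com/b">Second</a></p>
</body></html>
HTML

# Build a parse tree from the string.
my $tree = HTML::TreeBuilder->new_from_content($html);

# Collect [href, link text] for every <a> element in the tree.
my @links = map { [ $_->attr('href'), $_->as_text ] }
            $tree->look_down( _tag => 'a' );

print "$_->[0]\t$_->[1]\n" for @links;

# Explicitly free the tree when finished with it.
$tree->delete;
```

Once you have the values you care about in a plain Perl data structure like this, turning them into RDF is a separate step from the parsing.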