Re: Re: Preferred Methods (again)

<!-- notroot: <Root> -->

<root><foo/><root><bar/></root></root> <!-- Yes, your regex will take the second <root> -->

If parsing XML data could be done with a simple regex, those modules would probably not exist.
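The two samples above can be tried against both approaches. This is a minimal sketch in Python, used here only for illustration; the regex from the parent post is not shown in this thread, so `NAIVE_ROOT` is a hypothetical stand-in, and the helper names and the sample document built around the comment are assumptions as well.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical naive pattern (an assumption, not the parent post's regex):
# take whatever sits between a <root> tag and the next </root>.
NAIVE_ROOT = re.compile(r"<root>(.*?)</root>", re.IGNORECASE | re.DOTALL)

# The comment sample from above, followed by an assumed small document.
comment_doc = "<!-- notroot: <Root> --><root><foo/></root>"
# The nested sample from above.
nested_doc = "<root><foo/><root><bar/></root></root>"


def regex_root_body(text):
    """Return what the naive regex believes is the root element's content."""
    match = NAIVE_ROOT.search(text)
    return match.group(1) if match else None


def parser_root_children(text):
    """Return the tag names of the real root element's direct children."""
    root = ET.fromstring(text)
    return [child.tag for child in root]


if __name__ == "__main__":
    for doc in (comment_doc, nested_doc):
        print("regex :", repr(regex_root_body(doc)))
        print("parser:", parser_root_children(doc))
```

With this particular pattern the regex starts matching inside the comment in the first sample and stops at the inner `</root>` in the second, while the parser reports the children of the actual root element in both cases. A different regex would fail differently, or not at all on these two inputs.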

2;0 juerd@ouranos:~$ perl -e'undef christmas'
Segmentation fault
2;139 juerd@ouranos:~$


Re: Re: Re: Preferred Methods (again) by perrin (Chancellor) on Jan 17, 2002 at 01:33 UTC
Get off your high horse about XML compliance. He gave a sample input format and asked how to grab pieces of it. If he changes the input format or wants it to handle broken input, he has to change the way he parses. That's true with an XML parser too.
Re: Re: Re: Re: Preferred Methods (again) by Juerd (Abbot) on Jan 17, 2002 at 01:38 UTC
seattlejohn already commented on XML compliance. With an XML parser you don't have to change your parsing code to grab the root element when the input changes; with a regex you (probably) do.
Re: Re: Re: Re: Re: Preferred Methods (again) by perrin (Chancellor) on Jan 17, 2002 at 01:44 UTC
"With an XML parser you don't have to change your parsing for grabbing the root element when the input changes"
Not when the content changes, but when the format changes you do. Your example was a format change: `<root><foo/><root><bar/></root></root> <!-- Yes, your regex will take the second <root> -->` If that's even legal, it would certainly require changes in your code to get the right part.
Re: Re: Re: Preferred Methods (again) by BMaximus (Chaplain) on Jan 17, 2002 at 02:21 UTC
"If parsing XML data could be done with a simple regex, those modules would probably not exist."
It's possible; it just doesn't have any error checking. See: Parsing pseudo XML files
BMaximus
Re: Re: Re: Re: Preferred Methods (again) by Matts (Deacon) on Jan 17, 2002 at 17:07 UTC
XML parsing is not possible with a simple regexp; it requires a proper parser. Having said that, it's possible with lots of complex regexps, or at least mostly possible. See Paul Kulchenko's XML::Parser::Lite for that. Now parsing a subset of XML, well, that's a totally different matter, and entirely appropriate in certain situations. Yes, that is me saying that. Oh, and XML::SAX::PurePerl has plenty of error checking. But it's likely way too slow for the questioner's problem.
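The error-checking point raised in the replies above can be illustrated with a small sketch, again in Python and under stated assumptions: the input string, the pattern, and the function names are all hypothetical, and `xml.etree.ElementTree` stands in for whichever real parser one would pick.

```python
import re
import xml.etree.ElementTree as ET

# Assumed broken input: <foo> is opened but never closed.
broken = "<root><foo></root>"
# Assumed well-formed counterpart.
well_formed = "<root><foo/></root>"


def regex_accepts(text):
    """A simple regex only checks that some <root>...</root> span exists."""
    return re.search(r"<root>(.*)</root>", text, re.DOTALL) is not None


def parser_accepts(text):
    """A real parser rejects input that is not well-formed."""
    try:
        ET.fromstring(text)
    except ET.ParseError:
        return False
    return True


if __name__ == "__main__":
    for doc in (well_formed, broken):
        print(doc, "regex:", regex_accepts(doc), "parser:", parser_accepts(doc))
```

The regex happily matches the broken input, whereas the parser raises a parse error. Whether that matters depends on how much the input format can be trusted, which is the trade-off being argued in this thread.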