You can of course take an HTML::Parser to do the work for you. This is the best way.
But if you really want your own regexp, I suggest to use 2 parsings, just to maintain readability. I am sure a real regexp expert can do it in one line, but here is my try.
It is up to you to store then in arrays, but at least it gives you a hint.$x="<html>/something/more words<br/></html>"; $x =~ m|<(\w+)>(.*)</\1>|g; print "first is $1\n"; # You can put the <> around it here $2 =~ m|(/\w+/)(.*)<br/>| ; print "second is $1\n"; print "third is $2\n";
This is really quick and dirty...
updateIt all depends on how much flexibity you want anyway, you can easily play around with the seperators, etc ...
---------------------------
Dr. Mark Ceulemans
Senior Consultant
IT Masters, Belgium
In reply to Re: Regular expressions
by mce
in thread Regular expressions
by Anonymous Monk
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |