in reply to Pattern Search Across Multiple Lines
You haven't written it explicitly, but lets assume that the trade-messages are not nested and that the file is not corrupted (no unmatched tags)... otherwise things will get more complicated
You say your file is huge => therefore you should read it line-by-line
based on the assumptions above, the algorithm below should do the trick
note1: you will surely want to enhance the first check to cover the situation that the endTag occurs before the startTag
note2: check substr ... no need for regular expressions here
note3: if things get more complicated (e.g. you need to evaluate sub-Tags, or the input is really XML and contains comments (<!--)) you will want to use some of the fine XML-modules of CPAN instead (don't forget to check the Tutorials section on that topic ;-)
|
|---|