in reply to Re: 1GB XML mining with XML:twig (newbies question)
in thread 1GB XML mining with XML:twig (newbies question)

Thanks for both the replies. I solved the problem already by means offered by XML::twig. There is possiblity to read just a portion of the data (one PC-Compound), to parse and discard in the end - in principle it is similar to your suggestions in a way:
my $twig= new XML::Twig( twig_handlers => { PC-Compound => \&subrutineforparsing} ); $twig->parsefile($inputfile);
As for several PC-InfoData_value_binary (aliases), I load the into an array and than use regular expression to get just the alias from NSC db. karpatov

Replies are listed 'Best First'.
Re^3: 1GB XML mining with XML:twig (newbies question)
by karpatov (Beadle) on Feb 18, 2008 at 16:00 UTC
    Hmm. My solution worked. But was desperately slow and runout of memory errors happend. So I decided to use your strategy (Regex and only then xml-parser) and it is great. Tx. karpatov