paola82 has asked for the wisdom of the Perl Monks concerning the following question:
Hi monks, I have a problem with parsing, web pages, maybe the same problem every time I have to parse data like this...I have an html page, I show a part of that
<TR class="violet3"> <TD ><B>hsa-miR-107</B></TD> <TD >17.1922</TD> <TD >-21.47</TD> <TD >2.119850e-02</TD> <TD >2.097540e-02</TD> <TD >6.191350e-04</TD> <TD >106</TD> <TD >127</TD> <TD ><pre><FONT COLOR="#FFFFFF">a</FONT><FONT COLOR="#FFFFFF">c</F +ONT><FONT COLOR="#FFFFFF">u</FONT><FONT .... </TR> <TR class="violet2"> <TD ><B>hsa-miR-103</B></TD> <TD >17.1922</TD> .... <TR class="violet3"> <TD ><B>hsa-miR-651</B></TD> </TR> <TR class="violet2"> <TD ><B>hsa-miR-320</B></TD>
I need to extract hsa-miR-651, hsa-miR-320, etc....what I have to do, regular expression don't help me and I don't understand how to use some moduls like html::element....I don't understand the synthax and I'm not actually sure, if it is ok to use it...can anyoneone help me? Thanks you all
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: parsing html
by wfsp (Abbot) on May 14, 2009 at 15:55 UTC | |
|
Re: parsing html
by mirod (Canon) on May 14, 2009 at 15:51 UTC | |
|
Re: parsing html
by ramrod (Priest) on May 14, 2009 at 15:29 UTC | |
by paola82 (Sexton) on May 14, 2009 at 15:58 UTC | |
by wfsp (Abbot) on May 14, 2009 at 17:06 UTC | |
by paola82 (Sexton) on May 15, 2009 at 09:10 UTC | |
by wfsp (Abbot) on May 15, 2009 at 09:28 UTC | |
| |
by whakka (Hermit) on May 14, 2009 at 17:14 UTC |