monkfan has asked for the wisdom of the Perl Monks concerning the following question:
Hi,
Given this type of html (stored in a variable), how can I extract the string after the last <pre> tag:
such that it returns simply:
I couldn't figure out how to create mechanism that can distinguished between html tag and 'fasta' tag marked by ">".
Regards,
Edward
Given this type of html (stored in a variable), how can I extract the string after the last <pre> tag:
$VAR1 = '<html><title>GAL7</title> <body bgcolor=white> <h2 align=center>GAL7</h2><hr> <form method="post" action="/cgi-bin/SCPD/getgene2?GAL7" enctype="appl +ication/x-www-form-urlencoded"> <input type="submit" name="action" value="Get mapped sites" /><input t +ype="submit" name="action" value="Get putative sites" /><input type=" +submit" name="action" value="Get interg enic region" /><br /><input type="submit" name="action" value="Retriev +e sequence" />Start<-ATG <input type="text" name="start" value="-450" + size="5" maxlength="5" />ATG->End <inp ut type="text" name="end" value="50" size="5" maxlength="5" /><div></d +iv></form><hr> <pre> >YBR018C GAL7 275433 275933 TTTGATATCACTCACAACTATTGCGAAGCGCTTCAGTGAAAAAATCATAA GGAAAAGTTGTAAATATTATTGGTAGTATTCGTTTGGTAAAGTAGAGGGG GTAATTTTTCCCCTTTATTTTGTTCATACATTCTTAAATTGCTTTGCCTC TCCTTTTGGAAAGCTATACTTCGGAGCACTGTTGAGCGAAGGCTCATTAG ATATATTTTCTGTCATTTTCCTTAACCCAAAAATAAGGGAAAGGGTCCAA AAAGCGCTCGGACAACTGTTGACCGTGATCCGAAGGACTGGCTATACAGT GTTCACAAAATAGCCAAGCTGAAAATAATGTGTAGCTATGTTCAGTTAGT TTGGCTAGCAAAGATATAAAAGCAGGTCGGAAATATTTATGGGCATTATT ATGCAGAGCATCAACATGATAAAAAAAAACAGTTGAATATTCCCTCAAAA ATGACTGCTGAAGAATTTGATTTTTCTAGCCATTCCCATAGACGTTACAA ';
my $new_output = ' >YBR018C GAL7 275433 275933 #this fasta marker line is to be + kept TTTGATATCACTCACAACTATTGCGAAGCGCTTCAGTGAAAAAATCATAA GGAAAAGTTGTAAATATTATTGGTAGTATTCGTTTGGTAAAGTAGAGGGG GTAATTTTTCCCCTTTATTTTGTTCATACATTCTTAAATTGCTTTGCCTC TCCTTTTGGAAAGCTATACTTCGGAGCACTGTTGAGCGAAGGCTCATTAG ATATATTTTCTGTCATTTTCCTTAACCCAAAAATAAGGGAAAGGGTCCAA AAAGCGCTCGGACAACTGTTGACCGTGATCCGAAGGACTGGCTATACAGT GTTCACAAAATAGCCAAGCTGAAAATAATGTGTAGCTATGTTCAGTTAGT TTGGCTAGCAAAGATATAAAAGCAGGTCGGAAATATTTATGGGCATTATT ATGCAGAGCATCAACATGATAAAAAAAAACAGTTGAATATTCCCTCAAAA ATGACTGCTGAAGAATTTGATTTTTCTAGCCATTCCCATAGACGTTACAA ';
Regards,
Edward
|
---|
Replies are listed 'Best First'. | |
---|---|
Re: Extracting Text After <pre> tag in HTML
by GrandFather (Saint) on Sep 22, 2006 at 01:28 UTC | |
Re: Extracting Text After <pre> tag in HTML
by graff (Chancellor) on Sep 22, 2006 at 01:35 UTC | |
by mreece (Friar) on Sep 22, 2006 at 20:13 UTC | |
Re: Extracting Text After <pre> tag in HTML
by gellyfish (Monsignor) on Sep 22, 2006 at 21:25 UTC | |
by Anonymous Monk on Mar 30, 2007 at 08:41 UTC | |
Re: Extracting Text After <pre> tag in HTML
by radiantmatrix (Parson) on Sep 27, 2006 at 15:02 UTC | |
Re: Extracting Text After <pre> tag in HTML
by mreece (Friar) on Sep 22, 2006 at 20:28 UTC |
Back to
Seekers of Perl Wisdom