in reply to Is it possible to retrieve the coding sequence of a gene from NCBI GenBank database using perl ?
In fact, there is a hyperlink on the word CDS on the gene's GenBank page at NCBI, and clicking it shows the CDS of the gene. Is it possible to get the CDS sequence directly using a Perl script?
It sounds to me like you have a website that provides the CDS already prepared, and you are asking us whether there is a way to download the text from that page. If that is the case, then yes, you can do that. Given a URL, there are several ways to download a page. On *nix platforms you can call wget through Perl's system function or with qx(). From Perl directly you can use the LWP::Simple module.
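To make the LWP::Simple route concrete, here is a minimal sketch. It assumes the CDS text can be fetched from NCBI's E-utilities efetch endpoint with rettype=fasta_cds_na; that URL and its parameters are my assumption and are not taken from the page you describe, so check them against the NCBI documentation. The helper names cds_url and fetch_cds are made up for illustration.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: build the download URL for one accession.
# ASSUMPTION: the efetch endpoint and rettype=fasta_cds_na are not from
# the original question; verify them before relying on this.
sub cds_url {
    my ($accession) = @_;
    my $base = 'https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi';
    # Accessions are plain letters, digits, '_' and '.', so no escaping here.
    return sprintf '%s?db=nuccore&id=%s&rettype=fasta_cds_na&retmode=text',
        $base, $accession;
}

# Hypothetical helper: download the CDS text, or die if the request fails.
sub fetch_cds {
    my ($accession) = @_;
    # Loaded lazily; https URLs also need LWP::Protocol::https installed.
    require LWP::Simple;
    my $text = LWP::Simple::get( cds_url($accession) );
    die "Could not download CDS for $accession\n" unless defined $text;
    return $text;
}

# Fetch only when an accession is given on the command line.
print fetch_cds( $ARGV[0] ) if @ARGV;
```

You would run it as something like `perl get_cds.pl NM_000546.6`, where both the script name and the accession are only placeholders. LWP::Simple's get returns undef on failure, which is why the result is checked before printing.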
But it sounds like you need to give the number or name of the gene to get to the correct page before the CDS link appears. If that is the case you might be able to use WWW::Mechanize. It doesn't support JavaScript, hence the word 'might'. Are these pages open to the public? What is the CDS format? I didn't see it in a quick web search.
Edit: I found a few solutions here that seem to do part of what you are requesting.
Re^2: Is it possible to retrieve the coding sequence of a gene from NCBI GenBank database using perl ?
by supriyoch_2008 (Monk) on Jan 24, 2017 at 14:16 UTC