I have proteins for which I would like to find their corresponding nucleotide sequences. I also have the genome in which the protein is found. In the genome, I have found the corresponding Gene ID for the protein. However, I am having trouble getting the nucleotide sequence with the Gene ID. I have tried using Entrez Efetch:
Entrez.email = "dddd@gmail.com"
with open("genome.gb", "w") as out_handle:
request = Entrez.efetch(db="gene", id="2703488", rettype="gb", retmode="text")
out_handle.write(request.read())
request.close()
but this only returns the following:
1. G
tail component [Escherichia virus Lambda]
Other Aliases: lambdap14
Other Designations: tail component
Annotation: NC_001416.1 (9711..10133)
ID: 2703488
Is there anyway to get the actual nucleotide sequence using Efetch? Thanks in advance!