Sciencemadness Discussion Board
Not logged in [Login ]
Go To Bottom

Printable Version  
Author: Subject: Protein sequence
phlogiston
International Hazard
*****




Posts: 1376
Registered: 26-4-2008
Location: Neon Thorium Erbium Lanthanum Neodymium Sulphur
Member Is Offline

Mood: pyrophoric

[*] posted on 6-2-2015 at 13:15
Protein sequence


For our interest, please share your interest and goal with this.

It is a DNA sequence, not protein.
using nucleotide blast, you get a very good (but not perfect) match with a gene encoding an alpha-dextrin endo-1,6-alpha-glucosidase from the bacterium <i>Raoultella ornithinolytica B6</i>:

Code:
Query CGTTCTATAGATACGCGATGACGGTATACCATCCGCA |||||||| | |||||| ||||||| ||||||||||| Sbjct CGTTCTATCGCTACGCGCTGACGGTTTACCATCCGCA


When you would try to translate your DNA sequence first, you would find that in the frame that you posted, the 7th codon is a stop codon (TGA), so that is probably the wrong frame.

Code:
cgttctatagatacgcgatgacggtataccatccgcaaagt R S I D T R - R Y T I R K


Looking at other frames (include the reverse strand), you will find stop codons in every possible one, except for the reverse complement skipping the first two bases:
Code:
actttgcggatggtataccgtcatcgcgtatctatagaacg F A D G I P S S R I Y R T


pblast does not return a perfect hit for this sequence. The best one is from an inosine-5'-monophosphate dehydrogenase from a bacterium from the <i>Cryocola</i> genus.

Code:
Query FADGIPSSRIY FA+GI SSRIY Sbjct FAEGISSSRIY



Am I just doing your homework for you or is this result in any way interesting?

[Edited on 6-2-2015 by phlogiston]




-----
"If a rocket goes up, who cares where it comes down, that's not my concern said Wernher von Braun" - Tom Lehrer
View user's profile View All Posts By User
morsagh
Hazard to Others
***




Posts: 187
Registered: 20-2-2014
Member Is Offline

Mood: No Mood

[*] posted on 6-2-2015 at 13:32


I wanted to learn how to use DNA codon table and from sequences match to protein. I am just on second class of high school so truly it isnĀ“t my homework.
View user's profile View All Posts By User
mayko
International Hazard
*****




Posts: 1218
Registered: 17-1-2013
Location: Carrboro, NC
Member Is Offline

Mood: anomalous (Euclid class)

[*] posted on 6-2-2015 at 14:41


If you want to see what peptide a DNA sequence would be translated into, a good web option is the Expasy Translate tool. (Expasy has a lot of other good tools) Enter your sequence, and it returns a codon-by-codon translation, for all three reading frames on the forward and reverse strand. (Keep in mind that only open reading frames, beginning with a Start, ending with a Stop, etc, correspond to 'well formed' peptides.) This would be best if you only have a few sequences to check, occasionally.

If you have more than you can do by hand, my tool of choice is the BioPython module. Seq().translate and SeqUtils.six_frame_translations would do what you want.

If you're looking to search for proteins which are similar to what your sequence would translate to, BLAST is likely the tool for you. In particular, BLASTX will search protein databases, given a nucleotide sequence.

Again depending on your specific situation, BLAST has a web utility and a command line exectutable.

If your sequence comes from a known species with a sequenced genome, you can also use the UCSC Genome Browser's BLAT search to view your sequence in its genomic context.

I do most bioinformatics with BASH and Python scripts; if those aren't your language or they're new to you, check your local library for "Bioinformatics for [Perl/Java/etc]" books.




al-khemie is not a terrorist organization
"Chemicals, chemicals... I need chemicals!" - George Hayduke
"Wubbalubba dub-dub!" - Rick Sanchez
View user's profile Visit user's homepage View All Posts By User
phlogiston
International Hazard
*****




Posts: 1376
Registered: 26-4-2008
Location: Neon Thorium Erbium Lanthanum Neodymium Sulphur
Member Is Offline

Mood: pyrophoric

[*] posted on 7-2-2015 at 14:31


I see, then Maykos post is much more helpful.
I assumed you were trying to identify an existing protein that was encoded by that stretch of DNA.

Although there are many excellent tools online to do the translation for you, it is definitely also a good learning experience to try doing it by hand using a genetic code table

You seem to be interested in biochemistry at an early age then already. Enjoy it, it is a tremendously fascinating topic, and I fully believe the next revolution in science will be in biology.

[Edited on 8-2-2015 by phlogiston]




-----
"If a rocket goes up, who cares where it comes down, that's not my concern said Wernher von Braun" - Tom Lehrer
View user's profile View All Posts By User
Mesa
Hazard to Others
***




Posts: 264
Registered: 2-7-2013
Member Is Offline

Mood: No Mood

[*] posted on 23-4-2015 at 17:31


Quote: Originally posted by mayko  


I do most bioinformatics with BASH and Python scripts; if those aren't your language or they're new to you, check your local library for "Bioinformatics for [Perl/Java/etc]" books.

My brother started off his postgrad microbiology work using Python etc.
But he had switched over to an old 400mhz UNIX box by the end of the project. Supposedly the obsolete RISC processors only took a fraction of the time to do protein folding calcs etc. than the models released 15 years after them.

View user's profile View All Posts By User
phlogiston
International Hazard
*****




Posts: 1376
Registered: 26-4-2008
Location: Neon Thorium Erbium Lanthanum Neodymium Sulphur
Member Is Offline

Mood: pyrophoric

[*] posted on 25-4-2015 at 02:23


That link is for a peptide sequencing service, not DNA.
DNA sequencing is indeed not difficult at all, with the right tools.




-----
"If a rocket goes up, who cares where it comes down, that's not my concern said Wernher von Braun" - Tom Lehrer
View user's profile View All Posts By User
Chemosynthesis
International Hazard
*****




Posts: 1071
Registered: 26-9-2013
Member Is Offline

Mood: No Mood

[*] posted on 25-4-2015 at 05:13


Very true, though those sequencing tools are changing more and more rapidly. Seems like by the time we get a sequencer and trained operator, the next new thing is "in." Glad it's not me!
View user's profile View All Posts By User

  Go To Top