Wednesday, 11 June 2014

Getting protein size

The BioMart for Uniprot does not want to give any real protein features.

That's a bit disappointing.

Back to Ensembl:

According to this PDF:
http://www.stat.berkeley.edu/~sandrine/Teaching/PH292.F07/Docs/Durinck.pdf

I should be able to getSequence in Ensembl and get back the protein sequence.

This is possible.

For example, for BRCA1, using Ensembl:

> seq = getSequence(id="BRCA1", type="hgnc_symbol", seqType="peptide", mart=mart)

Challenge: it returns 29 different sequences!!!!


No comments:

Post a Comment