The BioMart for Uniprot does not want to give any real protein features.
That's a bit disappointing.
Back to Ensembl:
According to this PDF:
http://www.stat.berkeley.edu/~sandrine/Teaching/PH292.F07/Docs/Durinck.pdf
I should be able to getSequence in Ensembl and get back the protein sequence.
This is possible.
For example, for BRCA1, using Ensembl:
> seq = getSequence(id="BRCA1", type="hgnc_symbol", seqType="peptide", mart=mart)
Challenge: it returns 29 different sequences!!!!
No comments:
Post a Comment