This is documentation for Mathematica 8, which was
based on an earlier version of the Wolfram Language.
View current documentation (Version 11.1)

ProteinData

ProteinData
gives the reference amino acid sequence for the protein prot.
ProteinData
gives the value of the specified property for the protein prot.
  • Proteins are specified by standard names such as .
  • ProteinData gives a list of all reference human proteins.
  • Protein sequences are represented as strings of standard single-letter amino acid codes.
  • Fundamental properties include:
"MolecularWeight"total molecular weight in daltons
  • Sequence properties for proteins include:
"DNACodingSequence"base pair sequence coding for the protein
"DNACodingSequenceLength"length of base pair sequence coding for the protein
"Gene"gene that codes for the protein
"Sequence"amino acid sequence for the protein
"SequenceLength"length of amino acid sequence for the protein
  • Protein structures may contain additional elements not explicitly encoded in the original DNA sequence.
  • Molecular structure properties based on residues include:
"DihedralAngles"list of dihedral angles , , in radians
"SecondaryStructureRules"list of rules giving start and end positions of helix, sheet, etc. structures
  • Molecular structure properties based on individual atoms include:
"AdditionalAtomPositions"list of 3D coordinates of additional atoms
"AdditionalAtomTypes"list of element symbols for additional atoms
"AtomPositions"list of 3D coordinates of protein atoms
"AtomRoles"list of structural roles for protein atoms
"AtomTypes"list of element symbols for protein atoms
"GyrationRadius"radius of gyration
"MoleculePlot"3D molecular structure plot
  • Distances are measured in picometers.
  • ProteinData gives molecular structure properties with various groupings:
{}no grouping
"Chain"group by chain
"Residue"group by residue
{g1,g2,...}list of grouping criteria
  • Properties associated with chains within structures include:
"ChainLabels"list of identifiers for 3D structure chains
"ChainSequences"list of amino acid sequences for 3D structure chains
  • Protein common domain properties include:
"DomainIDs"NCBI CDD numbers of domains
"DomainPositions"positions of domains in the protein sequence
"Domains"names of domains in the protein
  • Functional properties include:
"BiologicalProcesses"biological processes associated with the protein
"CellularComponents"cellular components in which the protein is found
"MolecularFunctions"molecular functions of the protein
  • Protein identification properties include:
"AlternateNames"alternate traditional names
"GeneID"GeneID number string for the protein's gene
"Name"traditional name
"NCBIAccessions"NCBI accession strings
"PDBIDList"list of all PDB ID strings
"PrimaryPDBID"PDB ID chosen in Mathematica for structure properties, etc.
"StandardName"standard Mathematica name
  • ProteinData gives the units for a particular property value.
Get a list of human proteins:
Display the ribbon diagram:
Get the amino acid sequence of a protein:
Get the molecular weight of a protein:
Get the number of amino acids in a protein sequence:
Get the coordinates of atoms in a 3D protein structure:
Get a list of human proteins:
In[1]:=
Click for copyable input
Out[1]//Short=
In[2]:=
Click for copyable input
Out[2]=
 
Display the ribbon diagram:
In[1]:=
Click for copyable input
Out[1]=
 
Get the amino acid sequence of a protein:
In[1]:=
Click for copyable input
Out[1]=
 
Get the molecular weight of a protein:
In[1]:=
Click for copyable input
Out[1]=
 
Get the number of amino acids in a protein sequence:
In[1]:=
Click for copyable input
Out[1]=
 
Get the coordinates of atoms in a 3D protein structure:
In[1]:=
Click for copyable input
Out[1]//Short=
Obtain a list of protein names:
Find the English name of a protein:
Get a list of protein classes:
Find protein classes related to DNA binding:
Get a list of proteins involved in DNA binding:
Get a list of groups a protein belongs to:
Test whether a protein belongs to a class:
Plot the ribbon diagram for a protein:
Get the 3D coordinates of each atom in a protein structure:
Get the corresponding atom types:
Group the atom coordinates by residue:
Group by chain:
Group by chain and by residue:
Get the atom types for a particular residue:
Get the alpha-carbon atoms from each residue:
Use the alpha-carbon positions to render the protein backbone:
Get a list of properties for a particular protein:
Get a short textual description of a property:
Get the gene that encodes a protein:
Get the names of all proteins encoded by the gene from GenomeData:
Display a protein using atom sizes from ElementData and colorings from ColorData:
Show the conformation of a protein backbone using Tube and BezierCurve:
A random collection of protein backbones:
Show the Ramachandran plot for a protein:
New in 7