-
See Also
- ServiceExecute
- ServiceConnect
- BioMolecule
- BioSequence
-
- Service Connections
- ESMAtlas
-
-
See Also
- ServiceExecute
- ServiceConnect
- BioMolecule
- BioSequence
-
- Service Connections
- ESMAtlas
-
See Also
"RCSBProteinDataBank" (Service Connection)

Connecting & Authenticating
Requests
BioMolecule Structures
"BioMolecule" — get a BioMolecule structure from RCSB Protein Data Bank
"PDBStructureID" | None | PDB structure ID | |
"AssemblyNumber" | None | biological assembly of the reference structure |
Data
"EntryData" — get the relevant data present in the structure file from RCSB Protein Data Bank as a Dataset
"PDBStructureID" | None | PDB structure ID |
"ChemicalComponentData" — get the relevant information about residues present in the RCSB Protein Data Bank
"ComponentID" | None | chemical ID of a residue |
PDB IDs
"TextSearch" — get the relevant PDB IDs from RCSB PDB by giving a simple text query
"Query" | None | search query | |
"StartIndex" | 1 | start index of the output structures | |
MaxItems | 10 | total number of output structures |
"SequenceSearch" — get the relevant PDB IDs from RCSB PDB by providing a BioSequence
"BioSequence" | None | sequence to search against | |
"MinimumOverlapFraction" | None | minimum overlap fraction between two sequences | |
"StartIndex" | 1 | start index of the output structures | |
MaxItems | 10 | total number of output structures |
"SimilarStructures" — get the relevant PDB IDs from RCSB PDB through structure search by providing a reference PDB ID or BioMolecule.
"PDBStructureID" | None | PDB ID of the reference structure |
"BioMolecule" | None | reference BioMolecule |
"AssemblyNumber" | 1 | biological assembly of the reference structure |
"ShapeMatchCriteria" | "Strict" | how the shapes should be matched |
"StartIndex" | 1 | starting index of output structures |
MaxItems | 10 | total number of output structures |
"SequenceMotifSearch" — get the relevant PDB IDs from RCSB PDB by searching for a sequence motif
"Motif" | None | motif to search against; can be a string or BioSequence object | |
"PatternType" | None | type of the input motif | |
"SequenceType" | None | type of the sequence motif | |
"StartIndex" | 1 | start index of the output structures | |
MaxItems | 10 | total number of output structures |
Parameter Details
"Strict" | strict matching of reference structure | |
"Relaxed" | relaxed matching of reference structure |
"Simple" | simple expression | |
"Regex" | regular expression | |
"Prosite" | Prosite expression |
"Protein" | protein sequence | |
"DNA" | DNA sequence | |
"RNA" | RNA sequence |
"X" | any single letter code of either protein, DNA or RNA | |
"{P}" | any amino acid except "P" ("Pro") | |
"[ST]" | either "S" ("Ser") or "T" ("Thr") | |
"X(2)" | same as "XX" | |
"X(2,4)" | "XX" or "XXX" or "XXXX" | |
"C-{S}-C-X(2)-[LIVMYFWC]" | "Prosite" format example | |
"C{S}CXX[LIVMYFWC]" | "Regex" format example | |
"CXCXXL" | "Simple" format example |
Examples
open all close allBasic Examples (2)
Get a BioMolecule object by providing the PDB ID through an ExternalIdentifier:
Visualize the BioMolecule:
Get a specific biological assembly by providing the "AssemblyNumber":
Visualize the BioMolecule:
Scope (6)
Get the data in the structure file:
Get the relevant information about molecules present in RCSB PDB:
Search for structures from RCSB PDB by giving a simple text query:
Visualize one of the structures:
Get a specific number of structures starting from a "StartIndex" and MaxItems parameters:
"BioSequence" can also be "RNA" or "DNA":
Search for structures that are similar to the input structure. Here, look for structures similar to human KRAS G12C mutant in the RCSB Protein Data Bank by providing the "PDBStructureID". By default, the first element of the output is the input structure:
Visualize the first five structures to see the similarity:
it is also possible to provide the biological assembly number through "Assembly" as shown here. By default, the "AssemblyNumber" is set to 1. Here is an example of searching for structures similar to biological assembly 2 of camel single-domain VH antibody fragment:
You can also search for structures using a "BioMolecule" as input. Here, look for structures in RCSB Protein Data Bank from a structure in the ESM Metagenomic Atlas:
Search for structures that contain a zinc finger sequence motif:
Visualize the structure of the first element of the search, where zinc is in purple:
"Motif" can also accept a BioSequence:
You can also search for "DNA" or "RNA" motifs:
Neat Examples (1)
Search structures from RCSB PDB using BioSequence:
BioSequence of the first structure:
Align this sequence with the reference sequence:
Compare the input sequence with the similar structure using SmithWatermanSimilarity: