Skip to main content

Protein Structure Database (PDB)


Protein Structure Database (PDB)

Introduction

The Protein Structure Database (PDB) is the primary global repository for the three-dimensional (3D) structures of biological macromolecules such as proteins, nucleic acids, and protein–ligand complexes.
These structures are determined experimentally using techniques like X-ray crystallography, Nuclear Magnetic Resonance (NMR) spectroscopy, and Cryo-Electron Microscopy (Cryo-EM).


PDB plays a vital role in understanding:
Protein structure and function
Molecular interactions
Drug discovery and design
Structural biology and bioinformatics


History and Development
Established in 1971
Founded by Brookhaven National Laboratory (USA)
Initially contained only 7 protein structures
Now maintained by the Worldwide Protein Data Bank (wwPDB)

Members of wwPDB
RCSB PDB (USA)
PDBe (Europe)
PDBj (Japan)
BMRB (Biological Magnetic Resonance Data Bank)


Objectives of PDB
To collect, store, and distribute 3D structural data of biomolecules
To provide free and open access to structural information
To ensure standardization and validation of structural data
To support research and education worldwide

Types of Molecules Stored in PDB
Proteins
Enzymes
Nucleic acids (DNA, RNA)
Protein–protein complexes
Protein–ligand and protein–drug complexes
Virus capsids and ribosomes.


Experimental Methods Used

1. X-ray Crystallography

Most common method
Requires protein crystallization
Provides high-resolution structures

2. NMR Spectroscopy
Used for small proteins
Structure determined in solution
Multiple conformations possible

3. Cryo-Electron Microscopy (Cryo-EM)

Suitable for large complexes
No need for crystallization
Rapidly growing method

PDB File Format

Each structure in PDB is assigned a unique 4-character PDB ID (e.g., 1ABC).

Main Components of a PDB File

HEADER – General information
TITLE – Description of the structure
EXPDTA – Experimental method
ATOM – Atomic coordinates
HETATM – Non-standard atoms (ligands, ions)
SEQRES – Amino acid sequence
CONECT – Bond connectivity
END – End of file


Data Organization in PDB

Primary structure: Amino acid sequence
Secondary structure: α-helices, β-sheets
Tertiary structure: 3D folding
Quaternary structure: Multisubunit assembly


Structural Validation in PDB

Before acceptance, each structure undergoes quality checks:
Resolution assessment
Ramachandran plot analysis
Bond length and angle validation
Steric clash detection
Validation reports are available for each PDB entry.

Access and Tools Provided by PDB

Structure visualization (3D viewers)
Sequence and structure search tools
Structure comparison
Download options in multiple formats
Educational resources.


Applications of PDB

1. Structural Biology

Understanding protein folding
Studying structure–function relationships

2. Drug Discovery

Structure-based drug design
Identification of drug binding sites

3. Bioinformatics

Homology modeling
Molecular docking studies
Protein classification

4. Medical and Clinical Research

Studying disease-related mutations
Designing therapeutic proteins

5. Education

Teaching protein structure and function
Training students in molecular biology


Advantages of PDB

Free and open access
High-quality curated data
Global collaboration
Supports advanced computational analysis


Limitations of PDB

Not all proteins have known structures
Some structures have low resolution
Static structures may not reflect dynamic behavior

Importance of PDB in Modern Biology
PDB acts as a bridge between sequence and function, helping scientists understand how molecular structure determines biological activity. It is an indispensable resource in genomics, proteomics, drug discovery, and biotechnology.


Conclusion

The Protein Structure Database (PDB) is the world’s most important repository for 3D macromolecular structures. By providing reliable and accessible structural data, PDB supports research in biology, medicine, and bioinformatics, making it a cornerstone of modern life sciences.

Comments

Popular Posts

Fourth Semester M.Sc. Degree Examination, March 2021Time: 3 HoursBotanyBO 241: BIOINFORMATICS AND BIOPHYSICS(2019 Admission)

Fourth Semester M.Sc. Degree Examination, March 2021 Time: 3 Hours Botany BO 241: BIOINFORMATICS AND BIOPHYSICS (2019 Admission) 1. Answer the following questions. 1. Expand EMBL and DDBJ. 2. What do bootstrap values indicate? 3. What is multiple sequence alignment? 4. What is SNP? 5. Define transcriptome. 6. What is Smith Waterman algorithm? 7. Comment on Phylip. 8. What are the factors that determine the electrophoretic mobility of a particle? 9. Differentiate between resolution and resolving power of the microscope. 10. Which are the factors that determine the sedimentation of a component during centrifugation? (10 x 1= 10 Marks) II. Answer the following questions in not more than 50 words. 11. (a) What is the difference between rooted and unrooted phylogenetic tree? OR (b) What is ORF? What is its significance in functional genomics? 12. (a) Explain the use of GENSCAN. OR (b) Explain the assumptions in molecular clock hypothesis. 13. (a) Write a brief explanation on KEGG. OR (b) Co...

Secondary Databases (PROSITE, PRINTS, BLOCKS)

Secondary Databases (PROSITE, PRINTS, BLOCKS  Secondary Databases Introduction Biological databases are broadly classified into primary and secondary databases. Primary databases store raw experimental data (e.g., nucleotide or protein sequences), whereas secondary databases contain derived information obtained by analyzing primary sequence data. Secondary databases are mainly used to: Identify protein families Detect conserved motifs, patterns, and domains Predict protein function Study structure–function relationships Examples of secondary databases include PROSITE, PRINTS, BLOCKS, Pfam, etc. 1. PROSITE Database Definition PROSITE is a secondary database that documents protein domains, families, and functional sites in the form of patterns and profiles. Developed by Swiss Institute of Bioinformatics (SIB) Maintained along with UniProt Principle PROSITE is based on the idea that functionally important regions of proteins are conserved during evolution. These conserved regions can ...

Intellectual Property Rights (IPR) – Detailed Notes

Intellectual Property Rights (IPR) – Detailed Notes 1. Introduction Intellectual Property Rights (IPR) are legal rights granted to creators and inventors over their creations or inventions. They protect innovation and creativity, providing the owner exclusive rights to use, sell, or license their creation. IPR encourages research, development, and economic growth by rewarding creativity. 2. Importance of IPR Protects inventions, designs, and creative work. Prevents unauthorized use, copying, or commercialization. Encourages innovation and research. Provides financial benefits to inventors through licensing or royalties. Supports economic growth and competitiveness. Safeguards traditional knowledge and biodiversity. 3. Types of Intellectual Property Rights A. Patents Definition: Exclusive right granted to an inventor for a new invention for a limited period (usually 20 years). Requirements: Novelty – must be new and not published. Inventive step – non-obvious to someone skilled in the f...

Electroporation – Detailed Notes

Electroporation – Detailed Notes Definition : Electroporation is a physical method of gene transfer in which cells are exposed to a brief, high-voltage electric pulse, creating temporary pores in the cell membrane. This allows DNA, RNA, proteins, or other molecules to enter the cytoplasm. It is widely used in bacteria, yeast, plant protoplasts, and mammalian cells. Key Concept: The electric field destabilizes the membrane, making it permeable to macromolecules. 1. Principle Cells are suspended in a conductive medium. A brief electrical pulse induces transient pores in the plasma membrane. DNA or other molecules present in the medium enter the cell through these pores. Membrane reseals after the pulse, and the molecule is retained inside the cell. Advantages of Principle: Direct and rapid. Works in many cell types. Does not require chemical carriers or viral vectors. 2. Materials Required Cells – bacterial, yeast, plant protoplasts, mammalian cells. DNA/RNA/other macromolecule – purifie...

Third Semester M.Sc. Degree Examination, January 2023 Botany BO 231 PLANT BREEDING, HORTICULTURE AND BIOSTATISTICS

Third Semester M.Sc. Degree Examination, January 2023 Botany BO 231 PLANT BREEDING, HORTICULTURE AND BIOSTATISTICS Time: Three Hours (2019 Admission Onwards) I. Answer the following questions. 1.What is green super rice? 2.What are the functions of ICAR-NBPGR? 3.Give the importance of floral biology in plant breeding. 4.How do you develop a synthetic variety? 5.Where can you find gene-for-gene relationships? 6.Describe the significance of biodiversity policy. 7.What is Olericulture? 8.Describe the advantages of in door garden. 9.What is Students's t-test? 10. Explain Ogive graph. (10 × 1 = 10 Marks) 11.Answer the following questions in not more than 50 words . 11. (a) Explain hybridization and mention it's procedure. OR (b) Write short notes on the concept of centers of origin proposed by Vavilov. 12. (a) Describe cytoplasmic male sterility and its uses. OR (b) Explain the role of interspecific and intergeneric hybridization. 13. (a) What is seed certification? How is it done? ...

Third Semester M.Sc. Degree Examination, February 2024 231: PLANT BREEDING, HORTICULTURE AND BIOSTATISTICS

Third Semester M.Sc. Degree Examination, February 2024                 Botany BO 231: PLANT BREEDING, HORTICULTURE AND BIOSTATISTICS (2019 Admission onwards) Time: 3 Hours I.Answer the following questions. 1.What is atomic gardening? 2.Name the cardamom research institute in Kerala. 3.Explain advantages of distant hybridisation. 4.Describe plant variety rights. 5.Write short notes on arboriculture. 6.What is vermicomposting? 7.Give short notes on cut flower industry. 8.What is ANOVA? 9.Describe the properties of binomial distribution. 10. Explain the use of LSD. Max. Marks: 75 (10 x 1 = 10 Marks) II.Answer the following questions in not more that 50 words. 11. (a) What do you mean by genetic modification techniques? OR (b) What is center of diversity of a species? 12. (a) Compare auto and allopolyploidy. OR (b) What are requirements of back cross breeding? 13. (a) Describe ideotype breeding and its significance. OR (b) What is the role of seed cer...

❃LC-MS (LIQUID CHROMATOGRAPHY – MASS SPECTROMETRY)

LC-MS (LIQUID CHROMATOGRAPHY – MASS SPECTROMETRY)  ┏━━━━━ •❃°•°❀°•°❃•━━━━•━━━┓ 1. INTRODUCTION LC-MS is a hyphenated analytical technique combining Liquid Chromatography (LC) and Mass Spectrometry (MS). It is used for separation, identification, and quantification of compounds in complex mixtures. LC separates analytes based on polarity, size, or charge, while MS detects molecules based on mass-to-charge ratio (m/z). Developed in the 1970s–1980s, LC-MS is now widely used in pharmaceutical, clinical, environmental, and food analysis. Importance : Detects trace levels of compounds (ng–pg range) Analyzes non-volatile, thermally labile compounds that cannot be analyzed by GC-MS Provides structural information through mass fragmentation Example: Detection of drugs in plasma, protein identification in proteomics, pesticide residue analysis in food. 2. COMPONENTS OF LC-MS The LC-MS system has three main parts: A. Liquid Chromatograph (LC) Function: Separates components of a mixture befor...

Fourth Semester M.Sc. Degree Examination, June 2022BotanySpecial Paper II - ElectiveBO 242 a: BIOTECHNOLOGY

Reg. No.: Name: N-6273 Fourth Semester M.Sc. Degree Examination, June 2022 Botany Special Paper II - Elective BO 242 a: BIOTECHNOLOGY Time: 3 Hours (2019 Admission Onwards) Max. Marks: 75 1. Instruction: Draw diagrams and illustrate with examples wherever necessary. Answer the following questions. 1. What are the desirable features of a cloning vehicle? 2. What is a palindrome? 3. What is the significance of Ori C site? 4. What is the actual function of restriction enzymes in a bacterial system?  5.Name any two bacteria and fungi used for alcohol fermentation. 6. What is a starter culture? 7. What are adapters? 8. What are probes? 9. What is biopiracy? 10. Define cybrids. (10 x 1 = 10 Marks) II. Answer the following questions in not more than 50 words .  11. (a) Why is callus culture a prerequisite for somaclonal variations? OR (b) How is virus elimination done via plant tissue culture? 12. (a) How is aeration maintained in a bioreactor? OR (b) What are the methods available f...

••CLASSIFICATION OF ALGAE - FRITSCH

      MODULE -1       PHYCOLOGY  CLASSIFICATION OF ALGAE - FRITSCH  ❖F.E. Fritsch (1935, 1945) in his book“The Structure and  Reproduction of the Algae”proposed a system of classification of  algae. He treated algae giving rank of division and divided it into 11  classes. His classification of algae is mainly based upon characters of  pigments, flagella and reserve food material.     Classification of Fritsch was based on the following criteria o Pigmentation. o Types of flagella  o Assimilatory products  o Thallus structure  o Method of reproduction          Fritsch divided algae into the following 11 classes  1. Chlorophyceae  2. Xanthophyceae  3. Chrysophyceae  4. Bacillariophyceae  5. Cryptophyceae  6. Dinophyceae  7. Chloromonadineae  8. Euglenineae    9. Phaeophyceae  10. Rhodophyceae  11. Myxophyce...

Protein Sequence DatabasesPIR, SWISS-PROT and TREMBEL

Protein Sequence Databases PIR, SWISS-PROT and TREMBEL 1. Introduction Protein sequence databases are biological databases that store information about amino acid sequences of proteins, along with their functional, structural, and biochemical characteristics. Since proteins are the functional molecules of the cell, protein databases are essential for understanding gene expression, metabolism, enzymatic activity, signaling pathways, and evolution. Protein sequence databases mainly contain data derived from translated nucleotide sequences and experimental protein studies. 2. Types of Protein Sequence Databases Protein sequence databases are broadly classified into: A. Primary Protein Databases Contain original protein sequence data Minimal or no manual annotation B. Secondary Protein Databases Derived from primary databases Provide curated functional and structural information C. Composite Protein Databases Combine protein data from multiple sources Reduce redundancy 3. Protein Informati...