Skip to main content

Protein Sequence DatabasesPIR, SWISS-PROT and TREMBEL


Protein Sequence Databases
PIR, SWISS-PROT and TREMBEL

1. Introduction

Protein sequence databases are biological databases that store information about amino acid sequences of proteins, along with their functional, structural, and biochemical characteristics. Since proteins are the functional molecules of the cell, protein databases are essential for understanding gene expression, metabolism, enzymatic activity, signaling pathways, and evolution.
Protein sequence databases mainly contain data derived from translated nucleotide sequences and experimental protein studies.

2. Types of Protein Sequence Databases

Protein sequence databases are broadly classified into:

A. Primary Protein Databases

Contain original protein sequence data
Minimal or no manual annotation

B. Secondary Protein Databases
Derived from primary databases
Provide curated functional and structural information

C. Composite Protein Databases
Combine protein data from multiple sources
Reduce redundancy
3. Protein Information Resource (PIR)

Overview
Protein Information Resource (PIR) is one of the earliest protein sequence databases, developed to store and analyze protein sequences.

Maintained by

Georgetown University (USA)
In collaboration with NBRF (National Biomedical Research Foundation)


Data Content

Protein sequences
Functional information
Evolutionary relationships
Classification into protein families

Unique Features
Organized into protein superfamilies
Emphasis on evolutionary and functional classification
Non-redundant dataset

Advantages
High-quality annotations
Useful for comparative protein studies

Limitations
Smaller than newer databases
Less frequently updated compared to UniProt


4. SWISS-PROT Database

Overview
SWISS-PROT is a manually curated, high-quality protein sequence database known for its accuracy and reliability.

Maintained by
Swiss Institute of Bioinformatics (SIB)
European Bioinformatics Institute (EMBL-EBI)

Data Content

Amino acid sequences
Protein function
Enzyme activity
Post-translational modifications
Domain structure
Subcellular localization


Key Features

Manual curation by experts
Minimal redundancy
High annotation accuracy
Extensive cross-references


SWISS-PROT Entry Includes : 
Accession number
Protein name
Organism
Function
Sequence length
Amino acid sequence

Advantages
Highly reliable
Preferred for functional studies
Limitations
Slow growth due to manual annotation

5. TrEMBL (Translated EMBL)

Overview
TrEMBL is a computer-annotated protein database that contains protein sequences translated from nucleotide sequence databases.

Maintained by
EMBL-EBI
Swiss Institute of Bioinformatics

Data Source
Translations of coding sequences from:
EMBL
GenBank
DDBJ
Key Features
Automatically annotated
Large and rapidly growing database
Supplement to SWISS-PROT

Advantages
Covers newly discovered proteins
Fast data availability

Limitations

Annotation may contain errors
Less reliable than SWISS-PROT

6. UniProt Knowledgebase (UniProtKB)

SWISS-PROT and TrEMBL together form the UniProt Knowledgebase (UniProtKB).
Components
UniProtKB/Swiss-Prot – reviewed, manually curated
UniProtKB/TrEMBL – unreviewed, automatically annotated

Purpose
Provide comprehensive protein sequence and functional information
Serve as a central protein knowledge hub


7. Comparison of PIR, SWISS-PROT, and TrEMBL


8. Applications of Protein Sequence Databases

Protein function prediction
Identification of conserved domains
Comparative protein analysis
Phylogenetic studies
Drug target identification
Enzyme characterization

9. Importance of Protein Sequence Databases
Link genes to protein function
Support proteomics research
Assist in metabolic pathway analysis
Aid in molecular evolution studies
Help in crop improvement and biotechnology

10. Conclusion
Protein sequence databases such as PIR, SWISS-PROT, and TrEMBL play a vital role in modern bioinformatics. While SWISS-PROT provides high-quality, manually curated protein data, TrEMBL ensures rapid availability of newly sequenced proteins. PIR contributes valuable evolutionary and functional classifications. Together, these databases support comprehensive protein research and biological discovery.

Comments

Popular Posts

❃HPLC – High Performance Liquid Chromatography

HPLC – High Performance Liquid Chromatography ┏━━━━━ •❃°•°❀°•°❃•━━━━•━━━┓  1. Introduction High Performance Liquid Chromatography (HPLC) is an advanced analytical technique used for the separation, identification, and quantification of components present in a mixture. It is based on the differential distribution of analytes between a stationary phase and a liquid mobile phase under high pressure. HPLC is widely used in biochemistry, biotechnology, pharmaceuticals, food analysis, environmental studies, and clinical diagnostics. 2. Principle of HPLC The principle of HPLC is based on partition, adsorption, ion-exchange, or size-exclusion mechanisms, depending on the type of column used. A liquid mobile phase is pumped at high pressure through a column packed with fine stationary phase particles Sample components interact differently with the stationary phase Components with stronger interaction elute slower Components with weaker interaction elute faster Separated components are detec...

Microbial Production of PharmaceuticalsSomatostatin, Humulin and Interferons

Microbial Production of Pharmaceuticals Somatostatin, Humulin and Interferons 1. Introduction Advances in recombinant DNA technology have enabled microorganisms to produce human therapeutic proteins safely, economically and in large quantities. Microbial systems such as Escherichia coli and yeast (Saccharomyces cerevisiae) are widely used for the production of pharmaceuticals that were earlier isolated from human or animal tissues. Important microbial-derived pharmaceuticals include somatostatin, human insulin (Humulin) and interferons. 2. Advantages of Microbial Production of Pharmaceuticals High yield and rapid production Cost-effective and scalable Free from animal pathogens Consistent product quality Easy genetic manipulation 3. General Steps in Microbial Production of Recombinant Pharmaceuticals Isolation of target gene Construction of recombinant DNA Insertion into suitable vector Transformation into host microorganism Expression of protein Downstream processing and purification ...

••CLASSIFICATION OF ALGAE - FRITSCH

      MODULE -1       PHYCOLOGY  CLASSIFICATION OF ALGAE - FRITSCH  ❖F.E. Fritsch (1935, 1945) in his book“The Structure and  Reproduction of the Algae”proposed a system of classification of  algae. He treated algae giving rank of division and divided it into 11  classes. His classification of algae is mainly based upon characters of  pigments, flagella and reserve food material.     Classification of Fritsch was based on the following criteria o Pigmentation. o Types of flagella  o Assimilatory products  o Thallus structure  o Method of reproduction          Fritsch divided algae into the following 11 classes  1. Chlorophyceae  2. Xanthophyceae  3. Chrysophyceae  4. Bacillariophyceae  5. Cryptophyceae  6. Dinophyceae  7. Chloromonadineae  8. Euglenineae    9. Phaeophyceae  10. Rhodophyceae  11. Myxophyce...

SCAR (Sequence Characterized Amplified Region) Markers

SCAR (Sequence Characterized Amplified Region) Markers   Introduction SCAR markers are PCR-based DNA markers derived from RAPD, AFLP, or other random markers. Developed by Paran and Michelmore in 1993 to convert dominant, less reproducible markers into specific, reproducible, co-dominant markers. SCAR markers are locus-specific, reproducible, and sequence-characterized, making them ideal for marker-assisted selection (MAS). Principle SCAR markers are designed based on known DNA sequences obtained from cloned RAPD/AFLP fragments. Specific primers (18–24 bp) are synthesized to amplify a single, defined locus. The PCR amplification of this region generates a distinct band, which is highly reproducible and can distinguish homozygotes from heterozygotes if designed as co-dominant. Key idea: Random marker (e.g., RAPD) → Cloning & sequencing → Design specific primers → PCR → SCAR marker Materials Required Genomic DNA from the organism Specific primers (18–24 bp) designed from sequence...

Intellectual Property Rights (IPR) – Detailed Notes

Intellectual Property Rights (IPR) – Detailed Notes 1. Introduction Intellectual Property Rights (IPR) are legal rights granted to creators and inventors over their creations or inventions. They protect innovation and creativity, providing the owner exclusive rights to use, sell, or license their creation. IPR encourages research, development, and economic growth by rewarding creativity. 2. Importance of IPR Protects inventions, designs, and creative work. Prevents unauthorized use, copying, or commercialization. Encourages innovation and research. Provides financial benefits to inventors through licensing or royalties. Supports economic growth and competitiveness. Safeguards traditional knowledge and biodiversity. 3. Types of Intellectual Property Rights A. Patents Definition: Exclusive right granted to an inventor for a new invention for a limited period (usually 20 years). Requirements: Novelty – must be new and not published. Inventive step – non-obvious to someone skilled in the f...

Single Nucleotide Polymorphisms (SNPs) – Detailed Notes

Single Nucleotide Polymorphisms (SNPs) – Detailed Notes 1. Definition SNPs are single base-pair variations in the DNA sequence that occur at a specific position in the genome among individuals of a species. Example: At a specific locus, one individual may have A while another has G: Copy code Individual 1: …A T C G A T…   Individual 2: …A T C G G T… SNPs are the most common type of genetic variation in most organisms. 2. Characteristics of SNPs Single base change: Involves substitution of one nucleotide for another (A↔G, C↔T). Biallelic nature: Most SNPs have only two alleles in a population. Widespread in the genome: Found in coding regions (exons), non-coding regions (introns, promoters, intergenic regions). Stable inheritance: Passed from generation to generation like other genetic markers. Frequency: Occur approximately every 100–300 bp in the human genome. 3 . Types of SNPs SNPs are categorized based on location or effect on gene function: A. Based on genomic location Cod...

Exploitation of Somaclonal and Gametoclonal Variations for Plant Improvement

Exploitation of Somaclonal and Gametoclonal Variations for Plant Improvement  1. Introduction Plant tissue culture often induces genetic and epigenetic variations among regenerated plants. These variations, when stable and heritable, can be exploited as a source of novel traits for crop improvement. Somaclonal variation: Variation arising in plants regenerated from somatic cells cultured in vitro. Gametoclonal variation: Variation arising in plants regenerated from gametic cells (anther, pollen, ovule culture). Both provide additional genetic variability beyond conventional breeding. 2. Somaclonal Variation 2.1 Definition Somaclonal variation refers to genetic variation observed among plants regenerated from somatic tissue cultures, such as callus, suspension cultures, or explants. Term coined by Larkin and Scowcroft (1981). 2.2 Sources of Somaclonal Variation Chromosomal changes Aneuploidy Polyploidy Chromosome rearrangements Gene mutations Point mutations Insertions and deletions...

❥NORTHERN BLOTTING

NORTHERN BLOTTING – 30 MARK DETAILED NOTES  π“†ž❥ π“†ž❥ π“†ž❥ π“†ž❥ π“†ž❥ π“†ž ❥ π“†ž❥ π“†ž❥  Northern blotting is a molecular biology technique used to detect specific RNA molecules in a complex mixture. It provides information about gene expression, RNA size, and transcript abundance by hybridizing RNA with a labeled complementary DNA or RNA probe. πŸ“Œ Named by analogy to Southern blotting (DNA detection). 2. Principle The principle of Northern blotting is based on: Separation of RNA molecules by size using denaturing agarose gel electrophoresis Transfer (blotting) of separated RNA onto a nylon or nitrocellulose membrane Hybridization of membrane-bound RNA with a labeled complementary probe Detection of RNA–probe hybrids by autoradiography or chemiluminescence ✔ Only RNA sequences complementary to the probe will be detected. 3. Types of RNA Analyzed mRNA (most common) rRNA tRNA miRNA and siRNA (with modified protocols) 4. Requirements / Materials Total RNA or poly(A)+ RNA Denaturing agarose ...

𓆉 INDEX PAGE -NOTETHEPOINT43

INDEX PAGE   MAIN    CONTENT 1.   HSST BOTANY SYLLABUS, DETAILED NOTES, MCQ 2.  SET GENERAL PAPER SYLLABUS, DETAILED NOTES, 50MCQ 3.  SET BOTANY SYLLABUS, DETAILED NOTES, MCQ 4. MSC BOTANY THIRD SEMESTER SYLLABUS, NOTES (KERALA UNIVERSITY ) 5. MSC BOTANY THIRD SEMESTER QUESTION PAPER (KERALA UNIVERSITY ) 6. MSC BOTANY FOURTH SEMESTER SYLLABUS &NOTES (KERALA UNIVERSITY ) 7. FOURTH SEMESTER MSC BOTANY PREVIOUS QUESTION PAPER  (KERALA UNIVERSITY )

Fourth Semester M.Sc. Degree Examination, September 2019BotanySpecial Paper II - ElectiveBO 242 a: BIOTECHNOLOGY(2013 Admission onwards)

Reg. No.......  Name......... G-5263 Fourth Semester M.Sc. Degree Examination, September 2019 Botany Special Paper II - Elective BO 242 a: BIOTECHNOLOGY (2013 Admission onwards) Max. Marks: 75 1. Answer the following questions: 1. Humulin 2. YAC 3. Cybrids 4. Hybridomas 5. IPR 6. Gene therapy 7. C DNA library 8. AFLP 9. Hairy root culture 10. Somacional variation (10 x 1=10 Marks) II. Answer the following questions in not more than 50 words : 11. (a) What are immobilized enzymes? What is its advantage? OR (b) Write a short note on molecular farming. 12. (a) Give an account of bioprocess technology for the production of secondary metabolites. OR (b) What are bioreactors? How it operates? 13. (a) What are probiotics?. How do they work? OR (b) Discuss the methodology and application of western blotting. 14. (a) Briefly explain the application of protoplast culture OR (b) Write a short note on gene therapy 15. (a) What are reporter genes? Discuss its utility in transformation studies O...