Skip to main content

Protein Sequence DatabasesPIR, SWISS-PROT and TREMBEL


Protein Sequence Databases
PIR, SWISS-PROT and TREMBEL

1. Introduction

Protein sequence databases are biological databases that store information about amino acid sequences of proteins, along with their functional, structural, and biochemical characteristics. Since proteins are the functional molecules of the cell, protein databases are essential for understanding gene expression, metabolism, enzymatic activity, signaling pathways, and evolution.
Protein sequence databases mainly contain data derived from translated nucleotide sequences and experimental protein studies.

2. Types of Protein Sequence Databases

Protein sequence databases are broadly classified into:

A. Primary Protein Databases

Contain original protein sequence data
Minimal or no manual annotation

B. Secondary Protein Databases
Derived from primary databases
Provide curated functional and structural information

C. Composite Protein Databases
Combine protein data from multiple sources
Reduce redundancy
3. Protein Information Resource (PIR)

Overview
Protein Information Resource (PIR) is one of the earliest protein sequence databases, developed to store and analyze protein sequences.

Maintained by

Georgetown University (USA)
In collaboration with NBRF (National Biomedical Research Foundation)


Data Content

Protein sequences
Functional information
Evolutionary relationships
Classification into protein families

Unique Features
Organized into protein superfamilies
Emphasis on evolutionary and functional classification
Non-redundant dataset

Advantages
High-quality annotations
Useful for comparative protein studies

Limitations
Smaller than newer databases
Less frequently updated compared to UniProt


4. SWISS-PROT Database

Overview
SWISS-PROT is a manually curated, high-quality protein sequence database known for its accuracy and reliability.

Maintained by
Swiss Institute of Bioinformatics (SIB)
European Bioinformatics Institute (EMBL-EBI)

Data Content

Amino acid sequences
Protein function
Enzyme activity
Post-translational modifications
Domain structure
Subcellular localization


Key Features

Manual curation by experts
Minimal redundancy
High annotation accuracy
Extensive cross-references


SWISS-PROT Entry Includes : 
Accession number
Protein name
Organism
Function
Sequence length
Amino acid sequence

Advantages
Highly reliable
Preferred for functional studies
Limitations
Slow growth due to manual annotation

5. TrEMBL (Translated EMBL)

Overview
TrEMBL is a computer-annotated protein database that contains protein sequences translated from nucleotide sequence databases.

Maintained by
EMBL-EBI
Swiss Institute of Bioinformatics

Data Source
Translations of coding sequences from:
EMBL
GenBank
DDBJ
Key Features
Automatically annotated
Large and rapidly growing database
Supplement to SWISS-PROT

Advantages
Covers newly discovered proteins
Fast data availability

Limitations

Annotation may contain errors
Less reliable than SWISS-PROT

6. UniProt Knowledgebase (UniProtKB)

SWISS-PROT and TrEMBL together form the UniProt Knowledgebase (UniProtKB).
Components
UniProtKB/Swiss-Prot – reviewed, manually curated
UniProtKB/TrEMBL – unreviewed, automatically annotated

Purpose
Provide comprehensive protein sequence and functional information
Serve as a central protein knowledge hub


7. Comparison of PIR, SWISS-PROT, and TrEMBL


8. Applications of Protein Sequence Databases

Protein function prediction
Identification of conserved domains
Comparative protein analysis
Phylogenetic studies
Drug target identification
Enzyme characterization

9. Importance of Protein Sequence Databases
Link genes to protein function
Support proteomics research
Assist in metabolic pathway analysis
Aid in molecular evolution studies
Help in crop improvement and biotechnology

10. Conclusion
Protein sequence databases such as PIR, SWISS-PROT, and TrEMBL play a vital role in modern bioinformatics. While SWISS-PROT provides high-quality, manually curated protein data, TrEMBL ensures rapid availability of newly sequenced proteins. PIR contributes valuable evolutionary and functional classifications. Together, these databases support comprehensive protein research and biological discovery.

Comments

Popular Posts

❥NORTHERN BLOTTING

NORTHERN BLOTTING – 30 MARK DETAILED NOTES  𓆞❥ 𓆞❥ 𓆞❥ 𓆞❥ 𓆞❥ 𓆞 ❥ 𓆞❥ 𓆞❥  Northern blotting is a molecular biology technique used to detect specific RNA molecules in a complex mixture. It provides information about gene expression, RNA size, and transcript abundance by hybridizing RNA with a labeled complementary DNA or RNA probe. 📌 Named by analogy to Southern blotting (DNA detection). 2. Principle The principle of Northern blotting is based on: Separation of RNA molecules by size using denaturing agarose gel electrophoresis Transfer (blotting) of separated RNA onto a nylon or nitrocellulose membrane Hybridization of membrane-bound RNA with a labeled complementary probe Detection of RNA–probe hybrids by autoradiography or chemiluminescence ✔ Only RNA sequences complementary to the probe will be detected. 3. Types of RNA Analyzed mRNA (most common) rRNA tRNA miRNA and siRNA (with modified protocols) 4. Requirements / Materials Total RNA or poly(A)+ RNA Denaturing agarose ...

Biological Databases – Types of Data and DatabasesNucleotide Sequence Databases (EMBL, GenBank, DDBJ)

Biological Databases – Types of Data and Databases Nucleotide Sequence Databases (EMBL, GenBank, DDBJ) 1. Introduction Biological databases are systematic, computerized collections of biological information that allow efficient storage, retrieval, updating, and analysis of large volumes of biological data. With the advent of genome sequencing, molecular biology, and bioinformatics, biological databases have become essential tools in biological research. These databases support studies in genomics, proteomics, evolutionary biology, taxonomy, medicine, agriculture, and biotechnology. 2. Types of Data Stored in Biological Databases Biological databases store diverse types of biological information, including: 1. Sequence Data DNA sequences RNA sequences Protein sequences 2. Structural Data Three-dimensional structures of proteins Nucleic acid structures 3. Functional Data Gene functions Enzyme activity Regulatory elements 4. Genomic Annotation Data Gene location Exons, introns Promoters a...

Information retrieval from databases - search concepts, Tools for searching, homology searching, finding Domain and Functional site homologies

Information retrieval from databases - search concepts, Tools for searching, homology searching, finding Domain and Functional site homologies Information Retrieval from Databases 1. Introduction Information retrieval in bioinformatics refers to the process of extracting relevant biological data (DNA, RNA, protein sequences, structures, or functional information) from databases. Aim : Identify sequences, functions, or structural features for analysis, comparison, and annotation. Databases can be primary (raw sequence data) or secondary/derived (annotated, processed data). 2. Search Concepts in Biological Databases 2.1 Types of Searches Exact Match Search Returns results only if the query exactly matches database entries. Useful for known accession numbers or IDs. Pattern/Keyword Search Searches based on specific motifs, keywords, or annotations. Example: “kinase domain,” “signal peptide.” Similarity/Homology Search Detects sequences similar to the query based on sequence alignment. Use...

❃HPLC – High Performance Liquid Chromatography

HPLC – High Performance Liquid Chromatography ┏━━━━━ •❃°•°❀°•°❃•━━━━•━━━┓  1. Introduction High Performance Liquid Chromatography (HPLC) is an advanced analytical technique used for the separation, identification, and quantification of components present in a mixture. It is based on the differential distribution of analytes between a stationary phase and a liquid mobile phase under high pressure. HPLC is widely used in biochemistry, biotechnology, pharmaceuticals, food analysis, environmental studies, and clinical diagnostics. 2. Principle of HPLC The principle of HPLC is based on partition, adsorption, ion-exchange, or size-exclusion mechanisms, depending on the type of column used. A liquid mobile phase is pumped at high pressure through a column packed with fine stationary phase particles Sample components interact differently with the stationary phase Components with stronger interaction elute slower Components with weaker interaction elute faster Separated components are detec...

Exploitation of Somaclonal and Gametoclonal Variations for Plant Improvement

Exploitation of Somaclonal and Gametoclonal Variations for Plant Improvement  1. Introduction Plant tissue culture often induces genetic and epigenetic variations among regenerated plants. These variations, when stable and heritable, can be exploited as a source of novel traits for crop improvement. Somaclonal variation: Variation arising in plants regenerated from somatic cells cultured in vitro. Gametoclonal variation: Variation arising in plants regenerated from gametic cells (anther, pollen, ovule culture). Both provide additional genetic variability beyond conventional breeding. 2. Somaclonal Variation 2.1 Definition Somaclonal variation refers to genetic variation observed among plants regenerated from somatic tissue cultures, such as callus, suspension cultures, or explants. Term coined by Larkin and Scowcroft (1981). 2.2 Sources of Somaclonal Variation Chromosomal changes Aneuploidy Polyploidy Chromosome rearrangements Gene mutations Point mutations Insertions and deletions...

Microbial Production of PharmaceuticalsSomatostatin, Humulin and Interferons

Microbial Production of Pharmaceuticals Somatostatin, Humulin and Interferons 1. Introduction Advances in recombinant DNA technology have enabled microorganisms to produce human therapeutic proteins safely, economically and in large quantities. Microbial systems such as Escherichia coli and yeast (Saccharomyces cerevisiae) are widely used for the production of pharmaceuticals that were earlier isolated from human or animal tissues. Important microbial-derived pharmaceuticals include somatostatin, human insulin (Humulin) and interferons. 2. Advantages of Microbial Production of Pharmaceuticals High yield and rapid production Cost-effective and scalable Free from animal pathogens Consistent product quality Easy genetic manipulation 3. General Steps in Microbial Production of Recombinant Pharmaceuticals Isolation of target gene Construction of recombinant DNA Insertion into suitable vector Transformation into host microorganism Expression of protein Downstream processing and purification ...

𓆉 INDEX PAGE -NOTETHEPOINT43

INDEX PAGE   MAIN    CONTENT 1.   HSST BOTANY SYLLABUS, DETAILED NOTES, MCQ 2.  SET GENERAL PAPER SYLLABUS, DETAILED NOTES, 50MCQ 3.  SET BOTANY SYLLABUS, DETAILED NOTES, MCQ 4. MSC BOTANY THIRD SEMESTER SYLLABUS, NOTES (KERALA UNIVERSITY ) 5. MSC BOTANY THIRD SEMESTER QUESTION PAPER (KERALA UNIVERSITY ) 6. MSC BOTANY FOURTH SEMESTER SYLLABUS &NOTES (KERALA UNIVERSITY ) 7. FOURTH SEMESTER MSC BOTANY PREVIOUS QUESTION PAPER  (KERALA UNIVERSITY )