Skip to main content

♡Introduction to Bioinformatics: Definition and History of Bioinformatics Internet. Computational Biology and Bioinformatics

Introduction to Bioinformatics: Definition and History of Bioinformatics Internet.
 Computational Biology and Bioinformatics

ﮩ٨ـﮩﮩ٨ـ♡ﮩ٨ـﮩﮩ٨ـﮩ٨ـﮩﮩ٨ـ♡ﮩ٨ـﮩﮩ٨ﮩ٨ـﮩﮩ٨ـ♡ﮩ٨ـﮩ

Definition of Bioinformatics



Bioinformatics is an interdisciplinary field that combines biology, computer science, mathematics, and statistics to collect, store, analyze, and interpret large volumes of biological data. It mainly deals with molecular biology data such as DNA, RNA, protein sequences, gene expression data, and biological networks.

Bioinformatics helps in understanding biological processes at the molecular level using computational tools. It plays a crucial role in modern biological research, especially after the availability of whole genome sequences.

According to NIH, “Bioinformatics is the application of computational tools to capture and interpret biological data.”


History and Evolution of Bioinformatics

Early Beginnings (Pre-1970)

The roots of bioinformatics date back to the 1950s–1960s.
In 1953, Watson and Crick discovered the structure of DNA, which laid the foundation for molecular biology.

In 1965, Margaret Dayhoff created the first protein sequence database called Atlas of Protein Sequence and Structure.
She also developed the PAM (Point Accepted Mutation) matrix, an important bioinformatics tool.

Development Phase (1970–1990)

The first DNA sequencing methods were developed by Sanger and Maxam–Gilbert.
In 1982, GenBank, a public nucleotide sequence database, was established.
Sequence alignment methods such as Needleman–Wunsch and Smith–Waterman algorithms were developed.

Bioinformatics emerged as a distinct discipline during this period.

Genome Era (1990–2005)
The Human Genome Project (HGP) started in 1990 and was completed in 2003.
Huge volumes of genomic data required advanced computational tools.
Development of databases like EMBL, DDBJ, PDB, and tools like BLAST.
Bioinformatics became essential for genome annotation and comparative genomics.

Post-Genome Era (2005–Present)

Advancement in next-generation sequencing (NGS) technologies.
Growth of proteomics, transcriptomics, metabolomics, and systems biology.
Integration of artificial intelligence and machine learning in bioinformatics.
Applications expanded to personalized medicine, drug discovery, and vaccine development.


Role of Internet in Bioinformatics

The Internet plays a vital role in the growth and application of bioinformatics by enabling:

Global access to biological databases.
Sharing of genome sequences and research data.

Online bioinformatics tools and web servers.
Collaboration among scientists worldwide.
Major bioinformatics resources accessible through the internet include:

NCBI (National Center for Biotechnology Information)
EMBL-EBI
UniProt
PDB
Without the internet, large-scale biological data analysis and collaboration would not be possible.


Computational Biology
Definition
Computational biology is a branch of biology that uses mathematical models, simulations, and algorithms to study biological systems.
It focuses more on theoretical modeling and understanding biological mechanisms rather than just data management.

Key Features

Uses mathematical and statistical modeling.
Studies complex biological systems.
Predicts biological behavior using simulations.
Emphasizes hypothesis-driven research.

Application

Modeling gene regulatory networks.
Protein structure prediction.
Population genetics.
Systems biology and pathway analysis.

Bioinformatics vs Computational Biology

Although often used interchangeably, they are slightly different:
Bioinformatics focuses on data storage, retrieval, and analysis.

Computational biology focuses on modeling and simulation of biological systems.
Bioinformatics is more data-driven, while computational biology is more theory-driven.
Both fields complement each other and are essential for modern biological research.


Importance and Applications of Bioinformatics

Genome sequencing and annotation.
Comparative genomics and evolutionary studies.
Drug discovery and vaccine development.
Disease diagnosis and personalized medicine.
Agricultural biotechnology and crop improvement.
Forensic science and environmental biology.

Conclusion


Bioinformatics has revolutionized biological research by enabling efficient analysis of complex biological data. The integration of computational biology, bioinformatics, and the internet has accelerated discoveries in genomics, proteomics, and medicine. With rapid advancements in sequencing technologies and artificial intelligence, bioinformatics will continue to play a central role in life sciences and healthcare.


1. Bioinformatics is the integration of
A. Biology and Chemistry
B. Biology and Physics
C. Biology, Computer Science and Statistics
D. Biology and Mathematics only
Answer: C
2. The term bioinformatics was first used in
A. 1960
B. 1970
C. 1978
D. 1990
Answer: C
3. The primary goal of bioinformatics is to
A. Perform wet lab experiments
B. Analyze and interpret biological data
C. Produce chemicals
D. Study anatomy
Answer: B
4. The first protein sequence database was created by
A. Watson
B. Crick
C. Margaret Dayhoff
D. Sanger
Answer: C
5. PAM matrix is used in
A. DNA replication
B. Sequence alignment
C. PCR
D. Transcription
Answer: B
6. Which algorithm is used for global sequence alignment?
A. BLAST
B. FASTA
C. Needleman–Wunsch
D. Smith–Waterman
Answer: C
7. Smith–Waterman algorithm is used for
A. Global alignment
B. Local alignment
C. Phylogenetic analysis
D. Genome sequencing
Answer: B
8. GenBank is a database for
A. Protein structures
B. Protein sequences
C. Nucleotide sequences
D. Metabolic pathways
Answer: C
9. The Human Genome Project was completed in
A. 1990
B. 1995
C. 2000
D. 2003
Answer: D
10. BLAST is used for
A. Gene cloning
B. Sequence similarity search
C. Protein synthesis
D. DNA replication
Answer: B
11. Which database stores 3D structures of proteins?
A. GenBank
B. UniProt
C. PDB
D. EMBL
Answer: C
12. EMBL database is located in
A. USA
B. Japan
C. Europe
D. India
Answer: C
13. DDBJ is maintained in
A. USA
B. Germany
C. Japan
D. UK
Answer: C
14. UniProt is mainly a database of
A. DNA sequences
B. RNA sequences
C. Protein sequences
D. Metabolites
Answer: C
15. Computational biology mainly focuses on
A. Data storage
B. Wet lab techniques
C. Mathematical modeling
D. DNA extraction
Answer: C
16. Bioinformatics is mainly
A. Theory driven
B. Data driven
C. Chemistry based
D. Physics based
Answer: B
17. Computational biology is mainly
A. Data driven
B. Theory driven
C. Database oriented
D. Tool oriented
Answer: B
18. Which of the following is NOT an application of bioinformatics?
A. Drug discovery
B. Genome annotation
C. Vaccine development
D. Microscopy
Answer: D
19. Internet is important in bioinformatics because it enables
A. DNA synthesis
B. Global data sharing
C. Cell culture
D. Protein purification
Answer: B
20. NCBI stands for
A. National Center for Biotechnology Information
B. National Cell Biology Institute
C. Network Center for Bioinformatics
D. National Computational Biology Institute
Answer: A
21. Which tool is commonly used for homology search?
A. PCR
B. BLAST
C. ELISA
D. Western blot
Answer: B
22. FASTA is used for
A. Genome annotation
B. Sequence alignment
C. Protein folding
D. Gene cloning
Answer: B
23. Next Generation Sequencing (NGS) produces
A. Small data
B. No data
C. Large volumes of data
D. Only protein data
Answer: C
24. Proteomics deals with the study of
A. Genes
B. RNA
C. Proteins
D. Lipids
Answer: C
25. Transcriptomics studies
A. DNA sequences
B. RNA transcripts
C. Proteins
D. Metabolites
Answer: B
26. Metabolomics is the study of
A. Genes
B. RNA
C. Proteins
D. Metabolites
Answer: D
27. Systems biology mainly studies
A. Single gene
B. Single protein
C. Entire biological systems
D. Only DNA
Answer: C
28. Which of the following is a primary bioinformatics database?
A. GenBank
B. PROSITE
C. Pfam
D. KEGG
Answer: A
29. KEGG database is related to
A. Protein structure
B. Metabolic pathways
C. DNA sequences
D. Gene cloning
Answer: B
30. Sequence annotation means
A. DNA extraction
B. Identifying functional elements
C. PCR amplification
D. Gel electrophoresis
Answer: B
31. Phylogenetic analysis helps in studying
A. Gene expression
B. Evolutionary relationships
C. Protein folding
D. DNA replication
Answer: B
32. Multiple sequence alignment is used to
A. Clone genes
B. Study conserved regions
C. Extract DNA
D. Amplify DNA
Answer: B
33. Which programming language is widely used in bioinformatics?
A. COBOL
B. FORTRAN
C. Python
D. Assembly
Answer: C
34. R programming is mainly used for
A. Web design
B. Statistical analysis
C. DNA synthesis
D. Protein purification
Answer: B
35. In silico means
A. Laboratory experiment
B. Computer-based experiment
C. Field study
D. Animal experiment
Answer: B
36. Which file format is commonly used for sequence data?
A. DOC
B. PDF
C. FASTA
D. JPG
Answer: C
37. Genome annotation involves
A. Sequencing DNA
B. Identifying genes and functions
C. DNA replication
D. Protein translation
Answer: B
38. Structural bioinformatics deals with
A. DNA replication
B. Protein structure analysis
C. Gene expression
D. RNA splicing
Answer: B
39. Comparative genomics compares
A. Proteins only
B. Genomes of different species
C. Metabolites
D. Single gene
Answer: B
40. Drug discovery using bioinformatics is called
A. Pharmacognosy
B. Cheminformatics
C. Pharmacology
D. Toxicology
Answer: B
41. Which of the following is a secondary database?
A. GenBank
B. EMBL
C. PROSITE
D. DDBJ
Answer: C
42. Multiple sequence alignment tool
A. BLAST
B. ClustalW
C. FASTA
D. PCR
Answer: B
43. Bioinformatics helps in personalized medicine by
A. Studying anatomy
B. Analyzing genetic variation
C. Cell staining
D. Tissue culture
Answer: B
44. Which organization maintains PDB?
A. NCBI
B. RCSB
C. EMBL
D. DDBJ
Answer: B
45. The backbone of bioinformatics development is
A. Internet and databases
B. Microscopy
C. Cell culture
D. Fermentation
Answer: A
46. Structural genomics focuses on
A. Gene expression
B. Protein structure determination
C. DNA replication
D. RNA synthesis
Answer: B
47. Which one is a web-based bioinformatics tool?
A. PCR
B. BLAST
C. Centrifuge
D. Autoclave
Answer: B
48. The main challenge in bioinformatics is
A. Lack of data
B. Data storage and analysis
C. No computers
D. Lack of internet
Answer: B
49. Artificial intelligence in bioinformatics is mainly used for
A. Data entry
B. Pattern recognition and prediction
C. DNA extraction
D. Cell staining
Answer: B
50. Bioinformatics is essential in modern biology because
A. Experiments are impossible
B. Data volume is huge
C. Biology has no theory
D. Computers are cheap
Answer: B


ﮩ٨ـﮩﮩ٨ـ♡ﮩ٨ـﮩﮩ٨ـﮩ٨ـﮩﮩ٨ـ♡ﮩ٨ـﮩﮩ٨ﮩ٨ـﮩﮩ٨ـ♡ﮩ٨ـﮩ

Comments

Popular Posts

❥NORTHERN BLOTTING

NORTHERN BLOTTING – 30 MARK DETAILED NOTES  𓆞❥ 𓆞❥ 𓆞❥ 𓆞❥ 𓆞❥ 𓆞 ❥ 𓆞❥ 𓆞❥  Northern blotting is a molecular biology technique used to detect specific RNA molecules in a complex mixture. It provides information about gene expression, RNA size, and transcript abundance by hybridizing RNA with a labeled complementary DNA or RNA probe. 📌 Named by analogy to Southern blotting (DNA detection). 2. Principle The principle of Northern blotting is based on: Separation of RNA molecules by size using denaturing agarose gel electrophoresis Transfer (blotting) of separated RNA onto a nylon or nitrocellulose membrane Hybridization of membrane-bound RNA with a labeled complementary probe Detection of RNA–probe hybrids by autoradiography or chemiluminescence ✔ Only RNA sequences complementary to the probe will be detected. 3. Types of RNA Analyzed mRNA (most common) rRNA tRNA miRNA and siRNA (with modified protocols) 4. Requirements / Materials Total RNA or poly(A)+ RNA Denaturing agarose ...

Information retrieval from databases - search concepts, Tools for searching, homology searching, finding Domain and Functional site homologies

Information retrieval from databases - search concepts, Tools for searching, homology searching, finding Domain and Functional site homologies Information Retrieval from Databases 1. Introduction Information retrieval in bioinformatics refers to the process of extracting relevant biological data (DNA, RNA, protein sequences, structures, or functional information) from databases. Aim : Identify sequences, functions, or structural features for analysis, comparison, and annotation. Databases can be primary (raw sequence data) or secondary/derived (annotated, processed data). 2. Search Concepts in Biological Databases 2.1 Types of Searches Exact Match Search Returns results only if the query exactly matches database entries. Useful for known accession numbers or IDs. Pattern/Keyword Search Searches based on specific motifs, keywords, or annotations. Example: “kinase domain,” “signal peptide.” Similarity/Homology Search Detects sequences similar to the query based on sequence alignment. Use...

Biological Databases – Types of Data and DatabasesNucleotide Sequence Databases (EMBL, GenBank, DDBJ)

Biological Databases – Types of Data and Databases Nucleotide Sequence Databases (EMBL, GenBank, DDBJ) 1. Introduction Biological databases are systematic, computerized collections of biological information that allow efficient storage, retrieval, updating, and analysis of large volumes of biological data. With the advent of genome sequencing, molecular biology, and bioinformatics, biological databases have become essential tools in biological research. These databases support studies in genomics, proteomics, evolutionary biology, taxonomy, medicine, agriculture, and biotechnology. 2. Types of Data Stored in Biological Databases Biological databases store diverse types of biological information, including: 1. Sequence Data DNA sequences RNA sequences Protein sequences 2. Structural Data Three-dimensional structures of proteins Nucleic acid structures 3. Functional Data Gene functions Enzyme activity Regulatory elements 4. Genomic Annotation Data Gene location Exons, introns Promoters a...

Exploitation of Somaclonal and Gametoclonal Variations for Plant Improvement

Exploitation of Somaclonal and Gametoclonal Variations for Plant Improvement  1. Introduction Plant tissue culture often induces genetic and epigenetic variations among regenerated plants. These variations, when stable and heritable, can be exploited as a source of novel traits for crop improvement. Somaclonal variation: Variation arising in plants regenerated from somatic cells cultured in vitro. Gametoclonal variation: Variation arising in plants regenerated from gametic cells (anther, pollen, ovule culture). Both provide additional genetic variability beyond conventional breeding. 2. Somaclonal Variation 2.1 Definition Somaclonal variation refers to genetic variation observed among plants regenerated from somatic tissue cultures, such as callus, suspension cultures, or explants. Term coined by Larkin and Scowcroft (1981). 2.2 Sources of Somaclonal Variation Chromosomal changes Aneuploidy Polyploidy Chromosome rearrangements Gene mutations Point mutations Insertions and deletions...

••CLASSIFICATION OF ALGAE - FRITSCH

      MODULE -1       PHYCOLOGY  CLASSIFICATION OF ALGAE - FRITSCH  ❖F.E. Fritsch (1935, 1945) in his book“The Structure and  Reproduction of the Algae”proposed a system of classification of  algae. He treated algae giving rank of division and divided it into 11  classes. His classification of algae is mainly based upon characters of  pigments, flagella and reserve food material.     Classification of Fritsch was based on the following criteria o Pigmentation. o Types of flagella  o Assimilatory products  o Thallus structure  o Method of reproduction          Fritsch divided algae into the following 11 classes  1. Chlorophyceae  2. Xanthophyceae  3. Chrysophyceae  4. Bacillariophyceae  5. Cryptophyceae  6. Dinophyceae  7. Chloromonadineae  8. Euglenineae    9. Phaeophyceae  10. Rhodophyceae  11. Myxophyce...

Fourth Semester M.Sc. Degree Examination, September 2019BotanySpecial Paper II - ElectiveBO 242 a: BIOTECHNOLOGY(2013 Admission onwards)

Reg. No.......  Name......... G-5263 Fourth Semester M.Sc. Degree Examination, September 2019 Botany Special Paper II - Elective BO 242 a: BIOTECHNOLOGY (2013 Admission onwards) Max. Marks: 75 1. Answer the following questions: 1. Humulin 2. YAC 3. Cybrids 4. Hybridomas 5. IPR 6. Gene therapy 7. C DNA library 8. AFLP 9. Hairy root culture 10. Somacional variation (10 x 1=10 Marks) II. Answer the following questions in not more than 50 words : 11. (a) What are immobilized enzymes? What is its advantage? OR (b) Write a short note on molecular farming. 12. (a) Give an account of bioprocess technology for the production of secondary metabolites. OR (b) What are bioreactors? How it operates? 13. (a) What are probiotics?. How do they work? OR (b) Discuss the methodology and application of western blotting. 14. (a) Briefly explain the application of protoplast culture OR (b) Write a short note on gene therapy 15. (a) What are reporter genes? Discuss its utility in transformation studies O...

Fourth Semester M.Sc. Degree Examination, September 2019BotanyBO 241: BIOINFORMATICS(2013 Admission onwards)

Reg. No....... Name:........ G-5262 Fourth Semester M.Sc. Degree Examination, September 2019 Botany BO 241: BIOINFORMATICS (2013 Admission onwards) Time: 3 Hours I. Answer the following questions: 1. Contiguous sequences 2. Protein Motif 3. INDEL 4. DOTPLOT 5. J PRED 6. Phylogram 7. EST 8. Query sequence 9. TIGR 10. CLUSTAL (10 x 1 = 10 Marks) II. Answer the following questions in not more than 50 words: 11. (a) Describe protein atlas and its significance in the development of bioinformatics. OR (b) Contribution of Frederick Sanger in advancement of proteomics. 12. (a) Describe Entrez with its significance. OR (b) Describe the advantages of EST search in gene sequencing programme. 13. (a) What are the salient features of UniProt KB (SWISSPROT). OR (b) Describe the features GenBank. 14. (a) Describe the concept of evolutionary tree. OR (b) Describe RasMol. 15. (a) Significance of BioPerl software in Human Genome project OR (b) Describe boutique databases. (5 x 2 = 10 Marks) III. Answer ...

Protein Sequence DatabasesPIR, SWISS-PROT and TREMBEL

Protein Sequence Databases PIR, SWISS-PROT and TREMBEL 1. Introduction Protein sequence databases are biological databases that store information about amino acid sequences of proteins, along with their functional, structural, and biochemical characteristics. Since proteins are the functional molecules of the cell, protein databases are essential for understanding gene expression, metabolism, enzymatic activity, signaling pathways, and evolution. Protein sequence databases mainly contain data derived from translated nucleotide sequences and experimental protein studies. 2. Types of Protein Sequence Databases Protein sequence databases are broadly classified into: A. Primary Protein Databases Contain original protein sequence data Minimal or no manual annotation B. Secondary Protein Databases Derived from primary databases Provide curated functional and structural information C. Composite Protein Databases Combine protein data from multiple sources Reduce redundancy 3. Protein Informati...

Secondary Databases (PROSITE, PRINTS, BLOCKS)

Secondary Databases (PROSITE, PRINTS, BLOCKS  Secondary Databases Introduction Biological databases are broadly classified into primary and secondary databases. Primary databases store raw experimental data (e.g., nucleotide or protein sequences), whereas secondary databases contain derived information obtained by analyzing primary sequence data. Secondary databases are mainly used to: Identify protein families Detect conserved motifs, patterns, and domains Predict protein function Study structure–function relationships Examples of secondary databases include PROSITE, PRINTS, BLOCKS, Pfam, etc. 1. PROSITE Database Definition PROSITE is a secondary database that documents protein domains, families, and functional sites in the form of patterns and profiles. Developed by Swiss Institute of Bioinformatics (SIB) Maintained along with UniProt Principle PROSITE is based on the idea that functionally important regions of proteins are conserved during evolution. These conserved regions can ...

𓆉 INDEX PAGE -NOTETHEPOINT43

INDEX PAGE   MAIN    CONTENT 1.   HSST BOTANY SYLLABUS, DETAILED NOTES, MCQ 2.  SET GENERAL PAPER SYLLABUS, DETAILED NOTES, 50MCQ 3.  SET BOTANY SYLLABUS, DETAILED NOTES, MCQ 4. MSC BOTANY THIRD SEMESTER SYLLABUS, NOTES (KERALA UNIVERSITY ) 5. MSC BOTANY THIRD SEMESTER QUESTION PAPER (KERALA UNIVERSITY ) 6. MSC BOTANY FOURTH SEMESTER SYLLABUS &NOTES (KERALA UNIVERSITY ) 7. FOURTH SEMESTER MSC BOTANY PREVIOUS QUESTION PAPER  (KERALA UNIVERSITY )