SureshKumar's Bioinformatics Blog

I am Suresh Kumar Sampathrajan. I have completed my PhD degree in bioinformatics from the University of Vienna, Austria in the year 2010. If you want to know more about me and my research,please click the menus at the top.

I have started this bioinformatics blog mainly for undegraduate and postgraduate students of bioinformatics. This blog will serve as an open resource material for the students and for those who wish to know about bionformatics. This blog contains video tutorials, tips, bioinformatics software downloads, articles on bioinformatics and career opportunities.

Bioinformatics research fields

Bioinformatics research fields broadly classified to the following sub-fields. Some research fields have inter related with one another


The term "sequence analysis" implies subjecting a DNA or peptide sequence to sequence alignment, sequence databases, repeated sequence searches, or other bioinformatics methods on a computer.

Sequence analysis in bioinformatics is an automated, computer-based examination of characteristical fragments. It basically includes five biologically relevant topics:

1. the comparison of sequences in order to find similar sequences (sequence alignment)
2. identification of gene-structures, reading frames, distributions of introns and exons and regulatory elements
3. prediction of protein structures
4. genome mapping
5. comparison of homologous sequences to construct a molecular phylogeny


structural bioinformatics refers to the analysis of macromolecular structure particularly proteins, using computational tools and theoretical frameworks.


It is defined as analysis of the full genomes of organisms that have been sequenced and to identify those genes that are predicted to have a particular biological function.Comparative genome analysis is one of the component of genome analysis.

Comparative genomics include a comparision of gene number,gene content and gene location in both prokaryotic and eukaryotic groups of organisms.


Gene expression, or simply expression, is the process by which a gene's DNA sequence is converted into the structures and functions of a cell. Non-protein coding genes (e.g. rRNA genes, tRNA genes) are not translated into protein.

Gene regulatory network (also called a GRN or genetic regulatory network) is a collection of DNA segments in a cell which interact with each other and with other substances in the cell, thereby governing the rates at which genes in the network are transcribed into mRNA.

Mathematical models of GRNs have been developed to allow predictions of the models to be tested. Various modeling techniques have been used, including Boolean networks, Petri nets, Bayesian networks, graphical Gaussian models, Stochastic Process Calculi and sets of differential equations.


Systems biology is the coordinated study of biological systems by (1) investigating the components of cellular networks and their interactions, (2) applying exprerimental high-throughput and whole-genome techniques, and (3) integrating computational methods with experiemntal efforts.”


Data mining (DM), also called Knowledge-Discovery in Databases (KDD) or Knowledge-Discovery and Data Mining, is the process of automatically searching large volumes of data for patterns such as association rules. It applies computational techniques from statistics, information retrieval, machine learning and pattern recognition.


phylogenetics (Greek: phylon = tribe, race and genetikos = relative to birth, from genesis = birth) is the study of evolutionary relatedness among various groups of organisms (e.g., species, populations). Also known as phylogenetic systematics, phylogenetics treats a species as a group of lineage-connected individuals over time.The most commonly used methods to infer phylogenies include parsimony, maximum likelihood, and MCMC-based Bayesian inference.


Genetic analysis: The study of a sample of DNA to look for mutations (changes) that may increase risk of disease or affect the way a person responds to treatment.

Population analysis: Population analysis encompasses methods used to characterize and understand changes in populations. Typically, through population analysis we are interested in being able to explain observed changes in population dynamics and make predictions regarding future possibilities. Knowledge from analyses is expressed as a model.


  • Bourne, Weissig (2003) Structural Bioinformatics, Wiley
  • James M. Bower, Hamid Bolouri (editors), (2001) Computational Modeling of Genetic and Biochemical Networks Computational Molecular Biology Series, MIT Press, ISBN 0-262-02481-0
  • Klipp E et al. ”Systems Biology in Practice”, WILEY-VCH, 2005
  • Pang-Ning Tan, Michael Steinbach and Vipin Kumar, Introduction to Data Mining (2005), ISBN 0-321-32136-7

what is bioinformatics?

Bioinformatics has evolved into a full-fledged multidisciplinary subject that integrates developments in information and computer technology as applied to biotechnology and biological Sciences.

Roughly, bioinformatics describes any use of computers to handle biological information. In practice the definition used by most people is narrower; bioinformatics to them is a synonym for "computational molecular biology"- the use of computers to characterize the molecular components of living things.

The NIH Biomedical Information Science and Technology Initiative Consortium computational tools andapproaches for expanding the use of biological, medical, agreed on the following definitions of bioinformatics as research, development, or application of behavioral or health data, including those to acquire, store, organize, archive, analyze, or visualize such data.

The National Center for Biotechnology Information defines bioinformatics as "Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline.There are three important sub-disciplines within bioinformatics: the development of new algorithms and statistics with which to assess relationships among members of large data sets; the analysis and interpretation of various types of data including nucleotide and amino acid sequences, protein domains, and protein structures; and the development and implementation of tools that enable efficient access and management of different types of information."

(Molecular) bio – informatics: bioinformatics is conceptualising biology in terms of molecules (in the sense of Physical chemistry) and applying “informatics techniques” (derived from disciplinessuch as applied maths, computer science and statistics) to understand andorganise the information associatedwith these molecules, on a large scale. Inshort, bioinformatics is a managementinformation system for molecular biology and has many practical applications.

1. What is bioinformatics? A proposed definition and overview of the field. NM Luscombe,
D Greenbaum, M Gerstein (2001) Methods Inf Med 40: 346-58

Twitter Delicious Facebook Digg Stumbleupon Favorites More