Assignent-01/Abhishek Mishra/HBTI Kanpur Bioinformatics-Programs & Tools
Assignent-01/Abhishek Mishra/HBTI Kanpur Bioinformatics-Programs & Tools
Assignent-01/Abhishek Mishra/HBTI Kanpur Bioinformatics-Programs & Tools
Bioinformatics tools are software programs that are designed for extracting the meaningful information from the mass of data & to carry out this analysis step. Factors that must be taken into consideration when designing these tools are The end user (the biologist) may not be a frequent user of computer technology These software tools must be made available over the internet given the global distribution of the scientific research community Major categories of Bioinformatics Tools: There are both standard and customized products to meet the requirements of particular projects. There are data-mining software that retrieves data from genomic sequence databases and also visualization tools to analyze and retrieve information from proteomic databases. These can be classified as homology and similarity tools, protein functional analysis tools, sequence analysis tools and miscellaneous tools. Here is a brief description of these. Everyday bioinformatics is done with sequence search programs like BLAST, sequence analysis programs, like the EMBOSS and Staden packages, structure prediction programs like THREADER or PHD or molecular imaging/modeling programs like RasMol and WHATIF. Homology and Similarity Tools: Homologous sequences are sequences that are related by divergence from a common ancestor. Thus the degree of similarity between two sequences can be measured while their homology is a case of being either true or false. This set of tools can be used to identify similarities between novel query sequences of unknown structure and function and database sequences whose structure and function have been elucidated. Protein Function Analysis: These groups of programs allow you to compare your protein sequence to the secondary (or derived) protein databases that contain information on motifs, signatures and protein domains. Highly significant hits against these different pattern databases allow you to approximate the biochemical function of your query protein. Structural Analysis: These sets of tools allow you to compare structures with the known structure databases. The function of a protein is more directly a consequence of its structure rather than its sequence
Sequence Analysis: This set of tools allows you to carry out further, more detailed analysis on your query sequence including evolutionary analysis, identification of mutations, hydropathy regions, CpG islands and compositional biases. The identification of these and other biological properties are all clues that aid the search to elucidate the specific function of your sequence.
FASTA:
FAST homology search all sequences .An alignment program for protein sequences created by Pearsin and Lipman in 1988. The program is one of the many heuristic algorithms proposed to speed up sequence comparison. The basic idea is to add a fast prescreen step to locate the highly matching segments between two sequences, and then
ClustalW:
It is a fully automated sequence alignment tool for DNA and protein sequences. It returns the best match over a total length of input sequences, be it a protein or a nucleic acid.
RasMol:
It is a powerful research tool to display the structure of DNA, proteins, and smaller molecules. Protein Explorer, a derivative of RasMol, is an easier to use program.
PatternHunter:
PatternHunter, based on Java, can identify all approximate repeats in a complete genome in a short time using little memory on a desktop computer. Its features are its advanced patented algorithm and data structures, and the java language used to create it. The Java language version of PatternHunter is just 40 KB, only 1% the size of Blast, while offering a large portion of its functionality.
Applications and software Tools for Bioinformatics
AutoDock:
AutoDock is a suite of automated docking tools. It is design to predict how small molecule, such as substrate or drug candidates, bind to a receptor of known 3D structure.
AutoDock Vina:
AutoDock Vina is a new open source program for drug discovery, molecular docking & virtual screening, offering multi core capability, high performance and enhanced accuracy and ease of use.
ClustalX:
ClastalX is a window interfaces for ClastalW multiple sequence alignment programs. It provides an integrated environment for performing multiple sequence and profile alignments and analyzing the results. The sequence alignment is displayed in a window on the screen. A versatile coloring scheme has been incorporated allowing you to highlight conserve features in the alignment. The pool-down means at the top of window allow to you select all the position required for traditional multiple sequence and profile alignment.
MODELLER:
MODELLER is used for homology or comparative modeling of protein three dimensional structures. The user provides an alignment of a sequence to be modeled with known related structures and MODELLER automatically calculates a mode containing all non-hydrogen atoms. MODELLER implements comparative protein structure modeling by satisfaction of spatial restraints and can perform many additional task, including de novo modeling of loop in protein structure, optimization of various models of protein structure with respect to flexibly defined objective function, multiple alignment of protein sequence and/or structures, clustering, searching of sequence database, comparison of protein structure etc.
Applications and software Tools for Bioinformatics
Glimmer:
Glimmer is a system for finding genes in microbial DNA, especially the genomes of bacteria, archaea and virus. Glimmer (Gene Locator and interpolated Markov ModelER) uses interpolated Markov models (IMMs) to identify the coding regions and distinguish them from noncoding DNA. .
Chemtool:
Chemtool is a program for drawing organic molecules easily and store them in a variety of output formats including as an X bitmap, Xfig, SVG or EPS file. It runs under the X window system using the GTK widget set.
Avogadro:
An advanced molecular editor designed for cross platform use in computational chemistry, molecular modeling, bioinformatics, material science, and related areas, which offers flexible rendering and a powerful plugin architecture.
GROMACS:
It is a versatile and extremely well optimized package to perform molecular dynamics computer simulation and subsequent trajectory analysis. It is developed for biomolecule like
Jackal:
Jackal is a collection of program designed for modeling and analysis of program structure. Its core program is a versatile homology modeling package nest.
HMMER:
Profile hidden Markov models (Profile HMMs) can be used to do sensitive database searching using statistical description of a sequence familys consensus. HMMER is a freely distributable implementation of profile HMM software for protein sequence analysis.
Bioinformatics Projects
BioJava
Applications and software Tools for Bioinformatics
The BioJava Project is dedicated to providing Java tools for processing biological data which includes objects for manipulating sequences, dynamic programming, file parsers, simple statistical routines, etc. BioPerl The BioPerl project is an international association of developers of Perl tools for bioinformatics and provides an online resource for modules, scripts and web links for developers of Perl-based software.
BioXML
A part of the BioPerl project, this is a resource to gather XML documentation, DTDs and XML aware tools for biology in one location.
Biocorba
Interface objects have facilitated interoperability between bioperl and other perl packages such as Ensembl and the Annotation Workbench. However, interoperability between bioperl and packages written in other languages requires additional support software. CORBA is one such framework for interlanguage support, and the biocorba project is currently implementing a CORBA interface for bioperl. With biocorba, objects written within bioperl will be able to communicate with objects written in bio python and biojava.
Bioinformatics Applications
Molecular medicine: The human genome will have profound effects on the fields of biomedical research and clinical medicine. Every disease has a genetic component. This may be inherited (as is the case with an estimated 3000-4000 hereditary disease including Cystic Fibrosis and Huntingtons disease) or a result of the body's response to an environmental stress which causes alterations in the genome (e.g. cancers, heart disease, diabetes.). The completion of the human genome means that we can search for the genes directly associated with different diseases and begin to understand the molecular basis of these diseases more clearly. This new knowledge of the molecular mechanisms of disease will enable better treatments, cures and even preventative tests to be developed. Personalised medicine: Clinical medicine will become more personalized with the development of the field of pharmacogenomics. This is the study of how an individual's genetic inheritance affects the body's response to drugs. At present, some drugs fail to make it to the market because a small percentage of the clinical patient population shows adverse effects to a drug due to sequence variants in their DNA. As a result, potentially lifesaving drugs never makes it to the marketplace. Today, doctors have to use trial and error to find the best drug to treat a particular patient as those with the same clinical symptoms can show a wide range of responses to the same treatment. In the future, doctors will be able to analyses a patient's genetic profile and prescribe the best available drug therapy and dosage from the beginning. Preventative medicine: With the specific details of the genetic mechanisms of diseases being unraveled, the development of diagnostic tests to measure a persons susceptibility to different diseases may become a distinct reality. Preventative actions such as change of lifestyle or having treatment at the earliest possible stages when they are more likely to be successful, could result in huge advances in our struggle to conquer disease. Gene therapy: In the not too distant future, the potential for using genes themselves to treat disease may become a reality. Gene therapy is the approach used to treat, cure or even prevent disease by