COMP50030 Bioinformatics Applications and Databases

Academic Year 2014/2015

This is aimed at Biologists who have minimal previous programming experience. Computer Scientists will therefore find the programming tutorials basic, but should find the bioinformatics applications and biodatabases aspects educational.

Each lecture/tutorial combination will deal with both biological and technical (database design, programming, algorithms) features associated with a particular database. The goal is a balance of 50:50 biology content and technology.

Databases: Protein database Swissprot standard entry format, and BLAST searching tool. DNA and protein http://srs.ebi.ac.uk (SRS representation of sequence databases)www.rscb.org (PDB structural database and 3D structure viewers) Human polymorphism www.hapmap.org (and haploview application) Genome data www.ensembl.org (and ensmart tools) www.pfam.org (alignment database, alignment methods, profile searching methods) Systems biology database of models (SBML) Gene expression database (geo; limitations of current tools) Proteomics databases (2D gel and mass spec databases). Protein interaction databases (biogrid, www.hprd.org , cytoscape network application), Protein structure (PDB), chemical information (PubChem, ZINC, etc).

Show/hide contentOpenClose All

Curricular information is subject to change

Learning Outcomes:

This will teach non-biologists a lot of biology by forcing exposure to biological data. It will teach biologists a lot of computer application issues (CGI, relational database model, mysql, html, and others) . It will give students introductory specialist knowledge in a number of applied bioinformatics areas. It will expose students to the extent of complex and difficult-to-structure observational data in modern biology and the markedly heuristic approaches favoured in biology.

Lectures will cover techniques and methods for dealing with the following kinds of data:
- database structure and manipulation.
- genomic data (compiling, analysing and annotating DNA sequencing data).
- clinical bioinformatics (SNPs, alleles, haplotypes)
- interactome data (proteomics data, protein complexes, protein identification).
- chemical information (combinatorial libraries, SDF, SMILES, SMARTS, fingerprints).
- protein structural data (PDB, homology modelling, active site determination).

�Hands on� experience in the tutorials will take students through the basic steps needed to build a program that retrieves and stores web-based biological data. After the three days of tutorials students will be able to start from an unformatted computer hard drive and end with a functional, searchable database of biologically relevant information.

Student Effort Hours: 
Student Effort Type Hours
Lectures

15

Tutorial

15

Autonomous Student Learning

70

Total

100

 
Requirements, Exclusions and Recommendations
Learning Requirements:

BSc(Hons) in Biology or Computer Science

Learning Recommendations:

Python Programming for Computational Biologists (COMP50050)



Module Requisites and Incompatibles
Equivalents:
Biodatabases& their Apps (COMP40710)

 
Description % of Final Grade Timing
Continuous Assessment: Self assessment sheets scored and signed off by supervisors

100

Varies over the Trimester

Compensation

This module is not passable by compensation

Resit Opportunities

In-semester assessment

Remediation

If you fail this module you may repeat, resit or substitute where permissible