CS 5634
Data Management in Bioinformatics

Fall 2005

CS 5634 is a database course specifically geared towards bioinformatics. Its goal is to cover all the traditional aspects of database management (data modeling, schema design, querying, and interface issues) in the context of biological problems. We will begin with the major schemes of data organization: relational (with coverage of SQL), object-oriented, text, self-describing flat files, and XML. Next, we will cover customized data representation and storage schemes for graphs, which are popular for modeling biological pathways and networks. Emphasis will be placed on information integration (harnessing multiple biological databases to answer specific queries) and, to a lesser extent, on data mining (inferring patterns from biological databases).

Prerequisites: CS 5046 or graduate standing in CSA. Partially duplicates CS 5614.

Instructor

Class Meeting Times and Contact Info:


First Day's Handout

(post) Lecture Notes and Projects