MLBio+Laboratory Machine Learning in Biomedical Informatics



Syllabus

Class Information
Instructor: Huzefa Rangwala, Room #Engineering 4423, rangwala@cs.gmu.edu
Class Time & Location: M: 7:20-10:00 pm Robinson Hall B 228
Text Book: Understanding Bioinformatics by Zvelebil & Baum
Office Hours: Instructor: M 4-6pm, EB 4423
About the Course
Course Description
CS 795 (Biological Sequence Analysis) is an inter-disciplinary course aimed at bridging the gap between biology and computer science, by exposing students to the widely used algorithms and methods playing a key role in bioinformatics and computational biology. The human genome project and advances in sequencing technologies have left us with a wealth of DNA, RNA, protein sequence data. Its important to infer key characteristics of biological systems using sequence analysis methods. The first half of the course will help students understand basic sequence alignment algorithms, hidden Markov models, classification and prediction methods. The second half will be an application of the concepts and ideas learned to some of the current bioinformatics applications motivated with a fair biological understanding.
Course Prerequisites
Programming in language of your choice. The class will cover the needed biology.
Course Outcomes
As an outcome of taking this class, a student will be able to
  • Conceptualize and implement sequence alignment algorithm methods which use adynamic programming solution.
  • Study the working of large genomic sequence database search tools like FASTAand BLAST.
  • Analyze the vast amount of genomic and proteomic data using machine learningand data mining tools (discriminative and generative models).
  • Understand the theoretical aspects of Markov chains and hidden Markov models and their application to gene prediction, protein sequence annotation and multiple sequence alignment.
  • Read research papers pertaining to bioinformatic and computational biology.
  • Learn about new sequencing technologies along with development of short-read assembly algorithms
Course Format
Lectures will be given by the instructor. Besides material from the textbook, topics not discussed in the book may also be covered. Research papers and handouts of material not covered in the book will be made available. Grading will be based on homework assignments, exams, and a project. Homework assignments will require some programming. Exams and homework assignments must be done on an individual basis. Any deviation from this policy will be considered a violation of the GMU Honor Code.
Tentative Class Topics
Sequence Alignment, Sequence Assembly, Markov Models, Genome Annotation, Short-Read Sequencing, Protein Structure and Function Prediction.
Grading Scheme
Exam 20 %
Assignments 40 %
Readings 10 %
Projects 30%
Grade Distribution
Grade Score Range
A 97-100
A- 92-97
B+ 86-92
B 80-86
C 70-80
F < 70
Policies:
Attendance
Attendance is not compulsory but highly recommended for doing well in the class. This class has lots of active learning exercises, and they will be a lot of fun.
Assignment Submission
Please ensure that the assignments are submitted on-time. No late submissions.
Make-Up Exams & Incompletes
Make up exams and incompletes will not be given for this class.
Academic Honesty and GMU Honor Code
Please visit the University's Academic Honesty Page and GMU Honor Code .
Disability Statement
If you have a documented learning disability or other condition that may affect academic performance you should: 1) make sure this documentation is on file with the Office of Disability Services (SUB I, Rm. 222; 993-2474; www.gmu.edu/student/drc) to determine the accommodations you need; and 2) talk with me to discuss your accommodation needs.


Powered by Drupal, an open source content management system