Concepts and techniques in data mining and multidisciplinary applications. Topics include databases; data cleaning and transformation; concept description; association and correlation rules; data classification and predictive modeling; performance analysis and scalability; data mining in advanced database systems, including text, audio, and images; and emerging themes and future challenges.
Thursday 7:20-10:00pm
Horizon Hall 2009
Dr. Jessica Lin
Email: jessica [AT] gmu [DOT] edu
Office Hours: TBA
Li Zhang
Office Hours: TBA
Office: TBA
There will be 4-5 competition-style programming assignments. The preferred programming language is Python. Competition winners will get 1% extra credit added to the final grade. No late submission will be accepted.
There will be quizzes throughout the semester covering lectures and readings, and one final exam. The final exam is comprehensive. All exams are closed-book, and they must be taken at the scheduled time and place, unless prior arrangement has been made with the instructor. Missed exams cannot be made up. The lowest quiz grade will be dropped.
Required: Introduction
to Data Mining by Pang-Ning Tan, Michael Steinbach, and Vipin
Kumar (click on the link for the companion website)
The GMU Honor Code is in effect at all times. In addition, the CS Department has further honor code policies regarding programming projects, which are detailed here. Any deviation from the GMU or the CS department Honor Code is considered an Honor Code violation. All assignments for this class are individual unless otherwise specified.
If you have a documented learning disability or other condition which may affect academic performance, make sure this documentation is on file with the Office of Disability Services and then discuss with the professor about accommodations.