course number instructor title
CS 4984 Siddharth Krishnan Capstone in Social Network Analytics

How do memes spread on blogs? How and when does a hashtag become popular? Can we forecast viral content? How can we harness information cascades to make ’real-world’ predictions? 
Can Twitter foretell the Flu? In this course, we will develop projects that use web mining, data analytics, computational social science, and applied machine learning to model, analyze, characterize dynamical processes (like information cascades) on social networks. Networks provide enormous potential both to address long-standing scientific questions and also to inform the design of future social computing applications. Social networks, thus pose interesting challenges and questions, which will motivate projects in this capstone course: How is information in a social network created? How does it flow and mutate as it spreads through the network? How do we leverage the information flow to develop real-world predictive applications?

Course Specifics:
  1. The first four-five weeks will comprise of lectures to cover some background necessary to work on problems in this space. After which, we will meet only to learn about some cutting edge research via presentations from CS faculty and senior graduate students and on a need basis for discussing project progress.
  2. This is a 4000 level capstone course, so senior level work is expected. Students should be prepared to apply what they have learned in prior courses (like algorithms, database concepts, etc.) and to hone their skills at learning a new field using primary references as well as some secondary resources. 
  3. Students work on team term projects, in groups of size 4-5. Students will be responsible for (with input from the instructor):
Final deliverables will be an implementation/working prototype, a report, and a poster to be presented at the Virginia Tech Undergraduate Computer Science Research Symposium (VTURCS) at the end of April. Select projects, based on results, will be communicated to competitive data mining conferences or workshops. 
Prerequisite: A grade of C or better in CS 3654 or CS 4804 or CS 4824 or permission of instructor.