Office Hours: 10-11am Mon/Wed, or by appointment
This course is an introduction to advanced research topics in
Internet-scale data management, addressing the relevant theoretical
foundations, methods, and tools from a wide spectrum, including (i)
large-scale distributed information management; (ii) data and text
mining techniques and algorithms; and (iii) data privacy and security
issues in large-scale Internet systems.
Course Syllabus
The course grading policy is as follows: 20% Proposal, 50% Final Project Deliverable (Paper + Poster), 30% Assignments. Refer to your course syllabus for more details.
- Project proposal: Due September 28, 2010 11:59pm. Your proposal should be around 5 pages, single column, normal fonts, normal margins. See the syllabus for specific questions we expect to see addressed as well as this brief presentation.
During most weeks, I will present an overview of the week's topics
during the first half of class on Tuesday. The remaining time each week
will be devoted to intense paper discussion.
You should check the schedule often, as it is in flux.
August 31 / September 2: Introduction to the course, administrivia,
etc.
September 7 / 9: MapReduce
September 14 / 16: Link Analysis
September 21 / 23: Text/Web/Blog Mining
September 28 / 30: Communities
October 5 / 7: Social Media
October 12 / 14: Real-Time Web
October 19 / 21: Duplicate Detection
October 26 / 28: Guest Lectures on Visualizing Large Datasets
November 2 / 4: Social Spam
November 9 / 11: Geography and the Social Web
November 16 / 18: Influence in Social Information Networks
November 23: Short Text
November 30 / December 2
December 7
Hadoop
Toolkits
Data
The Americans with Disabilities Act (ADA) is a
federal anti-discrimination statute that provides comprehensive civil
rights protection for persons with disabilities. Among other things,
this legislation requires that all students with disabilities be
guaranteed a learning environment that provides for reasonable
accommodation of their disabilities. If you believe you have a
disability requiring an accommodation, please contact the Department of
Student Life, Services for Students with Disabilities, in Cain Hall or
call 845-1637.
AGGIE HONOR CODE: ''An Aggie does not lie, cheat, or steal or tolerate
those who do.'' Upon accepting admission to Texas A&M University, a
student immediately assumes a commitment to uphold the Honor Code, to
accept responsibility for learning, and to follow the philosophy and
rules of the Honor System. Students will be required to state their
commitment on examinations, research papers, and other academic work.
Ignorance of the rules does not exclude any member of the TAMU community
from the requirements or the processes of the Honor System. For
additional information please visit: http://www.tamu.edu/aggiehonor/