It should be mentioned that this course is NOT about specific applications.
Also, the course is NOT structured around certain standardized
application-layer protocols, such as HTTP, RTP, or SIP. We will discuss
some of these protocols in the context of the applications that use them.
Lectures and mini-presentations
Each lecture consists of two parts. In
the first part, the instructor will talk about the state-of-the-art
in that week's topic. The lectures will be based on recent research
papers and other references. In the second part of the class, we will
have one or two small (typically 5 minutes each) student presentations
about specific applications that are related to that week's topic. For
example, if the
lecture topic is Voice-over-IP, the instructor will first cover
fundamental principles and technologies about VoIP, while a student
presentation at the end of the class can focus on a particular Skype feature.
Prerequisites
Some (recent and solid) knowledge of computer
networking
is required. This should certainly NOT be the first networking course
you are
taking. However, you do not need to be a "networking student" to
take this course. Students with a primary interest in machine learning
or AI, HCI, or databases will also find some of the topics interesting.
CS6250 is NOT a required prerequisite for this course, as long as you
have taken another networking course before.
Textbook references
The course will NOT have a textbook. Instead, we will study a number
of research and survey papers (listed in the syllabus section).
You are strongly encouraged to look at the recent proceedings of the
following conferences, especially as you look for a project topic:
- World Wide Web (WWW) conference
- Networks and Operating Systems Support for Digital Audio and Video (NOSSDAV)
- Internet Measurement Conference (IMC)
- Sigcomm workshop on Online Social Networks (OSN)
- International AAAI Conference on Weblogs and Social Media (ICWSM)
- ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD)
- ACM Conference on Electronic Commerce (EC)
You may also find the following textbooks useful at certain
parts of the course.
- "Web Protocols and Practice", by B.Krishnamurthy and J.Rexford.
- "Digital Compression for Multimedia", by Gibson et al.
- "Internet Measurement", by M.Crovella and B.Krishnamurthy.
- "Web Data Mining", by B.Liu
Syllabus, schedule, and links to references
-
Week-1: Introduction - How do people use the Internet? Traffic classification
- How to read a paper? by S.Keshav
- Broadband Fact Book by the Internet Innovation Alliance
- see also most recent Ipoque study
- Is P2P dying or just hiding? by T.Karagiannis et al.
- Internet Traffic Classification Demystified: Myths, Caveats, and the Best Practices by H.Kim et al.
-
Week-2: Voice over the Internet
- Voice over Internet protocol by B. Goode
- Detailed analysis of Skype traffic by D. Bonfiglio et al.
- Quantifying Skype user satisfaction by K.T. Chen et al.
-
Week-3: Video over the Internet
- Video Streaming: Concepts, Algorithms, and Systems by J. Apostolopoulos et al.
- I Tube, You Tube, Everybody Tubes: Analyzing the World's Largest User Generated Content Video System by M.Cha et al.
- Watching
Television Over an IP Network by M.Cha et al.
-
Week-4: P2P file distribution
- Search and replication in unstructured peer-to-peer networks by Q. Lv et al.
- The KaZaA Overlay: A Measurement Study by J. Liang et al.
- Analyzing and Improving a BitTorrent Network's Performance Mechanisms by A. Bharambe et al.
-
Week-5: P2P
streaming
- Opportunities and Challenges of Peer-to-Peer Internet Video Broadcast by J. Liu et al. (optional reading)
- Insights into PPLive: A Measurement Study of a LargeScale P2P IPTV System by X. Hei et al.
- CoolStreaming/DONet: A Data-Driven Overlay Network for Efficient Live Media Streaming by X. Zhang et al.
- (Sigcomm'08 PPlive paper)
-
Week-6: Web performance and the user's perspective on Web performance
- On the Use and Performance of Content Distribution Networks by B. Krishnamurthy et al.
- Measuring Human Satisfaction in Data Networks by M. Andrews et al.
-
Week-7: Midterm
-
Week-8: Multiplayer games and virtual worlds
- Donnybrook: Enabling Large-Scale, High-Speed, Peer-to-Peer Games by A. Bharambe et al.
- Measurement-based Characterization of a Collection of On-line Games by C. Chambers et al.
- Traffic Analysis Beyond This World: the Case of Second Life by S. Fernandes et al.
-
Week-9: Email, instant messaging, and other text communication
- Planetary-Scale Views on a Large Instant-Messaging
Network by Leskovec and Horvitz
- Comparative Graph Theoretical Characterization of Networks of Spam and Legitimate Email by Gomez et al. (optional reading)
- Behavioral Profiles for Advanced Email Features by Karagiannis and Bojnovic (optional reading)
-
Week-10: Social
networking applications and online communities
- Measurement and Analysis of Social Networks by A.Mislove
- Structure and Evolution of Online Social Networks by R.Kumar et al.
- Comparison of online social relations in volume vs interaction: a case study of cyworld by Chun et al.
- Unveiling Facebook: A Measurement Study of Social Network Based Applications by Nazir et al. (Optional reading)
-
Week-11: Searching the Web
- How are we Searching the World Wide Web? A Comparison of Nine Search Engine Transaction Logs by B.J.Janser and A.Spink
- Spatial Variation in Search Engine Queries by Backstrom et al.
- Why We Search: Visualizing and Predicting User Behavior by E. Adar et al.
-
Week-12: Extracting knowledge from the Web
- The Semantic Web by T.Berners-Lee et al.
- Knowledge Sharing and Yahoo Answers: Everyone Knows Something by Adamic et al.
- Expertise networks in online communities: structure and algorithms by Zhang et al. (optional reading)
-
Week-13: The Web as a marketplace and a public forum
- The Dynamics of Viral Marketing by J.Leskovec et al.
- Online Auctions Efficiency: A Survey of eBay Auctions by Wenyuan et al. (optional reading)
- Cost-effective outbreak detection in networks by J.Leskovec et al.
-
Week-14: User behavior studies - Presentation of student projects
- Yes, There is a Correlation From Social Networks to Personal Behavior on the Web by Singla and Richardson
- Improving Web Search Ranking by Incorporating User Behavior Information by Agichtein et al.
- Demographic Prediction Based on Users' Browsing Behavior by Hu et al. (optional reading)
Term Project
Students will work on a term project in groups of
four students. The term project is supposed to be the most creative and fun part
of the course. Also, the project will be a "hands-on" study
that is related to one or more Internet applications. The
exact project description will be determined by
students themselves. The only requirement is that the project should
have a sufficiently challenging component. For instance, you can write a new (or modify
an existing open-source) application that is related to the course
topics. You can measure or experiment with existing applications,
evaluating their performance or other characteristics. You can study
the social network of an online community application. Or, you can
propose, study and implement an algorithm that would be useful in one
of the applications we cover (for example, an advanced web searching
algorithm based on clustering or semantic associations). Writing a
survey paper or just running some applications would certainly not be enough.
Project Milestones:
- January 15: form groups of 4 students - email group participant names and email addresses to TA and instructor
- February 5: write a 2-page project description, explaining what you want to do
and how you plan to do it - the instructor and TA will provide feedback and you may need to revise your project description
- March 10: first progress report (each group discusses their progress with the instructor or TA)
- April 1: second progress report (each group discusses their progress with the instructor or TA)
- April 24: final project presentations
Grading
- Class participation and mini-presentations: 10%
- Midterm exam: 25%
- Final exam: 25%
- Term project and final
presentation: 40%