Course Objectives and
Lecture: 3pm - 4:15am Mon/Wed (Jan 08 - May 03, 2018), KACB 1447
Office hours: by appointment
Course TA:Emre Gursoy and Semih Sahin
Office hours: by appointment in KACB
Course Website: www.cc.gatech.edu/~lingliu/courses/cs6675/18Spring/cs6675.html
Course Evaluation online Click Here
Advanced Internet-scale systems and applications are geographically distributed, highly available, incrementally scalable, and dynamically configurable. Typical questions that systems researchers and advanced application developers are facing today include: How would you build a web service that can handle billions of frantic requests? What systems support do we need for developing applications of Internet scale? Can we provide dynamic configuration, replication, and migration of Web services? What new techniques will enable Internet systems and applications to better exploit high-speed networks? How should traditional systems issues such as naming, persistence, resource management, performance, and security be provided in a system of Internet scale? How much data can an internet scale system process? What does big data technology mean to a computer scientist? to a data scientist? to a business owner or a scientist?
This course reviews concepts, techniques, and systems issues in advanced Internet application development, and explores new challenges and research issues that are critical for developing big data systems and applications, such as search engines (incl. robots and indexing servers), Web servers, application servers (such as the E-commerce information servers like eBay, CNet, Amazon), and web-based online transaction systemsi, nternet scale social media and social network systems. Main topics to be covered include fundamentals of search engines, web servers, application servers and online transaction systems, especially the important search techniques, indexing techniques, the various means for querying heterogeneous data sources and delivering fresh information from sources to consumers, the methods for distribution of control and performance optimizations to avoid bottlenecks and to achieve elastic scalability. One of the important goals of the course is to look beyond the present status of the Internet and conjecture what possible future technologies and applications will evolve. The course will include a significant project component that will typically require Java/C/CGI programming.
Students are expected to have taken Operating Systems (CS2200 or equivalent) and Introduction to database systems (CS 4400/6400 or equivalent). Computer Networks and Parallel and Distributed Systems (CS 4230 and CS6236 or equivalent) are highly recommended for students interested in doing research in systems or networks. In addition, students are expected to have a solid grasp of Java/C/CGI programming. Sockets programming is not required but desirable.
A detailed description of course structure and administration can be found in Course Introduction.
Grades will be computed using the tentative weighting scheme below:
There are 5 homework assignments starting from Week 2. For each homework assignment, you can choose either reading assignment or programming assignment. You are given 2 weeks to complete the assignment. All five assignments are individual homeworks.
Each reading assignment requires you to read two papers from the given list. Each assignment is given a period of two weeks and due by the midnight of the given Friday with no penalty grace period until 9am on the following Saturday. Each assignment requires a student to write a critique or reading summary about each of the two papers you read during that two weeks. Preferably, the two papers you choose to read are relevant to the lecture topics covered in the lectures of that two weeks if you select from the list provided by the instructor. You can find the reading recommendations in TSquare under Assignments Tab. The longer reading list for each topic covered by the class can be found at Course Reading.
Each programming assignment consists of a few options for you to choose from. They are made available on TSquare under Assignment Tab.
Written Technology reviews of topics. Topics will come from weekly lecture and class discussions and guest presentations as well as papers assigned for reading or in the reading list. You are required write a technology review of 10-15 pages in MS word, including figures and references you reading. This technology review paper is due by 11pm on the final exam day.
Example: Write a 10-page summary of the current state of mobile Internet technology addressing the following:
The course project should be significant (non-trivial) and related in some way to the Internet Application Development. Students may work in groups of up to 4 students per project. You are encouraged to come up with your own project ideas. If you are having trouble, talk to me and I will help you out with some possibilities.
In principle, you can propose anything you wish: implementation, benchmarking, evaluation, interesting Internet applications, etc. People from companies may propose projects related to their work. However, all the project related material must not be proprietary, i.e., I will not sign any non-disclosure agreement just to evaluate a project.
The Important due dates for project proposal, project demo, project presentation and project code and documentation deliverable can be found from the TSquare Wiki page.
There is no text book required for this course. The course material comes primarily from course notes and a selection of recent papers on advanced Internet application development and related research issues, including search engines, Web servers, Wide-area distributed information systems, and Web database systems. However, there may be text books that you will find useful or even necessary in order to complete your course projects.