Reading list for CS8803HPC High Performance Communication
(Note: this list is tentative. Only selected papers will be discussed in detail; the rest are provided for reference only.)
Introduction and Networking Technologies
- Class overview: High Performance Communication: Issues, Challenges and Applications
- Overview of platforms and tools for class projects: ILAB resources, IXP Software Development Tools, IXP Architecture Development Tool, Infiniband cluster
- M. Venkatachalam, P. Chandra, R. Yavatkar, A Highly Flexible, Distributed Multiprocessor Architecture for Network Processing, Computer Networks, 2003
Network Interfaces
- H. Frazier and H. Johnson. Gigabit ethernet: From 100 to 1,000 Mbps. IEEE Internet Computing, 3(1), 1999
- Gigabit Ethernet - Technology Overview
- W. Feng, G. Hurwitz, H. Newman, S. Ravot, L. Cottrell, O. Martin, F. Coccetti, C. Jin, D. Wei, and S. Low. Optimizing 10-Gigabit Ethernet for Networks of Workstations, Clusters and Grids: A Case Study, SC2003: High-Performance Networking and Computing Conference, Phoenix AZ, November 2003.
- Fabrizio Petrini, Salvador Coll, Juan Fernandez Peinador, Eitan Frachtenberg (LANL), Scalable Collective Communication on the ASCI Q Machine, Hot Interconnects 11, 2003
- Fabrizio Petrini, Wu-chun Feng, Adolfy Hoisie, Salvador Coll, and Eitan Frachtenberg. The Quadrics Network (QsNet): High-Performance Clustering Technology. In Hot Interconnects 9, Stanford University, Palo Alto, CA, August 2001
- Introduction to InfiniBand, A white paper for Mellanox Corporation
- N.J. Boden, D. Cohen, R.E. Felderman, A.E. Kulawik, C.L. Seitz, J.N. Seizovic, and W. Su. Myrinet: A Gigabit-per-second Local Area Network. IEEE Micro, 15(1):29--36, February 1995
- J. Liu, B. Chandrasekaran, J. Wu, W. Jiang, S. Kini, W. Yu, D. Buntinas, P. Wyckoff, and D. K. Panda, Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics, SuperComputing (SC) Conference, November, 2003
- InfiniBand -- Industry Standard Data Center Fabric is Ready for Prime Time
- Jiuxing Liu, Balasubramanian Chandrasekaran, Weikuan Yu, Jiesheng Wu, Darius Buntinas, Sushmitha Kini, Peter Wyckoff, Dhabaleswar K. Panda, Micro-Benchmark Level Performance Comparison of High-Speed Cluster Interconnects, Hot Interconnects 2003
- David Mayhew, Venkata Krishnan (Stargen), PCI Express and Advanced Switching: Evolutionary Path to Building Next Generation Interconnects, Hot Interconnects 2003
- Ajay V. Bhatt, Creating a Third Generation I/O Interconnect, Intel Corporation, White paper
- Lloyd Dickman, Beyond Hero Numbers: Factors Affecting Interconnect Performance
High Performance Protocols
- John Kubiatowicz and Anant Agarwal, Anatomy of a Message in the Alewife Multiprocessor, in Proceedings of the 7th ACM International Conference on Supercomputing (ICS), July, 1993.
- Chandramohan A. Thekkath and Henry M. Levy, Limits to Low-Latency Communication on High-Speed Networks, ACM Transactions on Computer Systems, 11(2), May 1993, pp. 179-203.
- Greg Buzzard, David Jacobson, Milon Mackey, Scott Marovich and John Wilkes, An Implementation of the Hamlyn Sender-Managed Interface Architecture, OSDI-2, 1996.
- Randy Osborne, A Hybrid Deposit Model for Low Overhead Communication in High Speed LANs, Mitsubishi Electric Research Laboratories Technical Report TR94-02c.
- Eric A. Brewer, Fredric T. Chong, Lok T. Liu, Shamik D. Sharma, and John Kubiatowicz. Remote Queues: Exposing Message Queues for Optimization and Atomicity. Appears in SPAA '95
- Thorsten von Eicken, Anindya Basu, Vineet Buch and Werner Vogels, U-Net: A User-Level Network Interface for Parallel and Distributed Computing, SOSP-15, 1995.
- P. Willmann, H.-Y. Kim, V. S. Pai, S. Rixner, An Efficient Programmable 10 Gigabit Ethernet Network Interface Card, The International Symposium on High-Performance Computer Architecture (HPCA), San Francisco, CA, (February, 2005)
- P. Shivam, P. Wyckoff and D.K. Panda, Can User Level Protocols Take Advantage of Multi-CPU NICs?, Proc. IPDPS '02, April 2002
- H.-W. Jin, S. Narravula, G. Brown, K. Vaidyanathan, P. Balaji, and D. K. Panda, Performance Evaluation of RDMA over IP: A Case Study with the Ammasso Gigabit Ethernet NIC, Proceedings of Workshop on High Performance Interconnects for Distributed Computing (HPI-DC); In conjunction with HPDC-14, Research Triangle Park, NC, USA, July 2005.
- Brecht et al., Evaluating Network Processing Efficiency with Processor Partitioning and Asynchronous I/O, EuroSys 06
- Hyong-youb Kim and Scott Rixner, TCP Offload through Connection Handoff, EuroSys 06
- W. Feng, L. Bhuyan, Performance Characterization of a 10 Gigabit Ethernet TOE, HotI-05
Programmable Networking
- The Case for RDMA, IETF Draft
- Allyn Romanow, Stephen Bailey, RDMA over IP
- Callaghan, T. Lingutla-Raj, A. Chiu, P. Staubach and O. Asad, NFS over RDMA, Proceedings of the ACM 2003 SIGCOMM Workshops
- Murali Rangarajan and Liviu Iftode, Building a User-level Direct Access File System over Infiniband, SAN-3
- Andrew Gallatin, Jeff Chase and Ken Yocum, Trapeze/IP: TCP/IP at Near-Gigabit Speeds, 1999 USENIX Technical Conference, June, 1999
- Socket over -
- P. Bhargava, S. Krishnan, R. Panigrahy (Cisco Systems), Efficient Multicast on a Terabit Router, Hot Interconnects 2004.
- L. Brakmo, S. O'Malley, and L. Peterson. TCP Vegas: New techniques for congestion detection and avoidance. In Proceedings of the SIGCOMM '94 Symposium (Aug. 1994) pages 24-35
- M. Mathis, J. Mahdavi, Forward Acknowledgment: Refining TCP Congestion Control, Proceedings of SIGCOMM'96, pp. 281-291, August, 1996, Stanford, CA.
- E. Weigle and W. Feng, A Comparison of TCP Automatic-Tuning Techniques for Distributed Computing, IEEE Symposium on High-Performance Distributed Computing (HPDC'02), Edinburgh, Scotland, July 2002
- Armando L Caro Jr., Keyur Shah, Janardhan R. Iyengar, Paul D. Amer, Randall R. Stewart, Congestion Control: SCTP vs TCP
- Qi He, and Karsten Schwan, IQ-RUDP: Coordinating Application Adaptation with Network Transport, High Performance Distributed Computing (HPDC-11), ACM/IEEE, July 2002.
- Jin-Soo Kim, Kangho Kim, and Sung-In Jung, SOVIA: A User-level Sockets Layer over Virtual Interface Architecture, Proc. of Cluster 2001 conference, Oct. 2001
- P. Balaji, P. Shivam, P. Wyckoff and D. K. Panda, High Performance User-Level Sockets over Gigabit Ethernet, Cluster '02, Sept. 2002
- James Pinkerton et al., Sockets Direct Protocol (SDP) for iWARP over TCP, Internet Draft
- Jiuxing Liu, Wei Huang, Bulent Abali and Dhabaleswar K. Panda, High Performance VMM-Bypass I/O in Virtual Machines, USENIX Annual Technical Conference 2006
Programming Models and Environments
- Robert Morris, Eddie Kohler, John Jannotti and M. Frans Kaashoek, The Click Modular Router, in Proceedings of the 17th ACM Symposium on Operating Systems Principles, 1999.
- Andrew T. Campbell, Stephen T. Chou, Michael E. Kounavis, Vassilis D. Stachtos and John Vicente, NetBind: A Binding Tool for Constructing Data Paths in Network Processor-Based Routers, OPENARCH'02.
- Tammo Spalink, Scott Karlin, Larry Peterson and Yitzchak Gottlieb, Building a Robust Software-Based Router Using Network Processors, SOSP-18, 2001.
- Campbell A.T., De Meer H.G., Kounavis M.E., Miki K., Vicente J., and Villela D., A Survey of Programmable Networks, Computer Communications Review, April 1999
- W.S. Marcus, I. Hadzic, A.J. McAuley, J.M. Smith, Protocol Boosters: Applying Programmability to Network Infrastructures, IEEE Communications Magazine, vol. 36, no. 10, October, 1998
- David E. Taylor, Jonathan S. Turner, John W. Lockwood, Dynamic Hardware Plugins (DHP): Exploiting Reconfigurable Hardware for High-Performance Programmable Routers, OPENARCH'01, 2001
- Kenneth Mackenzie, Weidong Shi, Austen McDonald and Ivan Ganev, An Intel IXP1200-based Network Interface, SAN-2 2003
- Tilman Wolf and Jonathan S. Turner, Design issues for high-performance active routers, IEEE Journal on Selected Areas in Communications, vol. 19, no. 3, pp. 404-409, Mar. 2001.
- J. Marshall, Cisco Systems - Toaster 2
- H. Kim, V. Pai, S. Rixner, Exploiting Task-Level Concurrency in a Programmable Network Interface, ACM SIGPLAN Symposium on Principles and Practices of Parallel Programming (PPoPP), pp. 61-72, San Diego, CA, (June, 2003)
- H. Kim, V. Pai, S. Rixner, Increasing Web Server Throughput with Network Interface Data Caching, International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), pp. 239-250, San Jose, CA, (October, 2002)
- Jayaram Mudigonda, Harrick M. Vin and Raj Yavatkar, A Case for Data Caching in Network Processors
- Deepak Suryanarayanan, John Marshall, Gregory T. Byrd, A Methodology and Simulator for the Study of Network Processors, NP-1, 2002 (handout)
- John Marshall, Cisco Systems - Toaster2 (handout)
- Peter Glaskowsky, Toaster3 Pops Up at MPF, MPR
- Robin Melnick and Keith Morris, AMCC nPcore NISC Architecture, NP-2, 2003 (handout)
- Ramaswamy Ramaswamy, Ning Weng, and Tilman Wolf, Application Analysis and Resource Mapping for Heterogeneous Network Processor Architectures, NP3, 2004
High-Performance Middleware and Applications
- Thorsten von Eicken, David E. Culler, Seth Copen Goldstein and Klaus Erik Schauser, Active Messages: a Mechanism for Integrated Communication and Computation, in ISCA, 1992
- Message Passing Interface, MPI
- CPlant Portals (Sandia Lab)
- A. Wagner, H.-W. Jin, D. K. Panda and R. Riesen, NIC-Based Offload of Dynamic User-Defined Modules for Myrinet Clusters. IEEE Cluster Computing 2004
- Virtual Machine Interface: Communicating Efficiently on Cluster Based Grids with MPICH-VMI, Cluster 2004.
- Intel IXA Programming Model
- Harrick M. Vin, Jayaram Mudigonda, Jamie Jason, Erik J. Johnson, Roy Ju, Aaron Kunze, and Ruiqi Lian, Shangri-La: A Programming Environment for Packet-processing Systems: Design Considerations, NP3, 2004
Evaluation and Tools
- Greg Eisenhauer, Fabian Bustamante, and Karsten Schwan, Event Services for High Performance Computing, High Performance Distributed Computing (HPDC-9), 2000
- Delta Airlines Case Study: MAFI and Tuxedo
- IBM TPF
- J. Turner, Network Virtualization
- M. Kounavis, A. Kumar, H.M. Vin, R. Yavatkar, and A. Campbell, Directions in Packet Classification for Network Processors, In Network Processor Design: Issues and Practices, Volume 2, Edited by P. Crowley, M. Franklin, H. Hadimioglu, and P. Onufryk, Morgan Kaufmann, 2003
- Chris Clark, Wenke Lee, David Schimmel, Didier Contis, Mohamed Kone and Ashley Thomas, A Hardware Platform for Network Intrusion Detection and Prevention, Workshop on Network Processors and Applications, 2004
- A. Raniwala, T-C. Chiueh, Evaluation of a Wireless Enterprise Backbone Network Architecture, Hot Interconnects 2004.
- P. Apparao and M. Bhat, (Intel), A Detailed Look at the Characteristics of XML Parsing, BEACON 2004.
- Francis Chang, Wu-chang Feng, Wu-chi Feng, Kang Li, Efficient Packet Classification with Digest Caches, in Proc. of the Third Workshop on Network Processors & Applications (NP3), February 2004
- P. Gambhire, Implementing QoS Mechanisms on the Motorola C-Port C-5e Network Processor, 2003
- Jaroslaw Sydir et al., Voice over AAL2 on a NP, 2003
- Video over IP on a NP, Path Communications
- Content Delivery and Content Manipulation, L7 Services
- Patrick Crowley, Marc Fiuczynski, Jean-Loup Baer, and Brian Bershad, Workloads for Programmable Network Interfaces, Workload Characterization for Computer System Design, 2000. IEEE 2nd Annual Workshop on Workload Characterization, Oct. 1999.
- Ramaswamy Ramaswamy and Tilman Wolf, PacketBench: A tool for workload characterization of network processing, in Proc. of IEEE 6th Annual Workshop on Workload Characterization (WWC-6), Austin, TX, Oct. 2003
- Tilman Wolf and Mark A. Franklin. CommBench - a telecommunications benchmark for network processors. In Proc. of IEEE ISPASS, pages 154-162, Austin, TX, April 2000
- Gokhan Memik, William H. Mangione-Smith, and Wendong Hu. NetBench: A benchmarking suite for network processors. In Proc. of International Conference on Computer-Aided Design, San Jose, CA, November 2001
- Byeong Kil Lee, Lizy Kurian John, NpBench: A Benchmark Suite for Control plane and Data plane Applications for Network Processors, Int'l Conf. on Computer Design, 2003
- V. Pai, S. Rixner, H. Kim, Isolating the Performance Impacts of Network Interface Cards through Microbenchmarks, ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), New York, NY, (July, 2004)
- Y. Luo, L. Bhuyan, NePSim: A Network Processor Simulator with Power Evaluation Framework, IEEE Micro, Special Issue on Network Processors, September/October 2004
- Mark A. Franklin and Tilman Wolf, Power considerations in network processor design, in Proc. of Second Network Processor Workshop (NP-2) in conjunction with Ninth International Symposium on High Performance Computer Architecture (HPCA-9), Anaheim, CA, Feb. 2003
- Agere Systems Functional Programming Language (Product Brief, White Paper)
- Intel IXP Autopartitioning compiler