I am an Assistant Professor at CSE, College of Computing at Georgia Institute of Technology. I direct the CLAWS Data Science research group. I develop data science, machine learning, and AI solutions for the pressing challenges pertaining to the safety, integrity, and well-being of users, content, platforms, and communities in the cyber space. These have ever-lasting impact on public health, democracy, science, and society. I investigate the social factors and the technological factors that exacerbate the threats that harm others and develop cutting-edge solutions for early identification, prediction, and mitigation. I also work to quantify and improve the robustness of critical cybersafety systems against smart adversaries. I create methods to improve the reliability and trustworthiness of the current technology, and innovate the next-generation of trustworthy adversarially-robust systems that can empower users and improve cybersafety broadly throughout the internet. Towards these ends, I have pioneered the development of platform integrity algorithm that are being used at Flipkart (India's largest e-commerce platform, acquired by Walmart) and has influenced Twitter's Birdwatch platform (community-driven misinformation detection platform).

Prior to Georgia Tech, I was a visiting researcher at Google AI, a postdoctoral researcher at Stanford University, and a PhD student at the University of Maryland. I am honored to be included in Forbes 30 under 30 Science (2022) and to receive the Facebook Faculty Research Award, Adobe Faculty Research Award, Class of 1969 Teaching Fellow, ACM SIGKDD Doctoral Dissertation Award 2018 runner-up, WWW 2017 Best Paper Award runner-up, Larry S. Davis Doctoral Dissertation Award 2017, and Dr. B.C. Roy Gold Medal. I have also been selected as a Computing Innovation Fellows Mentor 2021-2023. My work has been covered in a documentary (Familiar Shapes), in a radio interview (WABE), and by popular press, including Wired, CNN, Wall Street Journal, Tech Crunch, New York Magazine, and more.

My research interests spans the following topics:

(1) Web safety, integrity, and well-being: I develop methods to efficiently characterize the behavior of and detect both malicious actors and harmful content. Accurate characterization and early detection can greatly improve the safety, integrity, and well-being of online users, communities, and platforms. I have worked on the following type of bad behavior:

(2) Trustworthy and Responsible AI: Machine learnind and deep learning models are being used for high-stakes tasks. However, their trustworthiness, reliability, and robustness against manipulation by smart adversaries and to unintentional changes in data is not known. I have explored how adversaries can manipulate recommender systems for their gains. I have conducted the first investigate to quantify the trustworthiness of Facebook's TIES deep learning-based fraud detection models [ACM SIGKDD 2021], graph-based models [ACM CIKM 2021b], and community-driven counter misinformation platform used at Twitter's Birdwatch [ASONAM 2021b].

(3) Recommender systems and Behavior Modeling: Recommender systems power much of the content and products that we see online. I develop user-based and graph-based efficient recommender systems that are accurate, scalable, and trustworthy [ACM CIKM 2021, ACM SIGKDD 2019]. I also investigate how malicious actors can manipulate deep learning-powered recommender systems for their ulterior motives. I create new techniques to quantify this robustness and innovate new adversarially-robust deep recommender system architectures, to usher an era of trustworthy recommendations. Relevant works include:

(4) Graphs and Networks: Modeling and predicting over large-scale graphs and networks is crucial to mine actionable insights from large inter-connect data, including social networks, e-commerce networks, knowledge graphs, spatio-temporal networks, and interaction networks. Relevant works include:

(5) Multimodal data: Online and real-world data has information that spans multiple modalities, including text, multimedia (image, video, GIFs), and networks. I have researched methods to extract complex interaction patterns and knowledge from across modality lines to develop actionable insights in various data settings, such as social media data, video-based meetings/conversations, and debates. Relevant works include:

Postdoc opening. There is an opening for one postdoc in my group, who will get to work on very exciting problems in recommendation systems and/or graphs. There will also be opportunities to work with real industry data. If you have done research in the recommendation systems area, please send me your CV asap at srijan@gatech.edu.

Recent News

Publications

For my complete list of publications, please refer to my Google Scholar profile.

Highlights (selected from the full list below)

  • PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models. [PDF] NEW!
    Bing He, Mustaque Ahamad, Srijan Kumar
    ACM SIGKDD 2021 – 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
    [Project page with data and code] [Link to presentation (pptx)] [Link to presentation (pdf)]
  • Influence-guided Data Augmentation for Neural Tensor Completion. [PDF] NEW!
    Sejoon Oh, Sungchul Kim, Ryan Rossi, Srijan Kumar
    ACM CIKM 2021 – 30th ACM International Conference on Information and Knowledge Management, 2021
    [Project page with data and code]
  • Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis NEW!
    Bing He, Caleb Ziems, Sandeep Soni, Naren Ramakrishnan, Diyi Yang, Srijan Kumar
    IEEE/ACM ASONAM 2021 – The 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
    [Project page with data and code]
  • HawkEye: A Robust Reputation System for Community-based Misinformation Detection. [PDF] NEW!
    Rohit Mujumdar, Srijan Kumar
    IEEE/ACM ASONAM 2021 – The 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
  • The Role of the Crowd in Countering Misinformation: A Case Study of the COVID-19 Infodemic. [PDF]
    Nicholas Micallef*, Bing He*, Srijan Kumar, Mustaque Ahamad, Nasir Memon (* = equal contribution)
    IEEE Big Data 2020 -- Full Paper, research track (top 15%)
    [Project page with data and code]
  • Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks [PDF]
    Srijan Kumar, Xikun Zhang, Jure Leskovec
    ACM SIGKDD, 2019 – 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2019

  • List of all publications

    Pre-prints

    Conference, Journal, and Other Publications

    1. M2P2: Multimodal Persuasion Prediction using Adaptive Fusion [PDF]
      Chongyang Bai, Haipeng Chen, Srijan Kumar, Jure Leskovec, V.S. Subrahmanian
      IEEE TMM 2021 – IEEE Transactions on Multimedia NEW!
      [Project page with data and code]
    2. PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-based Classification Models. [PDF] NEW!
      Bing He, Mustaque Ahamad, Srijan Kumar
      ACM SIGKDD 2021 – 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
      [Project page with data and code] [Link to presentation (pptx)] [Link to presentation (pdf)]
    3. Influence-guided Data Augmentation for Neural Tensor Completion. [PDF] NEW!
      Sejoon Oh, Sungchul Kim, Ryan Rossi, Srijan Kumar
      ACM CIKM 2021 – 30th ACM International Conference on Information and Knowledge Management, 2021
      [Project page with data and code]
    4. Racism is a Virus: Anti-Asian Hate and Counterspeech in Social Media during the COVID-19 Crisis NEW!
      Bing He, Caleb Ziems, Sandeep Soni, Naren Ramakrishnan, Diyi Yang, Srijan Kumar
      IEEE/ACM ASONAM 2021 – The 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
      [Project page with data and code]
    5. HawkEye: A Robust Reputation System for Community-based Misinformation Detection. [PDF] NEW!
      Rohit Mujumdar, Srijan Kumar
      IEEE/ACM ASONAM 2021 – The 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
    6. Evaluating Graph Vulnerability and Robustness using TIGER. NEW!
      Scott Freitas, Diyi Yang, Srijan Kumar, Hanghang Tong, Polo Chau
      ACM CIKM 2021 – 30th ACM International Conference on Information and Knowledge Management, 2021
      [Project page]
    7. Deception Detection in Group Video Conversations using Dynamic Interaction Networks. [PDF]
      Srijan Kumar, Chongyang Bai, VS Subrahmanian, Jure Leskovec
      AAAI ICWSM 2021 – 15th International AAAI Conference on Web and Social Media, 2021
      [Project page with data]
    8. The Role of the Crowd in Countering Misinformation: A Case Study of the COVID-19 Infodemic. [PDF]
      Nicholas Micallef*, Bing He*, Srijan Kumar, Mustaque Ahamad, Nasir Memon (* = equal contribution)
      IEEE Big Data 2020 -- Full paper in research track
      [Project page with data and code]
    9. Higher-Order Label Homogeneity and Spreading in Graphs. [PDF]
      Dhivya Eswaran, Srijan Kumar, Christos Faloutsos
      ACM Web (WWW), 2020 – The ACM Web Conference, 2020
      [Github page with code and data]
    10. User Engagement with Digital Deception
      Maria Glenski, Svitlana Volkova, Srijan Kumar
      Peer-reviewed book chapter in 'Disinformation, Misinformation, and Fake News in Social Media 2020' by Springer.
    11. Predicting Dynamic Embedding Trajectory in Temporal Interaction Networks [PDF]
      Srijan Kumar, Xikun Zhang, Jure Leskovec
      ACM SIGKDD, 2019 – 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2019 [Oral presentation, research track (top 9%)]
      New dataset released: Account blocks on Wikipedia and Reddit. Link below.
      [Github page with code and data] [Slides] [Short explanation video]

      Included in the curriculum at: UCSD, Purdue University, LMU Munchen.

    12. Predicting the Visual Focus of Attention Prediction in Multi-person Discussion Videos [PDF]
      Chongyang Bai, Srijan Kumar, Jure Leskovec, Miriam Metzger, Jay Nunamaker, V.S. Subrahmanian
      IJCAI, 2019 – International Joint Conference on Artificial Intelligence, 2019
      New dataset released: 62 dynamic networks of who-interacts-with-whom. Link below.
      [Dataset] [Project page with demo]
    13. Predicting Dominance in Multi-person Videos [PDF]
      Chongyang Bai, Maksim Bolonkin, Srijan Kumar, Jure Leskovec, Judee Burgoon, Norah Dunbar, V.S. Subrahmanian
      IJCAI, 2019 – International Joint Conference on Artificial Intelligence, 2019
      [Dataset] [Project page with demo]
    14. Community Interaction and Conflict on the Web [PDF]
      Srijan Kumar, William L. Hamilton, Jure Leskovec, Dan Jurafsky
      ACM Web (WWW), 2018 – The ACM Web Conference, 2018
      New dataset released: Reddit community-to-community interlinks and harassment attacks
      [Project page: Data and Code] [Presentation slides (pptx)] [Presentation slides (pdf)]

      Included in the curriculum at: University of Waterloo

      Press: Russian spam accounts are still a big problem for Reddit (Engadget), What Reddit Tells Us About Political Coalitions and Conflicts (The Atlantic), Most Reddit battles are started by 1 percent of communities (Engadget), Tiny percent of Reddit communities spark majority of conflicts (CNET), One Percent of Subreddits Are Responsible for Most of the Raids on Reddit (VICE), and more by Inverse, TheNextWeb, theregister.co.uk

    15. Rev2: Fraudulent User Prediction in Rating Platforms [PDF]
      Srijan Kumar, Bryan Hooi, Disha Makhija, Mohit Kumar, Christos Faloutsos, V.S. Subrahmanian
      ACM WSDM, 2018 – 11th ACM International Web Search and Data Mining Conference, 2018
      New dataset released: Fraudsters on Amazon, Bitcoin networks, and Epinions.
      [Project page: Data and Codes] [Presentation slides (pptx)] [Poster]

      Included in the curriculum at: Stanford University

    16. False Information on Web and Social Media: A Survey [PDF]
      Srijan Kumar, Neil Shah
      Invited book chapter in Social Media Analytics: Advances and Applications, CRC Press, 2018
    17. Breaking Bad: Forecasting Adversarial Android Bad Behavior
      S. Li*, Srijan Kumar*, Tudor Dumitras, and V.S. Subrahmanian. (* indicates equal contribution).
      CyberSecurity, 2018 - From Database to Cybersecurity, 2018.
    18. Measuring the Evolution of a Scientific Field through Citation Frames. [PDF]
      David Jurgens, Srijan Kumar, Raine Hoover, Dan McFarland, Dan Jurafsky
      TACL, 2018 – Transactions of the Association for Computational Linguistics, 2018
      [Project page with data] [Code]
    19. Demand-Driven Single- and Multitarget Mixture Preparation Using Digital Microfluidic Biochips.
      Shalu, Srijan Kumar, A. Singla, Sudip Roy, K. Chakrabarty, P. P. Chakrabarti, and B. B. Bhattacharya.
      TODAES, 2018 - ACM Transactions on Design Automation of Electronic Systems.
    20. An Army of Me: Sockpuppets in Online Discussion Communities. [PDF]
      Srijan Kumar, Justin Cheng, Jure Leskovec, V.S. Subrahmanian.
      ACM Web (WWW), 2017 – 26th International World Wide Web (The ACM Web) Conference, 2017

      Best Paper Award Honorable Mention


      [Presentation Slides]

      Included in the curriculum at: University of Michigan, Virginia Tech University, Stanford University, Penn State University, Saarland University, and University of Freiburg.

      Documentary: Familiar Shapes by Heather D. Freeman

      Press: Sock puppet accounts unmasked by the way they write and post (New Scientist), Tool unmasks online puppeteers (New Scientist, print version), Spotting sockpuppets with science (TechCrunch), Sock Puppet Accounts on the Internet Getting You Down? Here’s How to Spot Them (WOWscience)

    21. Predicting Human Behavior: The Next Frontiers.
      V.S. Subrahmanian, Srijan Kumar.
      Science, 2017 – Science, vol. 355, issue 6324, pp. 489, 2017
    22. Spectral Lens: Explainable Diagnostics, Tools and Discoveries in Directed, Weighted Graphs
      Sebastian Goebl, Srijan Kumar, Christos Faloutsos
      IEEE ICDM, 2017 – IEEE International Conference on Data Mining, 2017
    23. Data-Driven Approaches towards Malicious Behavior Modeling
      Meng Jiang, Srijan Kumar, V.S. Subrahmanian, Christos Faloutsos
      ACM SIGKDD, 2017 (Tutorial) – Tutorial at 23rd ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2017
    24. Antisocial Behavior on the Web: Characterization and Detection
      Srijan Kumar, Justin Cheng, Jure Leskovec
      ACM Web (WWW), 2017 (Tutorial) – 26th International World Wide Web (The ACM Web) Conference, 2017
    25. Edge Weight Prediction in Weighted Signed Networks. [PDF]
      Srijan Kumar, Francesca Spezzano, V.S. Subrahmanian, Christos Faloutsos
      IEEE ICDM, 2016 – IEEE International Conference on Data Mining, 2016

      Top 10 most cited papers of ICDM in the last 5 years. [Link]


      New datasets released: Weighted, signed, temporal networks from Bitcoin OTC and Bitcoin Alpha.
      [Project page: Code] [Bitcoin OTC] [Bitcoin Alpha]
    26. Disinformation on the Web: Impact, Characteristics and Detection of Wikipedia Hoaxes. [PDF]
      Srijan Kumar, Robert West, Jure Leskovec.
      ACM Web (WWW), 2016 – 25th International World Wide Web (The ACM Web) Conference, 2016
      New datasets released: Hoax articles on Wikipedia
      [Project page: Data and Code]

      Included in the curriculum at: UIUC, University of Waterloo, McGill University, Texas A&M University, University of Hawaii, University of Freiburg, Leibniz University, Hannover, University of Waterloo, University of Alberta, University of Wellington, New Zealand, and Bari BigData winter school 2017.

      Press: Don't Ask Wikipedia To Cure the Internet (WIRED), Can Wikipedia Solve YouTube's Conspiracy Theory Problem? (Motherboard)

    27. Structure and Dynamics of Signed Citation Networks. [PDF]
      Srijan Kumar.
      ACM Web (WWW), 2016 companion - 25th International World Wide Web (The ACM Web) Conference companion, 2016.
      [Project page: Data and Code]
    28. Identifying Malicious Actors on Social Media
      Srijan Kumar, Francesca Spezzano, V.S. Subrahmanian
      IEEE/ACM ASONAM, 2016 tutorial – Advances in Social Network Analysis and Mining, 2016
    29. Stubborn Mining: Generalizing Selfish Mining and Combining with an Eclipse Attack. [PDF]
      Kartik Nayak*, Srijan Kumar*, Andrew Miller and Elaine Shi. (* indicates equal contribution).
      Euro S&P, 2016 - IEEE European Symposium on Security and Privacy, 2016.
    30. VEWS: A Wikipedia Vandal Early Warning System. [PDF]
      Srijan Kumar, Francesca Spezzano, V. S. Subrahmanian.
      ACM SIGKDD, 2015 – 21th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2015
      New datasets released: Vandals and vandalism on Wikipedia
      [Project page: Data and Code]
    31. Linguisitic Harbingers of Betrayal: A Case Study on an Online Strategic Game. [PDF]
      Vlad Niculae, Srijan Kumar, Jordan Boyd-Graber, Cristian Danescu-Niculescu-Mizil.
      ACL, 2015 – 51st Conference of the Association for Computational Linguistics, 2015
      New datasets released: Deception in Diplomacy, a conversation-based online game
      [Project page: Data and Code]

      Press: Should you worry about people who are too polite? (CNN), When Diplomacy Leads to Betrayal (The Wall Street Journal), Here’s a Good Reason to Be Wary of Overly Polite People (New York Magazine) and more here.

    32. Layout-Aware Mixture Preparation of Biochemical Fluids on Application-Specific Digital Microfluidic Biochips. [PDF]
      Sudip Roy, Partha P. Chakrabarti, Srijan Kumar, Krishnendu Chakrabarty, Bhargab B. Bhattacharya.
      ACM TODAES, 2015 - ACM Transactions on Design Automation of Electronic Systems, 2015.
    33. Demand-Driven Mixture Preparation and Droplet Streaming using Digital Microfluidic Biochips. [PDF]
      Sudip Roy, Srijan Kumar, P. P. Chakrabarty, B. B. Bhattacharya and K. Chakrabarty.
      ACM/IEEE DAC, 2014 - ACM/IEEE Design Automation Conference, 2014.
    34. Accurately Detecting Trolls in Slashdot Zoo via Decluttering. [PDF]
      Srijan Kumar, Francesca Spezzano, V. S. Subrahmanian.
      IEEE/ACM ASONAM, 2014 – Advances in Social Network Analysis and Mining, 2014
      New datasets released: Trolls in Slashdot, a signed social networking platform
      [Project page: Data and Code]
    35. Automatic Classification and Analysis of Interdisciplinary fields in Computer science. [PDF]
      T. Chakrabarty, Srijan Kumar, D. Reddy, Suhansanu Kumar, Niloy Ganguly and Animesh Mukherjee.
      ASE/IEEE SocialCom, 2013 - ASE/IEEE International Conference on Social Computing.
    36. Routing-Aware Resource Allocation for Mixture Preparation in Digital Microfluidic Biochips. [PDF]
      Sudip Roy, P. P. Chakrabarty, Srijan Kumar, B. B. Bhattacharya and K. Chakrabarty.
      ISVLSI, 2013 - IEEE International Symposium on VLSI, 2013.
    37. Efficient Mixture Preparation of Biochemical Fluids using Digital Microfluidic Biochip. [PDF]
      Srijan Kumar, Sudip Roy, P. P. Chakrabarti, B. B. Bhattacharya and K. Chakrabarty.
      IEEE DDECS, 2013 - Sixteenth IEEE Symposium on Design and Diagnostics of Electronic Circuits and Systems, 2013.

    For all publications, please see my Google Scholar.