My research combines machine learning and linguistics to build natural language processing systems that are robust to contextual variation and offer new insights about social phenomena.
A complete list of publications can be found here, on Google scholar, and on Semantic scholar.
Here are some current themes and highlights:
- Making NLP robust to language variation
- Structure and dynamics of language change in online media
- A kernel independence test for geographical language variation. Nguyen and Eisenstein. Accepted to Computational Linguistics (MIT Press), 2017.
- The social dynamics of language change in online networks. Goel, Soni, Goyal, Paparrizos, Wallach, Diaz, and Eisenstein. International Conference on Social Informatics (SocInfo), 2016.
- Audience-modulated variation in online social media. Pavalanathan and Eisenstein. American Speech, May 2015. [preprint]
- Systematic patterning in phonologically-motivated orthographic variation. Eisenstein. Journal of Sociolinguistics, April 2015. [preprint]
- Diffusion of lexical variation in online social media. Eisenstein, O'Connor, Smith, and Xing. PLOS-ONE, November 2014. [pdf]
- Computational models of social interactional meaning
- Discourse context for language understanding
Recent and upcoming presentations
- ACL Workshop on Representation Learning for NLP (invited keynote), Vancouver, July 2017.
- Georgetown University, April 2017.
- Massachusetts Institute of Technology, March 2017.
- University of Twente, Netherlands, March 2017.
- Princeton University, February 2017.
- AAAI, San Francisco, February 2017.
- Carnegie Mellon University, December 2016.
- EMNLP, Austin, November 2016.
- DTRA Program Review, July 2016.
- NAACL, San Diego, June 2016.
- AFOSR Program Review, June 2016.
- Xerox Research Center Europe, May 2016. [slides]
- University of Toulouse, May 2016.
- École Normale Supérieure de Lyon, May 2016.
- Workshop on Data-Driven Approaches to Networks and Language (invited), Lyon, May 2016. [slides]
- AAAI Spring Symposium on Observational Studies through Social Media and Other Human-Generated Content, Stanford, March 2016.
- Text as Data Speaker Series, NYU, February 2016.
- Social Science Data Analytics Series, Michigan State University, January 2016. [slides]
- LSA Workshop on "Preparing your Corpus for Archival Storage", January 2016.
- Co-chair: 2016 EMNLP Workshop on NLP and Computational Social Science.
- Co-chair: 2014 ACL Workshop on Language Technologies and Computational Social Science
- Co-chair: 2013-2015 Atlanta Workshop on Computational Social Science
- Area chair: EMNLP 2017, NAACL 2016, ACL 2014, EACL 2013
- Student research workshop, faculty advisor: NAACL 2016
- Student awards coordinator: ICML 2013
- Student volunteer chair: NAACL 2013
- Tutorial co-chair: NAACL 2012
- Program committees (past 12 months): ACL, CONLL, EMNLP, ICML, NAACL, NIPS. All these venues are open-access (OA).
- Editorial boards: Linguistic Issues in Language Technology (OA), Language Variation series at Language Science Press (OA)
- Journal Reviewing: Communications of the ACM, Computational Linguistics (OA), Journal of Machine Learning Research (OA), Journal of Artificial Intelligence Research (OA), Machine Learning Journal, Transactions of the Association of Computational Linguistics (OA), Journal of the American Statistical Association, Language in Society, Proceedings of the National Academy of Science, Digital Scholarship in the Humanities, ...
- I prefer that my reviewing effort goes towards papers that everyone can read. So I usually will not review for venues that are non-OA, unless I submit papers there myself.
Georgia Tech affiliations
School of Interactive Computing
Georgia Institute of Technology
801 Atlantic Drive NW, Atlanta, GA 30308
Admin: Cynthia Jordan, +1 404 385 2827
Most preferred: jacobe (at) gatech (dot) edu