Jacob Eisenstein

I'm an Assistant Professor in the School of Interactive Computing at Georgia Tech, where I lead the Computational Linguistics Laboratory. I work on machine learning approaches to understanding human language. I'm especially interested in non-standard language, discourse, computational social science, and statistical machine learning.

publications | teaching | twitter | biographical sketch | code and data

Some recent publications

Audience-modulated variation in online social media. Pavalanathan and Eisenstein. American Speech, May 2015.
Twitter users modulate their use of non-standard lexical items by the intended audience, with non-standard language used most in strong local ties. [preprint]
Systematic Patterning in Phonologically-Motivated Orthographic Variation. Eisenstein. Journal of Sociolinguistics, April 2015.
Patterns of phonological variation find their way into writing at a surprisingly deep level, with written analogues of spoken variables reflecting syntactic and phonological conditioning. [preprint]
One Vector is Not Enough: Entity-Augmented Distributed Semantics for Discourse Relations. Ji and Eisenstein. TACL, volume 3 (2015).
A two-pass recursive neural network for identifying implicit discourse relations in the Penn Discourse Treebank. To be presented at EMNLP 2015.
“You're Mr. Lebowski, I'm The Dude”: Inducing address term formality in signed social networks. Krishnan and Eisenstein. NAACL 2015 (best student paper!).
Finding address terms in text, and inducing their formality in a joint probabilistic model of content and social network structure.
Unsupervised multi-domain adaptation with feature embeddings. Yang and Eisenstein. NAACL 2015.
A simple, effective approach to unsupervised domain adaptation in NLP applications. Also, the first algorithm for unsupervised domain adaptation across a space of many multi-attribute domains. [code]
Diffusion of lexical variation in online social media. Eisenstein, O'Connor, Smith, and Xing. PLOS-ONE, November 2014.
An analysis of changes in word popularity over time and space, correlated with macro-level demographic features of American metropolitan areas.

Recent and upcoming presentations

  • Workshop on Data-Driven Approaches to Networks and Language (keynote), Lyon, May 2016.
  • Text as Data Speaker Series, NYU, 2016.
  • LSA Workshop on "Preparing your Corpus for Archival Storage", January 2016.
  • Columbia University, IGERT Distinguished Speaker Series, November 2015.
  • Bloomberg, invited speaker, November 2015.
  • Text as Data Conference, NYU, October 2015.
  • TextLink Workshop: Identification and Annotation of Discourse Relations in Spoken Language (Invited Keynote), Saarbrucken, October 2015.
  • University of Copenhagen, September 2015.
  • Empirical Methods in Natural Language Processing, Lisbon, September 2015. Invited talk at the LSDSem workshop. [slides]
  • Jelinek Memorial Summer Workshop on Language Technology, Seattle, July-August 2015. [slides on discourse modeling]
  • NAACL, Denver, June 2015. Invited keynote at the SocialNLP workshop. [slides]
  • International Conference on Learning Representations, San Diego, May 2015
  • AFOSR Program Review, May 2015
  • DIGI@UGA Digital Humanities event, University of Georgia, April 2015 [slides]
  • Southeastern Conference on Linguistics, Raleigh NC, April 2015
  • University of Georgia Digital Humanities Day, April 2015
  • Stanford Linguistics Colloquium, February 2015
  • American Association for the Advancement of Science Annual Meeting, February 2015 in San Jose [slides]
  • NIPS Workshop on Modern Machine Learning and Natural Language Processing, Montreal, December 2014 [slides]
  • Symposium on Computation and Journalism, New York, October 2014
  • Methods in Dialectology (Keynote), Groningen, August 2014
  • Stanford Conference on Computational Social Science, April 2014. [video]

Professional service

  • Co-chair: 2016 EMNLP Workshop on NLP and Computational Social Science
  • Co-chair: 2014 ACL Workshop on Language Technologies and Computational Social Science
  • Co-chair: 2013-2015 Atlanta Workshop on Computational Social Science
  • Area chair: NAACL 2016, ACL 2014, EACL 2013
  • Student research workshop, faculty advisor: NAACL 2016
  • Student awards coordinator: ICML 2013
  • Student volunteer chair: NAACL 2013
  • Tutorial co-chair: NAACL 2012
  • Program committees (past 12 months): ACL, CONLL, EMNLP, ICML, NAACL, NIPS. All these venues are open-access (OA).
  • Editorial boards: Linguistic Issues in Language Technology (OA), Language Variation series at Language Science Press (OA)
  • Journal Reviewing: Communications of the ACM, Computational Linguistics (OA), Journal of Machine Learning Research (OA), Journal of Artificial Intelligence Research (OA), Machine Learning Journal, Transactions of the Association of Computational Linguistics (OA), Journal of the American Statistical Association, Language in Society, Proceedings of the National Academy of Science, Digital Scholarship in the Humanities, ...
All this reviewing takes a lot of time! Please don't be offended if I decline additional requests, particularly from non-OA venues.


School of Interactive Computing
Georgia Institute of Technology
85 Fifth St. NW
Atlanta, GA 30308
Admin: Cynthia Bryant, 404 894 3807
Most preferred: jacobe (at) gatech (dot) edu