Jacob Eisenstein

I'm an Assistant Professor in the School of Interactive Computing at Georgia Tech, where I lead the Computational Linguistics Laboratory. I work on machine learning approaches to understanding human language. I'm especially interested in non-standard language, discourse, computational social science, and statistical machine learning.

publications | teaching | twitter | biographical sketch | code and data

Some recent publications

(all publications; Google scholar; Semantic scholar)
Morphological Priors for Probabilistic Neural Word Embeddings. Bhatia, Guthrie, and Eisenstein. EMNLP 2016.
Word embeddings are modeled as a latent variable, with morphology as a prior. [code]
Toward Socially-Infused Information Extraction: Embedding Authors, Mentions, and Entities. Yang, Chang, and Eisenstein. EMNLP 2016.
Author social network embeddings help to disambiguate whom they are talking about.
A latent variable recurrent neural network for discourse relation language models. Ji, Haffari, and Eisenstein. NAACL 2016.
A generative neural model of text and shallow discourse relations, yielding state-of-the-art performance on relation prediction in the PDTB and Switchboard, as well as discourse-informed language models.
Audience-modulated variation in online social media. Pavalanathan and Eisenstein. American Speech, May 2015.
Twitter users modulate their use of non-standard lexical items by the intended audience, with non-standard language used most in strong local ties. [preprint]
Systematic Patterning in Phonologically-Motivated Orthographic Variation. Eisenstein. Journal of Sociolinguistics, April 2015.
Patterns of phonological variation find their way into writing at a surprisingly deep level, with written analogues of spoken variables reflecting syntactic and phonological conditioning. [preprint]
One Vector is Not Enough: Entity-Augmented Distributed Semantics for Discourse Relations. Ji and Eisenstein. TACL, volume 3 (2015).
A two-pass recursive neural network for identifying implicit discourse relations in the Penn Discourse Treebank. To be presented at EMNLP 2015.

Recent and upcoming presentations

Professional service

  • Co-chair: 2016 EMNLP Workshop on NLP and Computational Social Science. Submit a paper or doctoral consortium abstract!
  • Co-chair: 2014 ACL Workshop on Language Technologies and Computational Social Science
  • Co-chair: 2013-2015 Atlanta Workshop on Computational Social Science
  • Area chair: NAACL 2016, ACL 2014, EACL 2013
  • Student research workshop, faculty advisor: NAACL 2016
  • Student awards coordinator: ICML 2013
  • Student volunteer chair: NAACL 2013
  • Tutorial co-chair: NAACL 2012
  • Program committees (past 12 months): ACL, CONLL, EMNLP, ICML, NAACL, NIPS. All these venues are open-access (OA).
  • Editorial boards: Linguistic Issues in Language Technology (OA), Language Variation series at Language Science Press (OA)
  • Journal Reviewing: Communications of the ACM, Computational Linguistics (OA), Journal of Machine Learning Research (OA), Journal of Artificial Intelligence Research (OA), Machine Learning Journal, Transactions of the Association of Computational Linguistics (OA), Journal of the American Statistical Association, Language in Society, Proceedings of the National Academy of Science, Digital Scholarship in the Humanities, ...
  • I prefer that my reviewing effort goes towards papers that everyone can read. So I usually will not review for venues that are non-OA, unless I submit papers there myself.


School of Interactive Computing
Georgia Institute of Technology
801 Atlantic Drive NW, Atlanta, GA 30308
Admin: Cynthia Jordan, +1 404 385 2827
Most preferred: jacobe (at) gatech (dot) edu