Publications

[2024][2023][2022][2021][2020][2019][2018][2017][2016][2015][2014][2013][2012][2011][2010][2009][2008][2007][2006][2005][2004][2002]

Latest arXiv Manuscripts

topics

Uriel Singer*, Amit Zohar*, Yuval Kirstain, Shelly Sheynin, Adam Polyak, Devi Parikh, Yaniv Taigman

* equal contribution

Video Editing via Factorized Diffusion Distillation (a.k.a, Emu Video Edit (EVE))

arXiv:2403.09334 2024

[project page]

AI + Creativity

topics

Rohit Girdhar*^, Mannat Singh*^, Andrew Brown*, Quentin Duval*, Samaneh Azadi*, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra

* equal first authors ^ equal technical contributions

Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning

arXiv:2311.10709, 2023

[project page]

AI + Creativity

topics

Xiaoliang Dai*, Ji Hou*, Chih-Yao Ma*, Sam Tsai*, Jialiang Wang*, Rui Wang*, Peizhao Zhang*, Simon Vandenhende, Xiaofang Wang, Abhimanyu Dubey, Matthew Yu, Abhishek Kadian, Filip Radenovic, Dhruv Mahajan, Kunpeng Li, Yue Zhao, Vladan Petrovic, Mitesh Kumar Singh, Simran Motwani, Yi Wen, Yiwen Song, Roshan Sumbaly^, Vignesh Ramanathan^, Zijian He^, Peter Vajda^, Devi Parikh^

* equal contribution, alphabetical order, ^ equal last authors

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

arXiv:2309.15807, 2023

AI + Creativity

2024 [back to top]

topics

Shelly Sheynin*, Adam Polyak*, Uriel Singer*, Yuval Kirstain*, Amit Zohar*, Oron Ashual, Devi Parikh, Yaniv Taigman

* equal contribution

Emu Edit: Precise Image Editing via Recognition and Generation Tasks

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

[project page]

AI + Creativity

2023 [back to top]

topics

Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta

Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation

International Conference on Computer Vision (ICCV), 2023

[project page]

AI + Creativity

topics

Samaneh Azadi*, Thomas Hayes*, Akbar Shah, Guan Pang, Devi Parikh, Sonal Gupta

* equal contribution

Text-Conditional Contextualized Avatars For Zero-Shot Personalization

arXiv:2304.07410, 2023

AI + Creativity

topics

Uriel Singer*, Shelly Sheynin*, Adam Polyak*, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman

* equal contribution

Text-To-4D Dynamic Scene Generation

International Conference on Machine Learning (ICML), 2023

[project page]

AI + Creativity

topics

Omri Avrahami, Thomas Hayes, Oran Gafni, Sonal Gupta, Yaniv Taigman, Devi Parikh, Dani Lischinski, Ohad Fried, Xi Yin

SpaText: Spatio-Textual Representation for Controllable Image Generation

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023

[project page]

AI + Creativity

topics

Uriel Singer*, Adam Polyak*, Thomas Hayes*, Xi Yin*, Jie An, Songyang Zhang, Qiyuan (Isabelle) Hu, Harry Yang, Oron Ashual, Oran Gafni, Devi Parikh*, Sonal Gupta*, Yaniv Taigman*

* Core contributors

Make-A-Video: Text-to-Video Generation without Text-Video Data

International Conference on Learning Representations (ICLR), 2023

[project page]

AI + Creativity

topics

Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez1, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi

AudioGen: Textually Guided Audio Generation

International Conference on Learning Representations (ICLR), 2023

[project page]

AI + Creativity

2022 [back to top]

topics

Thomas Hayes*, Songyang Zhang*, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh

* equal contribution

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

European Conference on Computer Vision (ECCV), 2022

[project page]

AI + Creativity

topics

Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman

Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

European Conference on Computer Vision (ECCV), 2022

[Illustrated story: The Little Red Boat][Illustrated story: New Adventures] [Blog post]

AI + Creativity

topics

Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh

Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer

European Conference on Computer Vision (ECCV), 2022

[project page]

AI + Creativity

topics

Ramya Srinivasan, Devi Parikh

Building Bridges: Generative Artworks to Explore AI Ethics

Ethical Considerations in Creative applications of Computer Vision (EC3V) Workshop at CVPR, 2022

AI + Creativity

topics

Samyak Datta, Sameer Dharur, Vincent Cartillier, Ruta Desai, Dhruv Batra, Devi Parikh

Episodic Memory Question Answering

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022 (Oral)

topics

Ayush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gokhan Tür, Devi Parikh, Dilek Hakkani-Tür

VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator

Findings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2022

2021 [back to top]

topics

Safinah Ali, Devi Parikh

Telling Creative Stories Using Generative Visual Aids

Machine Learning for Creativity and Design Workshop at Neural Information Processing Systems (NeuRIPS), 2021

AI + Creativity

topics

Gunjan Aggarwal, Devi Parikh

Dance2Music: Automatic Dance-driven Music Generation

Machine Learning for Creativity and Design Workshop at Neural Information Processing Systems (NeuRIPS), 2021

AI + Creativity

topics

Sasha Sheng*, Amanpreet Singh*, Vedanuj Goswami, Jose Alberto Lopez Magana, Wojciech Galuba, Devi Parikh, Douwe Kiela

* equal contribution

Human-Adversarial Visual Question Answering

Neural Information Processing Systems (NeurIPS), 2021

topics

Songwei Ge, Devi Parikh

Visual Conceptual Blending with Large-scale Language and Vision Models

International Conference on Computational Creativity (ICCC), 2021 (Oral)

AI + Creativity

topics

Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal

Contrast and Classify: Alternate Training for Robust VQA

International Conference on Computer Vision (ICCV), 2021

topics

Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, C. Lawrence Zitnick

ForceNet: A Graph Neural Network for Large-Scale Quantum Calculations

ICLR workshop on Deep Learning for Simulation, 2021 (Best Paper Award)

topics

Lowik Chanussot*, Abhishek Das*, Siddharth Goyal*, Thibaut Lavril*, Muhammed Shuaibi*, Morgane Riviére, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Aini Palizhati, Anuroop Sriram, Brandon Wood, Junwoong Yoon, Devi Parikh, C. Lawrence Zitnick, Zachary Ulissi

* equal contribution

The Open Catalyst 2020 (OC20) Dataset and Community Challenges

ACS Catalysis, 2021

[dataset][code][opencatalystproject.org]

topics

C. Lawrence Zitnick, Lowik Chanussot, Abhishek Das, Siddharth Goyal, Javier Heras-Domingo, Caleb Ho, Weihua Hu, Thibaut Lavril, Aini Palizhati, Morgane Riviére, Muhammed Shuaibi, Anuroop Sriram, Kevin Tran, Brandon Wood, Junwoong Yoon, Devi Parikh, Zachary Ulissi

An Introduction to Electrocatalyst Design using Machine Learning for Renewable Energy Storage

arXiv:2010.09435, 2020

[dataset][code][opencatalystproject.org]

topics

Sameer Dharur, Purva Tendulkar, Dhruv Batra, Devi Parikh, Ramprasaath R. Selvaraju

SOrT-ing in VQA : Contrastive Gradient Learning for Improved Consistency

Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2021

topics

Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach

KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

topics

Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani

VX2TEXT: End-to-End Learning of Video-Based Text GenerationFrom Multimodal Inputs

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021

topics

Songwei Ge, Vedanuj Goswami, C. Lawrence Zitnick, Devi Parikh

Creative Sketch Generation

International Conference on Learning Representations (ICLR), 2021

[demo][code and datasets][project page]

AI + Creativity

2020 [back to top]

topics

Samyak Datta, Oleksandr Maksymets, Judy Hoffman, Stefan Lee, Dhruv Batra, Devi Parikh

Integrating Egocentric Localization for More Realistic Point-Goal Navigation Agents

Conference on Robot Learning (CoRL), 2020

topics

Peter Anderson, Ayush Shrivastava, Joanne Truong, Arjun Majumdar, Devi Parikh, Dhruv Batra, Stefan Lee

Sim-to-Real Transfer for Vision-and-Language Navigation

Conference on Robot Learning (CoRL), 2020

topics

Michael Cogswell*, Jiasen Lu*, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra

* equal contribution

Dialog without Dialog: Learning Image-Discriminative Dialog Policies from Single-Shot Question Answering Data

Neural Information Processing Systems (NeurIPS), 2020

topics

Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James Rehg, Stefan Lee, Peter Anderson

Where Are You? Localization from Embodied Dialog

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

topics

Purva Tendulkar, Abhishek Das, Ani Kembhavi, Devi Parikh

Feel The Music: Automatically Generating A Dance For An Input Song

International Conference on Computational Creativity (ICCC), 2020 (Oral)

[dances][code][demo][Tech@Facebook article]

AI + Creativity

topics

Devi Parikh, C. Lawrence Zitnick

Exploring Crowd Co-creation Scenarios for Sketches

International Conference on Computational Creativity (ICCC), 2020

[sketching interface]

AI + Creativity

topics

Gunjan Aggarwal, Devi Parikh

Neuro-Symbolic Generative Art: A Preliminary Study

International Conference on Computational Creativity (ICCC), 2020

[examples][demo]

AI + Creativity

topics

X. Alice Li, Devi Parikh

Lemotif: An Affective Visual Journal Using Deep Neural Networks

International Conference on Computational Creativity (ICCC), 2020 (Oral)

[demo][code]

AI + Creativity

topics

Devi Parikh

Predicting A Creator’s Preferences In, and From, Interactive Generative Art

International Conference on Computational Creativity (ICCC), 2020

[art interface]

AI + Creativity

topics

Arjun Majumdar, Ayush Shrivastava, Stefan Lee, Peter Anderson, Devi Parikh, Dhruv Batra

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web

European Conference on Computer Vision (ECCV), 2020 (Spotlight)

topics

Vishvak Murahari, Dhruv Batra, Devi Parikh, Abhishek Das

Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline

European Conference on Computer Vision (ECCV), 2020

[code]

topics

Yash Kant, Dhruv Batra, Peter Anderson, Alex Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal

Spatially Aware Multimodal Transformers for TextVQA

European Conference on Computer Vision (ECCV), 2020

topics

Medhini Narasimhan, Erik Wijmans, Xinlei Chen, Trevor Darrell, Dhruv Batra, Devi Parikh, Amanpreet Singh

Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation

European Conference on Computer Vision (ECCV), 2020

topics

Nirbhay Modhe, Prithvijit Chattopadhyay, Mohit Sharma, Abhishek Das, Devi Parikh, Dhruv Batra, Ramakrishna Vedantam

IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL

International Joint Conference on Artificial Intelligence (IJCAI), 2020

topics

Devendra Singh Chaplot, Lisa Lee, Ruslan Salakhutdinov, Devi Parikh, Dhruv Batra

Embodied Multimodal Multitask Learning

International Joint Conference on Artificial Intelligence (IJCAI), 2020

[webpage]

topics

Amanpreet Singh*, Vedanuj Goswami*, Devi Parikh

* equal contribution

Are We Pretraining It Right? Digging Deeper Into Visio-linguistic Pretraining

arXiv:2004.08744, 2020

topics

Jiasen Lu*, Vedanuj Goswami*, Marcus Rohrbach, Devi Parikh, Stefan Lee

* equal contribution

12-in-1: Multi-Task Vision and Language Representation Learning

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020

[demo]

topics

Ramprasaath R. Selvaraju, Purva Tendulkar, Devi Parikh, Eric Horvitz, Marco Ribeiro, Besmira Nushi, Ece Kamar

SQuINTing at VQA Models: Interrogating VQA Models with Sub-Questions

IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020 (Oral)

topics

Erik Wijmans, Abhishek Kadian, Ari Morcos, Stefan Lee, Irfan Essa, Devi Parikh, Manolis Savva, Dhruv Batra

Decentralized Distributed PPO: Solving PointGoal Navigation

International Conference on Learning Representations (ICLR), 2020

2019 [back to top]

topics

Jiasen Lu, Dhruv Batra, Devi Parikh, Stefan Lee

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks

Neural Information Processing Systems (NeurIPS), 2019

topics

Remi Cadene, Corentin Dancette, Hedi Ben-younes, Matthieu Cord, Devi Parikh

RUBi: Reducing Unimodal Biases in Visual Question Answering

Neural Information Processing Systems (NeurIPS), 2019

topics

Peter Anderson*, Ayush Shrivastava*, Devi Parikh, Dhruv Batra, Stefan Lee

* equal contribution

Chasing Ghosts: Instruction Following as Bayesian State Tracking

Neural Information Processing Systems (NeurIPS), 2019

topics

Jianwei Yang, Zhile Ren, Hongyuan Zhu, Ji Lin, Chuang Gan, Devi Parikh

Cross-Channel Communication Networks

Neural Information Processing Systems (NeurIPS), 2019

topics

Vishvak Murahari, Prithvijit Chattopadhyay, Dhruv Batra, Devi Parikh, Abhishek Das

Improving Generative Visual Dialog by Answering Diverse Questions

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019

topics

Michael Cogswell, Jiasen Lu, Stefan Lee, Devi Parikh, Dhruv Batra

Emergence of Compositional Language with Deep Generational Transmission

arXiv:1904.09067, 2019

topics

Manolis Savva, Abhishek Kadian, Oleksandr Maksymets, Yili Zhao, Erik Wijmans, Bhavana Jain, Julian Straub, Jia Liu, Vladlen Koltun, Jitendra Malik, Devi Parikh, Dhruv Batra

Habitat: A Platform for Embodied AI Research

International Conference on Computer Vision (ICCV), 2019 (Best Paper Nominee)

[aihabitat.org]

topics