publications

2024

  1. Teaching Models to Balance Resisting and Accepting Persuasion
    Elias Stengel-Eskin, Peter Hase, and Mohit Bansal
    arXiv preprint arXiv:2410.14596 2024
  2. NeurIPS
    LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models
    Elias Stengel-Eskin, Peter Hase, and Mohit Bansal
    NeurIPS 2024
  3. TMLR
    Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?
    Peter Hase, Thomas Hofweber, Xiang Zhou, Elias Stengel-Eskin, and Mohit Bansal
    arXiv preprint arXiv:2406.19354 2024
  4. DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
    Zaid Khan, Elias Stengel-Eskin, Jaemin Cho, and Mohit Bansal
    arXiv preprint arXiv:2410.06215 2024
  5. ACM
    MIRACLE: An Online, Explainable Multimodal Interactive Concept Learning System
    Ansel Blume, Khanh Duy Nguyen, Zhenhailong Wang, Yangyi Chen, Michal Shlapentokh-Rothman, Xiaomeng Jin, Jeonghwan Kim, Zhen Zhu, and 3 more authors
    2024
  6. LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits
    Duy Nguyen, Archiki Prasad, Elias Stengel-Eskin, and Mohit Bansal
    arXiv preprint arXiv:2410.01735 2024
  7. MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning
    Justin Chih-Yao Chen, Archiki Prasad, Swarnadeep Saha, Elias Stengel-Eskin, and Mohit Bansal
    arXiv preprint arXiv:2409.12147 2024
  8. AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
    Han Wang, Archiki Prasad, Elias Stengel-Eskin, and Mohit Bansal
    arXiv preprint arXiv:2409.07394 2024
  9. System-1.x: Learning to balance fast and slow planning with language models
    Swarnadeep Saha, Archiki Prasad, Justin Chih-Yao Chen, Peter Hase, Elias Stengel-Eskin, and Mohit Bansal
    arXiv preprint arXiv:2407.14414 2024
  10. Are language models rational? The case of coherence norms and belief revision
    Thomas Hofweber, Peter Hase, Elias Stengel-Eskin, and Mohit Bansal
    arXiv preprint arXiv:2406.03442 2024
  11. See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding
    Amith Ananthram, Elias Stengel-Eskin, Carl Vondrick, Mohit Bansal, and Kathleen McKeown
    arXiv preprint arXiv:2406.11665 2024
  12. VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
    Ziyang Wang*, Shoubin Yu*, Elias Stengel-Eskin*, Jaehong Yoon, Feng Cheng, Gedas Bertasius, and Mohit Bansal
    arXiv preprint arXiv:2405.19209 2024
  13. ECCV
    Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
    David Wan, Jaemin Cho, Elias Stengel-Eskin, and Mohit Bansal
    ECCV 2024
  14. ICML
    Language-guided Skill Learning with Temporal Variational Inference
    Haotian Fu, Pratyusha Sharma, Elias Stengel-Eskin, George Konidaris, Nicolas Le Roux, Marc-Alexandre Côté, and Xingdi Yuan
    ICML 2024
  15. CoLLAs
    Sub-goal Distillation: A Method to Improve Small Language Agents
    Maryam Hashemzadeh, Elias Stengel-Eskin, Sarath Chandar, and Marc-Alexandre Côté
    Third Conference on Lifelong Learning Agents 2024
  16. Soft Self-Consistency Improves Language Model Agents
    Han Wang*, Archiki Prasad*, Elias Stengel-Eskin*, and Mohit Bansal
    ACL 2024
  17. NeurIPS
    GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
    Jinhao Duan, Renming Zhang, James Diffenderfer, Bhavya Kailkhura, Lichao Sun, Elias Stengel-Eskin, Mohit Bansal, Tianlong Chen, and 1 more author
    arXiv 2024
  18. ICML
    ReGAL: Refactoring Programs to Discover Generalizable Abstractions
    Elias Stengel-Eskin*, Archiki Prasad*, and Mohit Bansal
    ICML 2024
  19. ICML
    MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
    Justin Chih-Yao Chen, Swarnadeep Saha, Elias Stengel-Eskin, and Mohit Bansal
    ICML 2024
  20. ICLR
    Zero and Few-shot Semantic Parsing with Ambiguous Inputs
    Elias Stengel-Eskin, Kyle Rawlins, and Benjamin Van Durme
    ICLR 2024
  21. ICLR
    Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
    Archiki Prasad, Elias Stengel-Eskin, and Mohit Bansal
    ICLR 2024

2023

  1. Did You Mean...? Confidence-based Trade-offs in Semantic Parsing
    Elias Stengel-Eskin and Benjamin Van Durme
    EMNLP 2023
  2. Calibrated Interpretation: Confidence Estimation in Semantic Parsing
    Elias Stengel-Eskin and Benjamin Van Durme
    TACL 2023
  3. Why Did the Chicken Cross the Road? Rephrasing and Analyzing Ambiguous Questions in VQA
    Elias Stengel-Eskin, Jimena Guallar-Blasco, Yi Zhou, and Benjamin Van Durme
    ACL 2023
  4. Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning (CVPR Highlight)
    Zhuowan Li, Xingrui Wang, Elias Stengel-Eskin, Adam Kortylewski, Wufei Ma, Benjamin Van Durme, and Alan Yuille
    CVPR 2023

2022

  1. Automatic Evaluation of Chit-chat via Semantic Parsing
    Shalaka Vaidya, Elias Stengel-Eskin, and João Sedoc
    Mid-Atlantic Student Colloquium on Speech, Language and Learning 2022
  2. The Curious Case of Control
    Elias Stengel-Eskin and Benjamin Van Durme
    Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing 2022
  3. When More Data Hurts: A Troubling Quirk in Developing Broad-Coverage Natural Language Understanding Systems
    Elias Stengel-Eskin, Emmanouil Antonios Platanios, Adam Pauls, Sam Thomson, Hao Fang, Benjamin Van Durme, Jason Eisner, and Yu Su
    Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing 2022
  4. Visual Commonsense in Pretrained Unimodal and Multimodal Models
    Chenyu Zhang, Benjamin Van Durme, Zhuowan Li, and Elias Stengel-Eskin
    In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) Jul 2022

2021

  1. Guiding Multi-Step Rearrangement Tasks with Natural Language Instructions
    Elias Stengel-Eskin*, Andrew Hundt*, Zhuohong He, Aditya Murali, Nakul Gopalan, Matthew Gombolay, and Gregory D. Hager
    In 5th Annual Conference on Robot Learning Jul 2021
  2. Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
    Zhuowan Li, Elias Stengel-Eskin, Yixiao Zhang, Cihang Xie, Quan Tran, Benjamin Van Durme, and Alan Yuille
    In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Oct 2021
  3. Human-Model Divergence in the Handling of Vagueness
    Elias Stengel-Eskin, Jimena Guallar-Blasco, and Benjamin Van Durme
    In Proceedings of the 1st Workshop on Understanding Implicit and Underspecified Language Aug 2021
  4. Joint Universal Syntactic and Semantic Parsing
    Elias Stengel-Eskin, Kenton Murray, Sheng Zhang, Aaron Steven White, and Benjamin Van Durme
    Transactions of the Association for Computational Linguistics Aug 2021
  5. Exploring Human-Model Divergence Through Vagueness
    Elias Stengel-Eskin, Jimena Guallar-Blasco, and Benjamin Van Durme
    Proceedings of the Society for Computation in Linguistics Feb 2021
  6. Iterative Paraphrastic Augmentation with Discriminative Span Alignment
    Ryan Culkin, J. Edward Hu, Elias Stengel-Eskin, Guanghui Qin, and Benjamin Van Durme
    Transactions of the Association for Computational Linguistics May 2021

2020

  1. Universal Decompositional Semantic Parsing
    Elias Stengel-Eskin, Aaron Steven White, Sheng Zhang, and Benjamin Van Durme
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics May 2020
  2. The Universal Decompositional Semantics Dataset and Decomp Toolkit
    Aaron Steven White, Elias Stengel-Eskin, Siddharth Vashishtha, Venkata Subrahmanyan Govindarajan, Dee Ann Reisinger, Tim Vieira, Keisuke Sakaguchi, Sheng Zhang, and 3 more authors
    In Proceedings of The 12th Language Resources and Evaluation Conference May 2020

2019

  1. A Discriminative Neural Model for Cross-Lingual Word Alignment
    Elias Stengel-Eskin, Tzu-Ray Su, Matt Post, and Benjamin Van Durme
    In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) May 2019

2017

  1. Polyglot and Speech Corpus Tools: A System for Representing, Integrating, and Querying Speech Corpora.
    Michael McAuliffe, Elias Stengel-Eskin, Michaela Socolof, and Morgan Sonderegger
    In INTERSPEECH May 2017