Orals, Posters, Demos: Up-to-Date Schedule

Presentation Format

Main Conference - Long Papers

  • Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences Hongyi Liu, Qingyun Wang, Payam Karisani, Heng Ji

  • Text Diffusion Model with Encoder-Decoder Transformers for Sequence-to-Sequence Generation Hongyi Yuan, Zheng Yuan, Chuanqi Tan, Fei Huang, Songfang Huang

  • An Interactive Framework for Profiling News Media Sources Nikhil Mehta, Dan Goldwasser

  • Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study Yinghao Li, Haorui Wang, Chao Zhang

  • TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation Taeyang Yun, Hyunkuk Lim, Jeonghwan Lee, Min Song

  • Effective and Efficient Conversation Retrieval for Dialogue State Tracking with Implicit Text Summaries Seanie Lee, Jianpeng Cheng, Joris Driesen, Alexandru Coca, Anders Johannsen

  • Promptly Predicting Structures: The Return of Inference Maitrey Mehta, Valentina Pyatkin, Vivek Srikumar

  • On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQL Yutong Shao, Ndapa Nakashole

  • Extractive Summarization with Text Generator Thang Le, Anh Tuan Luu

  • Self-generated Replay Memories for Continual Neural Machine Translation Michele Resta, Davide Bacciu

  • Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran

  • Building Knowledge-Guided Lexica to Model Cultural Variation Shreya Havaldar, Salvatore Giorgi, Sunny Rai, Thomas Talhelm, Sharath Chandra Guntuku, Lyle Ungar

  • Adaptive Rank Selections for Low-Rank Approximation of Language Models Shangqian Gao, Ting Hua, Yen-Chang Hsu, Yilin Shen, Hongxia Jin

  • An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation Pengzhi Gao, Ruiqing Zhang, Zhongjun He, Hua Wu, Haifeng Wang

  • Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration Zhenhailong Wang, Shaoguang Mao, Wenshan Wu, Tao Ge, Furu Wei, Heng Ji

  • FPT: Feature Prompt Tuning for Few-shot Readability Assessment Ziyang Wang, Sanwoo Lee, Hsiu-Yuan Huang, Yunfang Wu

  • Self-Prompting Large Language Models for Zero-Shot Open-Domain QA Junlong Li, Jinyuan Wang, Zhuosheng Zhang, Hai Zhao

  • Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs? Kai Sun, Yifan Ethan Xu, Hanwen Zha, Yue Liu, Xin Luna Dong

  • $k$NN-ICL: Compositional Task-Oriented Parsing Generalization with Nearest Neighbor In-Context Learning Wenting Zhao, Ye Liu, Yao Wan, Yibo Wang, Qingyang Wu, Zhongfen Deng, Jiangshu Du, Shuaiqi Liu, Yunlong Xu, Philip S. Yu

  • ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems Jon Saad-Falcon, Omar Khattab, Christopher Potts, Matei Zaharia

  • DEMO: A Statistical Perspective for Efficient Image-Text Matching Fan Zhang, Xian-Sheng Hua, Chong Chen, Xiao Luo

  • SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning Bin Wang, Zhengyuan Liu, Xin Huang, Fangkai Jiao, Yang Ding, AiTi Aw, Nancy F. Chen

  • Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision Seongyun Lee, Sue Hyun Park, Yongrae Jo, Minjoon Seo

  • LLMs Are Few-Shot In-Context Low-Resource Language Learners Samuel Cahyawijaya, Holy Lovenia, Pascale Fung

  • Simple and effective data augmentation for compositional generalization Yuekun Yao, Alexander Koller

  • Rethinking Tabular Data Understanding with Large Language Models Tianyang Liu, Fei Wang, Muhao Chen

  • From Shortcuts to Triggers: Backdoor Defense with Denoised PoE Qin Liu, Fei Wang, Chaowei Xiao, Muhao Chen

  • BookSQL: A Large Scale Text-to-SQL Dataset for Accounting Domain Rahul Kumar, Amar Raja Dibbu, Shrutendra Harsola, Vignesh Subrahmaniam, Ashutosh Modi

  • FLAP: Flow-Adhering Planning with Constrained Decoding in LLMs Shamik Roy, Sailik Sengupta, Daniele Bonadiman, Saab Mansour, Arshit Gupta

  • DuRE: Dual Contrastive Self Training for Semi-Supervised Relation Extraction Yuxi Feng, Laks V. S. Lakshmanan

  • Query-Efficient Textual Adversarial Example Generation for Black-Box Attacks Zhen Yu, Zhenhua Chen, Kun He

  • Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles Kung-Hsiang Huang, Philippe Laban, Alexander Fabbri, Prafulla Kumar Choubey, Shafiq Joty, Caiming Xiong, Chien-Sheng Wu

  • AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation Haoyi Qiu, Kung-Hsiang Huang, Jingnong Qu, Nanyun Peng

  • PILOT: Legal Case Outcome Prediction with Case Law Lang Cao, Zifeng Wang, Cao Xiao, Jimeng Sun

  • ALoRA: Allocating Low-Rank Adaptation for Fine-tuning Large Language Models Zequan Liu, Jiawen Lin, Wei Zhu, Xing Tian

  • R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces Heng-Jui Chang, James R. Glass

  • InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning Large Language Models with Instructions Yifan Wang, Yafei Liu, Chufan Shi, Haoling Li, Chen Chen, Haonan Lu, Yujiu Yang

  • Language Agnostic Code Embeddings Saiteja Utpala, Alex Gu, Pin-Yu Chen

  • An Examination of the Compositionality of Large Generative Vision-Language Models Teli Ma, Rong Li, Junwei Liang

  • Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors Victoria Graf, Qin Liu, Muhao Chen

  • VertAttack: Taking Advantage of Text Classifiers’ Horizontal Vision Jonathan Rusert

  • KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular Margin Contrastive Learning Cong-Duy T Nguyen, Thong Thanh Nguyen, Xiaobao Wu, Anh Tuan Luu

  • The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language Jian Zhu, Changbing Yang, Farhan Samir, Jahurul Islam

  • Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks Yunqi Zhang, Songda Li, Chunyuan Deng, Luyi Wang, Hui Zhao

  • BeLLM: Backward Dependency Enhanced Large Language Model for Sentence Embeddings Xianming Li, Jing Li

  • Assessing Factual Reliability of Large Language Model Knowledge Weixuan Wang, Barry Haddow, Alexandra Birch, Wei Peng

  • Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems Zhenpeng Su, Xing W, Wei Zhou, Guangyuan Ma, Songlin Hu

  • Toolink: Linking Toolkit Creation and Using through Chain-of-Solving on Open-Source Model Cheng Qian, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu

  • Create! Don’t Repeat: A Paradigm Shift in Multi-Label Augmentation through Label Creative Generation Letian Wang, Xianggen Liu, Jiancheng Lv

  • Neurocache: Efficient Vector Retrieval for Long-range Language Modeling Ali Safaya, Deniz Yuret

  • Unveiling the Generalization Power of Fine-Tuned Large Language Models Haoran Yang, Yumeng Zhang, Jiaqi Xu, Hongyuan Lu, Pheng-Ann Heng, Wai Lam

  • A Closer Look at the Self-Verification Abilities of Large Language Models in Logical Reasoning Ruixin Hong, Hongming Zhang, Xinyu Pang, Dong Yu, Changshui Zhang

  • Exploring Self-supervised Logic-enhanced Training for Large Language Models Fangkai Jiao, Zhiyang Teng, Bosheng Ding, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty

  • MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning Debrup Das, Debopriyo Banerjee, Somak Aditya, Ashish Kulkarni

  • CoUDA: Coherence Evaluation via Unified Data Augmentation Dawei Zhu, Wenhao Wu, Yifan Song, Fangwei Zhu, Ziqiang Cao, Sujian Li

  • mEdIT: Multilingual Text Editing via Instruction Tuning Vipul Raheja, Dimitris Alikaniotis, Vivek Kulkarni, Bashar Alhafni, Dhruv Kumar

  • Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning Yunchao Zhang, Zonglin Di, Kaiwen Zhou, Cihang Xie, Xin Eric Wang

  • In-context Learning and Gradient Descent Revisited Gilad Deutch, Nadav Magar, Tomer Bar Natan, Guy Dar

  • Corpus Considerations for Annotator Modeling and Scaling Sarumi Olufunke Oluyemi, Béla Neuendorf, Joan Plepi, Lucie Flek, Jörg Schlötterer, Charles Welch

  • On Large Language Models’ Hallucination with Regard to Known Facts Che Jiang, Biqing Qi, Xiangyu Hong, Dayuan Fu, Yang Cheng, Fandong Meng, Mo Yu, Bowen Zhou, Jie Zhou

  • One-Size-Fits-All? Examining Expectations around What Constitute “Fair” or “Good” NLG System Behaviors Li Lucy, Su Lin Blodgett, Milad Shokouhi, Hanna Wallach, Alexandra Olteanu

  • Language Models Hallucinate, but May Excel at Fact Verification Jian Guan, Jesse Dodge, David Wadden, Minlie Huang, Hao Peng

  • A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference Resolution Bowen Ding, Qingkai Min, Shengkun Ma, Yingjie Li, Linyi Yang, Yue Zhang

  • TrojFSP: Trojan Insertion in Few-shot Prompt Tuning Mengxin Zheng, Jiaqi Xue, Xun Chen, Yanshan Wang, Qian Lou, Lei Jiang

  • Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models Yi Luo, Zhenghao Lin, YuHao Zhang, Jiashuo Sun, Chen Lin, Chengjin Xu, Xiangdong Su, Yelong Shen, Jian Guo, Yeyun Gong

  • X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across Paragraphs Juan Diego Rodriguez, Katrin Erk, Greg Durrett

  • Topics, Authors, and Institutions in Large Language Model Research: Trends from 17K arXiv Papers Rajiv Movva, Sidhika Balachandar, Kenneth L Peng, Gabriel Agostini, Nikhil Garg, Emma Pierson

  • $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, Exhibit and Extrapolate Zhehao Zhang, Yan Gao, Jian-Guang Lou

  • S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Model Fangyu Lei, Qian Liu, Yiming Huang, Shizhu He, Jun Zhao, Kang Liu

  • MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning Fuxiao Liu, Xiaoyang Wang, Wenlin Yao, Jianshu Chen, Kaiqiang Song, Sangwoo Cho, Yaser Yacoob, Dong Yu

  • Visual Grounding Helps Learn Word Meanings in Low-Data Regimes Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas

  • Accurate Knowledge Distillation via n-best Reranking Hendra Setiawan

  • AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao

  • SEMQA: Semi-Extractive Multi-Source Question Answering Tal Schuster, Adam D Lelkes, Haitian Sun, Jai Gupta, Jonathan Berant, William W. Cohen, Donald Metzler

  • Fine-Tuning Language Models with Reward Learning on Policy Hao Lang, Fei Huang, Yongbin Li

  • A Universal Dependencies Treebank for Highland Puebla Nahuatl Robert Pugh, Francis M. Tyers

  • COPAL-ID: Indonesian Language Reasoning with Local Culture and Nuances Haryo Akbarianto Wibowo, Erland Hilman Fuadi, Made Nindyatama Nityasya, Radityo Eko Prasojo, Alham Fikri Aji

  • IterAlign: Iterative Constitutional Alignment of Large Language Models Xiusi Chen, Hongzhi Wen, Sreyashi Nag, Chen Luo, Qingyu Yin, Ruirui Li, Zheng Li, Wei Wang

  • OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking Chia-Hsuan Lee, Hao Cheng, Mari Ostendorf

  • Multi-Operational Mathematical Derivations in Latent Space Marco Valentino, Jordan Meadows, Lan Zhang, Andre Freitas

  • Large Language Models Help Humans Verify Truthfulness – Except When They Are Convincingly Wrong Chenglei Si, Navita Goyal, Tongshuang Wu, Chen Zhao, Shi Feng, Hal Daumé III, Jordan Lee Boyd-Graber

  • XferBench: a Data-Driven Benchmark for Emergent Language Brendon Boldt, David R Mortensen

  • Evaluating Large Language Models as Generative User Simulators for Conversational Recommendation Se-eun Yoon, Zhankui He, Jessica Maria Echterhoff, Julian McAuley

  • A Symbolic Framework for Evaluating Mathematical Reasoning and Generalisation with Transformers Jordan Meadows, Marco Valentino, Damien Teney, Andre Freitas

  • Identifying Linear Relational Concepts in Large Language Models David Chanin, Anthony Hunter, Oana-Maria Camburu

  • Benchmark Transparency: Measuring the Impact of Data on Evaluation Venelin Kovatchev, Matthew Lease

  • JAMDEC: Unsupervised Authorship Obfuscation using Constrained Decoding over Small Language Models Jillian Fisher, Ximing Lu, Jaehun Jung, Liwei Jiang, Zaid Harchaoui, Yejin Choi

  • REST: Retrieval-Based Speculative Decoding Zhenyu He, Zexuan Zhong, Tianle Cai, Jason D. Lee, Di He

  • Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations Sihao Chen, Hongming Zhang, Tong Chen, Ben Zhou, Wenhao Yu, Dian Yu, Baolin Peng, Hongwei Wang, Dan Roth, Dong Yu

  • MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference Mobashir Sadat, Cornelia Caragea

  • Causal Inference for Human-Language Model Collaboration Bohan Zhang, Yixin Wang, Paramveer Dhillon

  • SELF-GUARD: Empower the LLM to Safeguard Itself Zezhong Wang, Fangkai Yang, Lu Wang, Pu Zhao, Hongru Wang, Liang Chen, Qingwei Lin, Kam-Fai Wong

  • COSIGN: Contextual Facts Guided Generation for Knowledge Graph Completion Jinpeng Li, Hang Yu, Xiangfeng Luo, Qian Liu

  • Toward Informal Language Processing: Knowledge of Slang in Large Language Models Zhewei Sun, Qian Hu, Rahul Gupta, Richard Zemel, Yang Xu

  • Ghostbuster: Detecting Text Ghostwritten by Large Language Models Vivek Verma, Eve Fleisig, Nicholas Tomlin, Dan Klein

  • End-to-End Beam Retrieval for Multi-Hop Question Answering Jiahao Zhang, Haiyang Zhang, Dongmei Zhang, Liu Yong, Shen Huang

  • Leveraging Generative Large Language Models with Visual Instruction and Demonstration Retrieval for Multimodal Sarcasm Detection Binghao Tang, Boda Lin, Haolong Yan, Si Li

  • Multi-Scale Prompt Memory-Augmented Model for Black-Box Scenarios Xiaojun Kuang, C. L. Philip Chen, Shuzhen Li, Tong Zhang

  • Ungrammatical-syntax-based In-context Example Selection for Grammatical Error Correction Chenming Tang, Fanyi Qu, Yunfang Wu

  • BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer Akari Asai, Sneha Kudugunta, Xinyan Velocity Yu, Terra Blevins, Hila Gonen, Machel Reid, Yulia Tsvetkov, Sebastian Ruder, Hannaneh Hajishirzi

  • TISE: A Tripartite In-context Selection Method for Event Argument Extraction Yanhe Fu, Yanan Cao, Qingyue Wang, Yi Liu

  • Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks Zhaofeng Wu, Linlu Qiu, Alexis Ross, Ekin Akyürek, Boyuan Chen, Bailin Wang, Najoung Kim, Jacob Andreas, Yoon Kim

  • TRUE-UIE: Two Universal Relations Unify Information Extraction Tasks Yucheng Wang, Bowen Yu, Yilin Liu, Shudong Lu

  • zrLLM: Zero-Shot Relational Learning on Temporal Knowledge Graphs with Large Language Models Zifeng Ding, Heling Cai, Jingpei Wu, Yunpu Ma, Ruotong Liao, Bo Xiong, Volker Tresp

  • Embodied Executable Policy Learning with Language-based Scene Summarization Jielin Qiu, Mengdi Xu, William Han, Seungwhan Moon, Ding Zhao

  • Metacognitive Prompting Improves Understanding in Large Language Models Yuqing Wang, Yun Zhao

  • MART: Improving LLM Safety with Multi-round Automatic Red-Teaming Suyu Ge, Chunting Zhou, Rui Hou, Madian Khabsa, Yi-Chia Wang, Qifan Wang, Jiawei Han, Yuning Mao

  • DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset Young-Jun Lee, Byungsoo Ko, Han-Gyu Kim, Jonghwan Hyeon, Ho-Jin Choi

  • Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models Keming Lu, Hongyi Yuan, Runji Lin, Junyang Lin, Zheng Yuan, Chang Zhou, Jingren Zhou

  • Automatic Generation of Model and Data Cards: A Step Towards Responsible AI Jiarui Liu, Wenkai Li, Zhijing Jin, Mona T. Diab

  • FUN with Fisher: Improving Generalization of Adapter-Based Cross-lingual Transfer with Scheduled Unfreezing Chen Cecilia Liu, Jonas Pfeiffer, Ivan Vulić, Iryna Gurevych

  • Are Multilingual LLMs Culturally-Diverse Reasoners? An Investigation into Multicultural Proverbs and Sayings Chen Cecilia Liu, Fajri Koto, Timothy Baldwin, Iryna Gurevych

  • The Colorful Future of LLMs: Evaluating and Improving LLMs as Emotional Supporters for Queer Youth Shir Lissak, Nitay Calderon, Geva Shenkman, Yaakov Ophir, Eyal Fruchter, Anat Brunstein Klomek, Roi Reichart

  • IPED: An Implicit Perspective for Relational Triple Extraction based on Diffusion Model Jianli Zhao, Changhao Xu, Bin Jiang

  • QualEval: Qualitative Evaluation for Model Improvement Vishvak Murahari, Ameet Deshpande, Peter Clark, Tanmay Rajpurohit, Ashish Sabharwal, Karthik R Narasimhan, Ashwin Kalyan

  • Quantum-inspired Language Model with Lindblad Master Equation and Interference Measurement for Sentiment Analysis Kehuan Yan, Peichao Lai, Yilei Wang

  • VisLingInstruct: Elevating Zero-Shot Learning in Multi-Modal Language Models with Autonomous Instruction Optimization Dongsheng Zhu, Daniel Tang, Weidong Han, Jinghui Lu, Yukun Zhao, Guoliang Xing, Junfeng Wang, Dawei Yin

  • A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily Peng Ding, Jun Kuang, Dan Ma, Xuezhi Cao, Yunsen Xian, Jiajun Chen, Shujian Huang

  • P$^3$Sum: Preserving Author’s Perspective in News Summarization with Diffusion Language Models Yuhan Liu, Shangbin Feng, Xiaochuang Han, Vidhisha Balachandran, Chan Young Park, Sachin Kumar, Yulia Tsvetkov

  • Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes Rose E Wang, Qingyang Zhang, Carly D Robinson, Susanna Loeb, Dorottya Demszky

  • RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization Dongqi Pu, Vera Demberg

  • Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation Yao Lu, Jiayi Wang, Raphael Tang, Sebastian Riedel, Pontus Stenetorp

  • ReTA: Recursively Thinking Ahead to Improve the Strategic Reasoning of Large Language Models Jinhao Duan, Shiqi Wang, James Diffenderfer, Lichao Sun, Tianlong Chen, Bhavya Kailkhura, Kaidi Xu

  • Fact Checking Beyond Training Set Payam Karisani, Heng Ji

  • Program-Aided Reasoners (Better) Know What They Know Anubha Kabra, Sanketh Rangreji, Yash Mathur, Aman Madaan, Emmy Liu, Graham Neubig

  • The Perspectivist Paradigm Shift: Assumptions and Challenges of Capturing Human Labels Eve Fleisig, Su Lin Blodgett, Dan Klein, Zeerak Talat

  • Principles from Clinical Research for NLP Model Generalization Aparna Elangovan, Jiayuan He, Yuan Li, Karin Verspoor

  • First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models Naomi Saphra, Eve Fleisig, Kyunghyun Cho, Adam Lopez

  • Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture

  • From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning Xuansheng Wu, Wenlin Yao, Jianshu Chen, Xiaoman Pan, Xiaoyang Wang, Ninghao Liu, Dong Yu

  • POLYIE: A Dataset of Information Extraction from Polymer Material Scientific Literature Jerry Junyang Cheung, Yuchen Zhuang, Yinghao Li, Pranav Shetty, Wantian Zhao, Sanjeev Grampurohit, Rampi Ramprasad, Chao Zhang

  • LLM-based Medical Assistant Personalization with Short- and Long-Term Memory Coordination Kai Zhang, Yangyang Kang, Fubang Zhao, Xiaozhong Liu

  • SumTra: A Differentiable Pipeline for Few-Shot Cross-Lingual Summarization Jacob Parnell, Inigo Jauregi Unanue, Massimo Piccardi

  • KTRL+F: Knowledge-Augmented In-Document Search Hanseok Oh, Haebin Shin, Miyoung Ko, Hyunji Lee, Minjoon Seo

  • How Well Do Large Language Models Truly Ground? Hyunji Lee, Se June Joo, Chaeeun Kim, Joel Jang, Doyoung Kim, Kyoung-Woon On, Minjoon Seo

  • ALBA: Adaptive Language-Based Assessments for Mental Health Vasudha Varadarajan, Sverker Sikström, Oscar Kjell, H. Schwartz

  • FREB-TQA: A Fine-Grained Robustness Evaluation Benchmark for Table Question Answering Wei Zhou, Mohsen Mesgar, Heike Adel, Annemarie Friedrich

  • MILL: Mutual Verification with Large Language Models for Zero-Shot Query Expansion Pengyue Jia, Yiding Liu, Xiangyu Zhao, Xiaopeng Li, Changying Hao, Shuaiqiang Wang, Dawei Yin

  • Efficient Benchmarking (of Language Models) Yotam Perlitz, Elron Bandel, Ariel Gera, Ofir Arviv, Liat Ein-Dor, Eyal Shnarch, Noam Slonim, Michal Shmueli-Scheuer, Leshem Choshen

  • ReFACT: Updating Text-to-Image Models by Editing the Text Encoder Dana Arad, Hadas Orgad, Yonatan Belinkov

  • A Likelihood Ratio Test of Genetic Relationship among Languages V.S.D.S. Mahesh Akavarapu, Arnab Bhattacharya

  • PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning Xuekai Zhu, Biqing Qi, Kaiyan Zhang, Xinwei Long, Zhouhan Lin, Bowen Zhou

  • MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks Sanchit Ahuja, Divyanshu Aggarwal, Varun Gumma, Ishaan Watts, Ashutosh Sathe, Millicent Ochieng, Rishav Hada, Prachi Jain, Mohamed Ahmed, Kalika Bali, Sunayana Sitaram

  • Unlocking Emergent Modularity in Large Language Models Zihan Qiu, Zeyu Huang, Jie Fu

  • A School Student Essay Corpus for Analyzing Interactions of Argumentative Structure and Quality Maja Stahl, Nadine Michel, Sebastian Kilsbach, Julian Schmidtke, Sara Rezat, Henning Wachsmuth

  • Adjusting Interpretable Dimensions in Embedding Space with Human Judgments Katrin Erk, Marianna Apidianaki

  • PatentEval: Understanding Errors in Patent Generation You Zuo, Kim Gerdes, Éric Villemonte de la Clergerie, Benoît Sagot

  • Contextual Refinement of Translations: Large Language Models for Sentence and Document-Level Post-Editing Sai Koneru, Miriam Exel, Matthias Huck, Jan Niehues

  • Metaphor Detection with Context Enhancement and Curriculum Learning Kaidi Jia, Rongsheng Li

  • What Causes the Failure of Explicit to Implicit Discourse Relation Recognition? Wei Liu, Stephen Wan, Michael Strube

  • UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions Siddhant Arora, Hayato Futami, Jee-weon Jung, Yifan Peng, Roshan Sharma, Yosuke Kashiwagi, Emiru Tsunoo, Karen Livescu, Shinji Watanabe

  • How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities Lingbo Mo, Boshi Wang, Muhao Chen, Huan Sun

  • Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models Yue Zhou, Yada Zhu, Diego Antognini, Yoon Kim, Yang Zhang

  • TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale Pengcheng Jiang, Cao Xiao, Zifeng Wang, Parminder Bhatia, Jimeng Sun, Jiawei Han

  • GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models Pengcheng Jiang, Jiacheng Lin, Zifeng Wang, Jimeng Sun, Jiawei Han

  • Curated Datasets and Neural Models for Machine Translation of Informal Registers between Mayan and Spanish Vernaculars Andrés Lou, Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Víctor M. Sánchez-Cartagena

  • The Effect of Data Partitioning Strategy on Model Generalizability: A Case Study of Morphological Segmentation Zoey Liu, Bonnie J Dorr

  • Measuring Entrainment in Spontaneous Code-switched Speech Debasmita Bhattacharya, Siying Ding, Alayna Nguyen, Julia Hirschberg

  • A Survey of Meaning Representations – From Theory to Practical Utility Zacchary Sadeddine, Juri Opitz, Fabian M. Suchanek

  • Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation Haozhe Zhao, Zefan Cai, Shuzheng Si, Liang Chen, Yufeng He, Kaikai An, Baobao Chang

  • Evaluating In-Context Learning of Libraries for Code Generation Arkil Patel, Siva Reddy, Dzmitry Bahdanau, Pradeep Dasigi

  • Visually-Aware Context Modeling for News Image Captioning Tingyu Qu, Tinne Tuytelaars, Marie-Francine Moens

  • Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning Athul Paul Jacob, Gabriele Farina, Jacob Andreas

  • TopicGPT: A Prompt-based Topic Modeling Framework Chau Minh Pham, Alexander Hoyle, Simeng Sun, Philip Resnik, Mohit Iyyer

  • ChatGPT as an Attack Tool: Stealthy Textual Backdoor Attack via Blackbox Generative Model Trigger Jiazhao Li, Yijin Yang, Zhuofeng Wu, V.G. Vinod Vydiswaran, Chaowei Xiao

  • Social Meme-ing: Measuring Linguistic Variation in Memes Naitian Zhou, David Jurgens, David Bamman

  • ExpertQA: Expert-Curated Questions and Attributed Answers Chaitanya Malaviya, Subin Lee, Sihao Chen, Elizabeth Sieber, Mark Yatskar, Dan Roth

  • What if you said that differently?: How Explanation Formats Affect Human Feedback Efficacy and User Perception Chaitanya Malaviya, Subin Lee, Dan Roth, Mark Yatskar

  • When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good Labels Weiyan Shi, Emily Dinan, Kurt Shuster, Jason E Weston, Jing Xu

  • Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages Nathaniel Romney Robinson, Raj Dabre, Ammon C Shurtz, Rasul Dent, Onenamiyi Onesi, Claire Bizon Monroc, Loïc Grobol, Hasan Muhammad, Ashi Garg, Naome A Etori, Vijay Murari Tiyyala, Olanrewaju Samuel, Matthew Dean Stutzman, Bismarck Bamfo Odoom, Sanjeev Khudanpur, Stephen D. Richardson, Kenton Murray

  • Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models Jiashu Xu, Mingyu Derek Ma, Fei Wang, Chaowei Xiao, Muhao Chen

  • Modeling Empathetic Alignment in Conversation Jiamin Yang, David Jurgens

  • Native Language Identification in Texts: A Survey Dhiman Goswami, Sharanya Thilagan, Kai North, Shervin Malmasi, Marcos Zampieri

  • LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models Yifan Yang, Jiajun Zhou, Ngai Wong, Zheng Zhang

  • Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding Chancharik Mitra, Abrar Anwar, Rodolfo Corona, Dan Klein, Trevor Darrell, Jesse Thomason

  • Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks Ting-Yun Chang, Jesse Thomason, Robin Jia

  • PromptFix: Few-shot Backdoor Removal via Adversarial Prompt Tuning Tianrong Zhang, Zhaohan Xi, Ting Wang, Prasenjit Mitra, Jinghui Chen

  • Comparing Explanation Faithfulness between Multilingual and Monolingual Fine-tuned Language Models Zhixue Zhao, Nikolaos Aletras

  • A Pretrainer’s Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity Shayne Longpre, Gregory Yauney, Emily Reif, Katherine Lee, Adam Roberts, Barret Zoph, Denny Zhou, Jason Wei, Kevin Robinson, David Mimno, Daphne Ippolito

  • Instructional Fingerprinting of Large Language Models Jiashu Xu, Fei Wang, Mingyu Derek Ma, Pang Wei Koh, Chaowei Xiao, Muhao Chen

  • Reinforced Multiple Instance Selection for Speaker Attribute Prediction Alireza Salkhordeh Ziabari, Ali Omrani, Parsa Hejabi, Preni Golazizian, Brendan Kennedy, Payam Piray, Morteza Dehghani

  • DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling Shikhar Tuli, Chi-Heng Lin, Yen-Chang Hsu, Niraj Jha, Yilin Shen, Hongxia Jin

  • Few-shot Knowledge Graph Relational Reasoning via Subgraph Adaptation Haochen Liu, Song Wang, Chen Chen, Jundong Li

  • Uncertainty Quantification for In-Context Learning of Large Language Models Chen Ling, Xujiang Zhao, Xuchao Zhang, Wei Cheng, Yanchi Liu, Yiyou Sun, Mika Oishi, Takao Osaki, Katsushi Matsuda, Jie Ji, Guangji Bai, Liang Zhao, Haifeng Chen

  • HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM Zhilin Wang, Yi Dong, Jiaqi Zeng, Virginia Adams, Makesh Narsimhan Sreedhar, Daniel Egert, Olivier Delalleau, Jane Polak Scowcroft, Neel Kant, Aidan M Swope, Oleksii Kuchaiev

  • A Preference-driven Paradigm for Enhanced Translation with Large Language Models Dawei Zhu, Sony Trenous, Xiaoyu Shen, Dietrich Klakow, Bill Byrne, Eva Hasler

  • Fair Abstractive Summarization of Diverse Perspectives Yusen Zhang, Nan Zhang, Yixin Liu, Alexander Fabbri, Junru Liu, Ryo Kamoi, Xiaoxin Lu, Caiming Xiong, Jieyu Zhao, Dragomir Radev, Kathleen McKeown, Rui Zhang

  • What Are We Measuring When We Evaluate Large Vision-Language Models? An Analysis of Latent Factors and Biases Anthony Tiong, Junqi Zhao, Boyang Li, Junnan Li, Steven Hoi, Caiming Xiong

  • Show Your Work with Confidence: Confidence Bands for Tuning Curves Nicholas Lourie, Kyunghyun Cho, He He

  • GRASP: A Disagreement Analysis Framework to Assess Group Associations in Perspectives Vinodkumar Prabhakaran, Christopher M Homan, Lora Aroyo, Aida Mostafazadeh Davani, Alicia Parrish, Alex Taylor, Mark Diaz, Ding Wang, Gregory Serapio-García

  • Event Causality Is Key to Computational Story Understanding Yidan Sun, Qin Chao, Boyang Li

  • Subspace Representations for Soft Set Operations and Sentence Similarities Yoichi Ishibashi, Sho Yokoi, Katsuhito Sudoh, Satoshi Nakamura

  • My Heart Skipped a Beat! Recognizing Expressions of Embodied Emotion in Natural Language Yuan Zhuang, Tianyu Jiang, Ellen Riloff

  • Low-Cost Generation and Evaluation of Dictionary Example Sentences Bill Cai, Ng Boon Liang Clarence, Daniel Tan Wee Liang, Shelvia Hotama

  • Making Language Models Better Tool Learners with Execution Feedback Shuofei Qiao, Honghao Gui, Chengfei Lv, Qianghuai Jia, Huajun Chen, Ningyu Zhang

  • Complex Claim Verification with Evidence Retrieved in the Wild Jifan Chen, Grace Kim, Aniruddh Sriram, Greg Durrett, Eunsol Choi

  • Multimodal Multi-loss Fusion Network for Sentiment Analysis Zehui Wu, Ziwei Gong, Jaywon Koo, Julia Hirschberg

  • Confronting LLMs with Traditional ML: Rethinking the Fairness of Large Language Models in Tabular Classifications Yanchen Liu, Srishti Gautam, Jiaqi Ma, Himabindu Lakkaraju

  • Analyzing the Use of Metaphors in News Editorials for Political Framing Meghdut Sengupta, Roxanne El Baff, Milad Alshomary, Henning Wachsmuth

  • SharpSeq: Empowering Continual Event Detection through Sharpness-Aware Sequential-task Learning Thanh-Thien Le, Viet Dao, Linh Van Nguyen, Thi-Nhung Nguyen, Linh Van Ngo, Thien Huu Nguyen

  • Dissecting Paraphrases: The Impact of Prompt Syntax and Supplementary Information on Knowledge Retrieval from Pretrained Language Models Stephan Linzbach, Dimitar Dimitrov, Laura Kallmeyer, Kilian Evang, Hajira Jabeen, Stefan Dietze

  • Know When To Stop: A Study of Semantic Drift in Text Generation Ava Spataru

  • Curriculum Masking in Vision-Language Pretraining to Maximize Cross Modal Interaction Kraig Yuheng Tou, Zijun Sun

  • Elote, Choclo and Mazorca: on the Varieties of Spanish Cristina España-Bonet, Alberto Barrón-Cedeño

  • Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks Chonghua Wang, Haodong Duan, Songyang Zhang, Dahua Lin, Kai Chen

  • A Zero-Shot Monolingual Dual Stage Information Retrieval System for Spanish Biomedical Systematic Literature Reviews Regina Ofori-Boateng, Magaly Aceves-Martins, Nirmalie Wiratunga, Carlos Francisco Moreno-Garcia

  • LayoutPointer: A Spatial-Context Adaptive Pointer Network for Visual Information Extraction Huang Siyuan, Yongping Xiong, Wu Guibin

  • Long-form evaluation of model editing Domenic Rosati, Robie Gonzales, Jinkun Chen, Xuemin Yu, Yahya Kayani, Frank Rudzicz, Hassan Sajjad

  • Analyzing the Role of Semantic Representations in the Era of Large Language Models Zhijing Jin, Yuen Chen, Fernando Gonzalez Adauto, Jiarui Liu, Jiayi Zhang, Julian Michael, Bernhard Schölkopf, Mona T. Diab

  • TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction Shuo Li, Sangdon Park, Insup Lee, Osbert Bastani

  • MapGuide: A Simple yet Effective Method to Reconstruct Continuous Language from Brain Activities Xinpei Zhao, Jingyuan Sun, Shaonan Wang, Jing Ye, Xhz, Chengqing Zong

  • On-the-fly Definition Augmentation of LLMs for Biomedical NER Monica Munnangi, Sergey Feldman, Byron C Wallace, Silvio Amir, Tom Hope, Aakanksha Naik

  • This Land is {Your, My} Land: Evaluating Geopolitical Bias in Language Models through Territorial Disputes Bryan Li, Samar Haider, Chris Callison-Burch

  • Set-Aligning Framework for Auto-Regressive Event Temporal Graph Generation Xingwei Tan, Yuxiang Zhou, Gabriele Pergola, Yulan He

  • LanguageFlow: Advancing Diffusion Language Generation with Probabilistic Flows Shujian Zhang, Lemeng Wu, Chengyue Gong, Xingchao Liu

  • Towards Improved Multi-Source Attribution for Long-Form Answer Generation Nilay Patel, Shivashankar Subramanian, Siddhant Garg, Pratyay Banerjee, Amita Misra

  • Synthetic Query Generation for Privacy-Preserving Deep Retrieval Systems using Differentially Private Language Models Aldo Gael Carranza, Rezsa Farahani, Natalia Ponomareva, Alexey Kurakin, Matthew Jagielski, Milad Nasr

  • Okay, Let’s Do This! Modeling Event Coreference with Generated Rationales and Knowledge Distillation Abhijnan Nath, Shadi Manafi Avari, Avyakta Chelle, Nikhil Krishnaswamy

  • Can Knowledge Graphs Reduce Hallucinations in LLMs?: A Survey Garima Agrawal, Tharindu Sandaruwan Kumarage, Zeyad Alghamdi, Huan Liu

  • Pedagogically aligned objectives create reliable automatic cloze tests Brian David Ondov, Kush Attal, Dina Demner-Fushman

  • Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning Kazuma Hashimoto, Karthik Raman, Michael Bendersky

  • LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models Chi Han, Qifan Wang, Hao Peng, Wenhan Xiong, Yu Chen, Heng Ji, Sinong Wang

  • CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants Albert Sun, Varun Nair, Elliot Schumacher, Anitha Kannan

  • Advancing Beyond Identification: Multi-bit Watermark for Large Language Models KiYoon Yoo, Wonhyuk Ahn, Nojun Kwak

  • HTCCN: Temporal Causal Convolutional Networks with Hawkes Process for Extrapolation Reasoning in Temporal Knowledge Graphs Tingxuan Chen, Jun Long, Liu Yang, Zidong Wang, Yongheng Wang, Xiongnan Jin

  • SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation Abe Bohan Hou, Jingyu Zhang, Tianxing He, Yichen Wang, Yung-Sung Chuang, Hongwei Wang, Lingfeng Shen, Benjamin Van Durme, Daniel Khashabi, Yulia Tsvetkov

  • Media Bias Detection Across Families of Language Models Iffat Maab, Edison Marrese-Taylor, Sebastian Padó, Yutaka Matsuo

  • Better Zero-Shot Reasoning with Role-Play Prompting Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xin Zhou, Enzhi Wang, Xiaohang Dong

  • Event-Content-Oriented Dialogue Generation in Short Video Fenghua Cheng, Xue Li, Zi Huang, Jinxiang Wang, Sen Wang

  • DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping Yongrui Chen, Haiyun Jiang, Xinting Huang, Shuming Shi, Guilin Qi

  • Beyond Borders: Investigating Cross-Jurisdiction Transfer in Legal Case Summarization Santosh T.Y.S.S, Vatsal Venkatkrishna, Saptarshi Ghosh, Matthias Grabmair

  • EDC: Effective and Efficient Dialog Comprehension For Dialog State Tracking Qifan Lu, Bhaskar Ramasubramanian, Radha Poovendran

  • Automatic Restoration of Diacritics for Speech Data Sets Sara Shatnawi, Sawsan Alqahtani, Hanan Aldarmaki

  • XNLIeu: a dataset for cross-lingual NLI in Basque Maite Heredia Arribas, Julen Etxaniz, Muitze Zulaika, Xabier Saralegi, Jeremy Barnes, Aitor Soroa

  • MDR: Model-Specific Demonstration Retrieval at Inference Time for In-Context Learning Huazheng Wang, Jinming Wu, Haifeng Sun, Zixuan Xia, Daixuan Cheng, Jingyu Wang, Qi Qi, Jianxin Liao

  • Exploring Cross-Cultural Differences in English Hate Speech Annotations: From Dataset Construction to Analysis Nayeon Lee, Chani Jung, Junho Myung, Jiho Jin, Jose Camacho-Collados, Juho Kim, Alice Oh

  • Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding Zheng Zhao, Emilio Monti, Jens Lehmann, Haytham Assem

  • Generalizable Sarcasm Detection is Just Around the Corner, of Course! Hyewon Jang, Diego Frassinelli

  • Encoding of lexical tone in self-supervised models of spoken language Gaofei Shen, Michaela Watkins, Afra Alishahi, Arianna Bisazza, Grzegorz Chrupała

  • A Systematic Comparison of Contextualized Word Embeddings for Lexical Semantic Change Francesco Periti, Nina Tahmasebi

  • iACOS: Advancing Implicit Sentiment Extraction with Informative and Adaptive Negative Examples Xiancai Xu, Jia-Dong Zhang, Lei Xiong, Zhishang Liu

  • Rectifying Demonstration Shortcut in In-Context Learning Joonwon Jang, Sanghwan Jang, Wonbin Kweon, Minjin Jeon, Hwanjo Yu

  • Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark Stephen Mayhew, Terra Blevins, Shuheng Liu, Marek Suppa, Hila Gonen, Joseph Marvin Imperial, Börje F. Karlsson, Peiqin Lin, Nikola Ljubešić, Lester James Validad Miranda, Barbara Plank, Arij Riabi, Yuval Pinter

  • ODD: A Benchmark Dataset for the Natural Language Processing Based Opioid Related Aberrant Behavior Detection Sunjae Kwon, Xun Wang, Weisong Liu, Emily Druhl, Minhee L. Sung, Joel I. Reisman, Wenjun Li, Robert D Kerns, William C. Becker, Hong Yu

  • A Comprehensive Study of Gender Bias in Chemical Named Entity Recognition Models Xingmeng Zhao, Ali Shahzad Niazi, Anthony Rios

  • The Promises and Pitfalls of Using Language Models to Measure Instruction Quality in Education Paiheng Xu, Jing Liu, Nathan D Jones, Julie Cohen, Wei Ai

  • Differentially Private Next-Token Prediction of Large Language Models James Flemings, Meisam Razaviyayn, Murali Annavaram

  • Improving Adversarial Data Collection by Supporting Annotators: Lessons from GAHD, a German Hate Speech Dataset Janis Goldzycher, Paul Rottger, Gerold Schneider

  • Memory Augmented Language Models through Mixture of Word Experts Cicero Nogueira dos Santos, James Lee-Thorp, Isaac Noble, Chung-Ching Chang, David Uthus

  • Impossible Distillation for Paraphrasing and Summarization: How to Make High-quality Lemonade out of Small, Low-quality Model Jaehun Jung, Peter West, Liwei Jiang, Faeze Brahman, Ximing Lu, Jillian Fisher, Taylor Sorensen, Yejin Choi

  • TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization Liyan Tang, Igor Shalyminov, Amy Wing-mei Wong, Jon Burnsky, Jake W. Vincent, Yuan Yang, Siffi Singh, Song Feng, Hwanjun Song, Hang Su, Lijia Sun, Yi Zhang, Saab Mansour, Kathleen McKeown

  • MOKA: Moral Knowledge Augmentation for Moral Event Extraction Xinliang Frederick Zhang, Winston Wu, Nicholas Beauchamp, Lu Wang

  • Fixing Rogue Memorization in Many-to-One Multilingual Translators of Extremely-Low-Resource Languages by Rephrasing Training Samples Paulo Cavalin, Pedro Henrique Leite da Silva Pires Domingues, Claudio Pinhanez, Julio Nogima

  • Backdoor Attacks on Multilingual Machine Translation Jun Wang, Qiongkai Xu, Xuanli He, Benjamin I. P. Rubinstein, Trevor Cohn

  • Personalized Jargon Identification for Enhanced Interdisciplinary Communication Yue Guo, Joseph Chee Chang, Maria Antoniak, Erin Bransom, Trevor Cohen, Lucy Lu Wang, Tal August

  • Flames: Benchmarking Value Alignment of LLMs in Chinese Kexin Huang, Xiangyang Liu, Qianyu Guo, Tianxiang Sun, Jiawei Sun, Yaru Wang, Zeyang Zhou, Yixu Wang, Yan Teng, Xipeng Qiu, Yingchun Wang, Dahua Lin

  • Mitigating Bias for Question Answering Models by Tracking Bias Influence Mingyu Derek Ma, Jiun-Yu Kao, Arpit Gupta, Yu-Hsiang Lin, Wenbo Zhao, Tagyoung Chung, Wei Wang, Kai-Wei Chang, Nanyun Peng

  • Extending CLIP’s Image-Text Alignment to Referring Image Segmentation Seoyeon Kim, Minguk Kang, Dongwon Kim, Jaesik Park, Suha Kwak

  • Generating Attractive and Authentic Copywriting from Customer Reviews Yu-Xiang Lin, Wei-Yun Ma

  • Effective Long-Context Scaling of Foundation Models Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, Karthik Abinav Sankararaman, Barlas Oguz, Madian Khabsa, Han Fang, Yashar Mehdad, Sharan Narang, Kshitiz Malik, Angela Fan, Shruti Bhosale, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma

  • Empowering Diffusion Models on the Embedding Space for Text Generation Zhujin Gao, Junliang Guo, Xu Tan, Yongxin Zhu, Fang Zhang, Jiang Bian, Linli Xu

  • Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback Yu Xia, Tong Yu, Zhankui He, Handong Zhao, Julian McAuley, Shuai Li

  • Fake Alignment: Are LLMs Really Aligned Well? Yixu Wang, Yan Teng, Kexin Huang, Chengqi Lyu, Songyang Zhang, Wenwei Zhang, Xingjun Ma, Yu-Gang Jiang, Yu Qiao, Yingchun Wang

  • Visually Guided Generative Text-Layout Pre-training for Document Intelligence Zhiming Mao, Haoli Bai, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong

  • HILL: Hierarchy-aware Information Lossless Contrastive Learning for Hierarchical Text Classification He Zhu, Junran Wu, Ruomei Liu, Yue Hou, Ze Yuan, Shangzhe Li, Yicheng Pan, Ke Xu

  • Investigating the Emergent Audio Classification Ability of ASR Foundation Models Rao Ma, Adian Liusie, Mark Gales, Kate Knill

  • In-context Learning Generalizes, But Not Always Robustly: The Case of Syntax Aaron Mueller, Albert Webson, Jackson Petty, Tal Linzen

  • Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao

  • Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech Dena Mujtaba, Nihar R. Mahapatra, Megan Arney, J Scott Yaruss, Hope Gerlach-Houck, Caryn Herring, Jia Bin

  • MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification Chadi Helwe, Tom Calamai, Pierre-Henri Paris, Chloé Clavel, Fabian M. Suchanek

  • Diffusion Glancing Transformer for Parallel Sequence-to-Sequence Learning Lihua Qian, Mingxuan Wang, Yang Liu, Hao Zhou

  • No Context Needed: Contextual Quandary In Idiomatic Reasoning With Pre-Trained Language Models Kellen Tan Cheng, Suma Bhat

  • Multi-stage Retrieve and Re-rank Model for Automatic Medical Coding Recommendation Xindi Wang, Robert Mercer, Frank Rudzicz

  • Anisotropy is Not Inherent to Transformers Anemily Machina, Robert Mercer

  • Finding Replicable Human Evaluations via Stable Ranking Probability Parker Riley, Daniel Deutsch, George Foster, Viresh Ratnakar, Ali Dabirmoghaddam, Markus Freitag

  • Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections Yuanpu Cao, Bochuan Cao, Jinghui Chen

  • Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts Sai Ashish Somayajula, Youwei Liang, Li Zhang, Abhishek Singh, Pengtao Xie

  • Detecting Bipolar Disorder from Misdiagnosed Major Depressive Disorder with Mood-Aware Multi-Task Learning Daeun Lee, Hyolim Jeon, Sejung Son, Chaewon Park, Ji hyun An, Seungbae Kim, Jinyoung Han

  • Leveraging Code to Improve In-Context Learning for Semantic Parsing Ben Bogin, Shivanshu Gupta, Peter Clark, Ashish Sabharwal

  • Improving Pre-trained Language Model Sensitivity via Mask Specific losses: A case study on Biomedical NER Micheal Abaho, Danushka Bollegala, Gary Leeming, Dan W Joyce, Iain Edward Buchan

  • Language Models Implement Simple Word2Vec-style Vector Arithmetic Jack Merullo, Carsten Eickhoff, Ellie Pavlick

  • AutoLoRA: Automatically Tuning Matrix Ranks in Low-Rank Adaptation Based on Meta Learning Ruiyi Zhang, Rushi Qiang, Sai Ashish Somayajula, Pengtao Xie

  • SportQA: A Benchmark for Sports Understanding in Large Language Models Haotian Xia, Zhengbang Yang, Yuqing Wang, Rhys Tracy, Yun Zhao, Dongdong Huang, Zezhi Chen, Yan Zhu, Yuan-fang Wang, Weining Shen

  • Revisiting subword tokenization: A case study on affixal negation in large language models Thinh Hung Truong, Yulia Otmakhova, Karin Verspoor, Trevor Cohn, Timothy Baldwin

  • Generating Mental Health Transcripts with SAPE (Spanish Adaptive Prompt Engineering) Daniel Cabrera Lozoya, Alejandro Casar Berazaluce, Juan Alberto Barajas Perches, Eloy Hernández Lúa, Mike Conway, Simon D’Alfonso

  • Where are you from? Geolocating Speech and Applications to Language Identification Patrick Foley, Matthew Wiesner, Bismarck Bamfo Odoom, Leibny Paola Garcia Perera, Kenton Murray, Philipp Koehn

  • Teaching Language Models to Self-Improve through Interactive Demonstrations Xiao Yu, Baolin Peng, Michel Galley, Jianfeng Gao, Zhou Yu

  • MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets Hossein Aboutalebi, Hwanjun Song, Yusheng Xie, Arshit Gupta, Lijia Sun, Hang Su, Igor Shalyminov, Nikolaos Pappas, Siffi Singh, Saab Mansour

  • Zero-shot Generative Linguistic Steganography Ke Lin, Yiyang Luo, Zijian Zhang, Luo Ping

  • Does GPT-4 pass the Turing test? Cameron Robert Jones, Ben Bergen

  • Polarity Calibration for Opinion Summarization Yuanyuan Lei, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Ruihong Huang, Dong Yu

  • Sentence-level Media Bias Analysis with Event Relation Graph Yuanyuan Lei, Ruihong Huang

  • EMONA: Event-level Moral Opinions in News Articles Yuanyuan Lei, Md Messal Monem Miah, Ayesha Qamar, Sai Ramana Reddy, Jonathan Tong, Haotian Xu, Ruihong Huang

  • DLM: A Decoupled Learning Model for Long-tailed Polyphone Disambiguation in Mandarin Beibei Gao, Yangsen Zhang, Ga Xiang, Yushan Jiang

  • You don’t need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments Bangzhao Shu, Lechen Zhang, Minje Choi, Lavinia Dunagan, Lajanugen Logeswaran, Moontae Lee, Dallas Card, David Jurgens

  • CASA: Causality-driven Argument Sufficiency Assessment Xiao Liu, Yansong Feng, Kai-Wei Chang

  • MacGyver: Are Large Language Models Creative Problem Solvers? Yufei Tian, Abhilasha Ravichander, Lianhui Qin, Ronan Le Bras, Raja Marjieh, Nanyun Peng, Yejin Choi, Thomas L. Griffiths, Faeze Brahman

  • To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages Benedikt Ebing, Goran Glavaš

  • Enhancing Large Language Models Against Inductive Instructions with Dual-critique Prompting Rui Wang, Hongru Wang, Fei Mi, Boyang Xue, Yi Chen, Kam-Fai Wong, Ruifeng Xu

  • GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer Urchade Zaratiana, Nadi Tomeh, Pierre Holat, Thierry Charnois

  • XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models Paul Rottger, Hannah Rose Kirk, Bertie Vidgen, Giuseppe Attanasio, Federico Bianchi, Dirk Hovy

  • Carpe diem: On the Evaluation of World Knowledge in Lifelong Language Models Yujin Kim, Jaehong Yoon, Seonghyeon Ye, Sangmin Bae, Namgyu Ho, Sung Ju Hwang, Se-Young Yun

  • Fine-grained Gender Control in Machine Translation with Large Language Models Minwoo Lee, Hyukhun Koh, Minsung Kim, Kyomin Jung

  • DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade Zefan Cai, Xin Zheng, Tianyu Liu, Haoran Meng, Jiaqi Han, Gang Yuan, Binghuai Lin, Baobao Chang, Yunbo Cao

  • LLatrieval: LLM-Verified Retrieval for Verifiable Generation Xiaonan Li, Changtai Zhu, Linyang Li, Zhangyue Yin, Tianxiang Sun, Xipeng Qiu

  • Mapping Long-term Causalities in Psychiatric Symptomatology and Life Events from Social Media Siyuan Chen, Meilin Wang, Minghao Lv, Zhiling Zhang, Juqianqian, Dejiyangla, Yujia Peng, Kenny Q. Zhu, Mengyue Wu

  • Multimodal Chart Retrieval: A Comparison of Text, Table and Image Based Approaches Averi Nowak, Francesco Piccinno, Yasemin Altun

  • Retrieval Helps or Hurts? A Deeper Dive into the Efficacy of Retrieval Augmentation to Language Models Seiji Maekawa, Hayate Iso, Sairam Gurajada, Nikita Bhutani

  • AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer

  • Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness Ashim Gupta, Rishanth Rajendhran, Nathan Stringham, Vivek Srikumar, Ana Marasovic

  • Sequential Compositional Generalization in Multimodal Models Semih Yagcioglu, Osman Batur İnce, Aykut Erdem, Erkut Erdem, Desmond Elliott, Deniz Yuret

  • Generating Uncontextualized and Contextualized Questions for Document-Level Event Argument Extraction Md Nayem Uddin, Enfa Rose George, Eduardo Blanco, Steven Corman

  • Evidence-Driven Retrieval Augmented Response Generation for Online Misinformation Zhenrui Yue, Huimin Zeng, Yimeng Lu, Lanyu Shang, Yang Zhang, Dong Wang

  • Open-Vocabulary Federated Learning with Multimodal Prototyping Huimin Zeng, Zhenrui Yue, Dong Wang

  • Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning Xiao Li, Yong Jiang, Shen Huang, Pengjun Xie, Gong Cheng, Fei Huang

  • Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense Siqi Shen, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Soujanya Poria, Rada Mihalcea

  • Code Models are Zero-shot Precondition Reasoners Lajanugen Logeswaran, Sungryull Sohn, Yiwei Lyu, Anthony Zhe Liu, Dong-Ki Kim, Dongsub Shim, Moontae Lee, Honglak Lee

  • Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding Suyoung Kim, Jiyeon Hwang, Ho-Young Jung

  • Do Large Language Models Rank Fairly? An Empirical Study on the Fairness of LLMs as Rankers Yuan Wang, Xuyang Wu, Hsin-Tai Wu, Zhiqiang Tao, Yi Fang

  • TabSQLify: Enhancing Reasoning Capabilities of LLMs Through Table Decomposition Md Mahadi Hasan Nahid, Davood Rafiei

  • Contextual Label Projection for Cross-Lingual Structured Prediction Tanmay Parekh, I-Hung Hsu, Kuan-Hao Huang, Kai-Wei Chang, Nanyun Peng

  • Event Detection from Social Media for Epidemic Prediction Tanmay Parekh, Anh Mac, Jiarui Yu, Yuxuan Dong, Syed Shahriar, Bonnie Liu, Eric J Yang, Kuan-Hao Huang, Wei Wang, Nanyun Peng, Kai-Wei Chang

  • RESPROMPT: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models Song Jiang, Zahra Shakeri, Aaron Chan, Maziar Sanjabi, Hamed Firooz, Yinglong Xia, Bugra Akyildiz, Yizhou Sun, Jinchao Li, Qifan Wang, Asli Celikyilmaz

  • BPE-knockout: Pruning Pre-existing BPE Tokenisers with Backwards-compatible Morphological Semi-supervision Thomas Bauwens, Pieter Delobelle

  • How are Prompts Different in Terms of Sensitivity? Sheng Lu, Hendrik Schuff, Iryna Gurevych

  • LSTDial: Enhancing Dialogue Generation via Long- and Short-Term Measurement Feedback Guanghui Ye, Huan Zhao, Zixing Zhang, Xupeng Zha, Zhihua Jiang

  • The ART of LLM Refinement: Ask, Refine, and Trust Kumar Shridhar, Koustuv Sinha, Andrew Cohen, Tianlu Wang, Ping Yu, Ramakanth Pasunuru, Mrinmaya Sachan, Jason E Weston, Asli Celikyilmaz

  • Modularized Multilingual NMT with Fine-grained Interlingua Sungjun Lim

  • ParallelPARC: A Scalable Pipeline for Generating Natural-Language Analogies Oren Sultan, Yonatan Bitton, Ron Yosef, Dafna Shahaf

  • AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content Shuyang Cao, Lu Wang

  • NLP Systems That Can’t Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps Kristina Gligoric, Myra Cheng, Lucia Zheng, Esin Durmus, Dan Jurafsky

  • Debiasing with Sufficient Projection: A General Theoretical Framework for Vector Representations Enze Shi, Lei Ding, Linglong Kong, Bei Jiang

  • Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection Jianfeng He, Hang Su, Jason Cai, Igor Shalyminov, Hwanjun Song, Saab Mansour

  • AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages Jiayi Wang, David Ifeoluwa Adelani, Sweta Agrawal, Marek Masiak, Ricardo Rei, Eleftheria Briakou, Marine Carpuat, Xuanli He, Sofia Bourhim, Andiswa Bukula, Muhidin A. Mohamed, Temitayo Olatoye, Hamam Mokayed, Christine Mwase, Wangui Kimotho, Foutse Yuehgoh, Anuoluwapo Aremu, Jessica Ojo, Shamsuddeen Hassan Muhammad, Salomey Osei, Abdul-Hakeem Omotayo, Chiamaka Ijeoma Chukwuneke, Perez Ogayo, Oumaima Hourrane, Salma El Anigri, Lolwethu Ndolela, Thabiso Mangwana, Shafie Abdi Mohamed, Hassan Ayinde, Oluwabusayo Olufunke Awoyomi, Lama Alkhaled, Sana Sabah Al-Azzawi, Naome A Etori, Millicent Ochieng, Clemencia Siro, Njoroge Kiragu, Eric Muchiri, Wangari Kimotho, Toadoum Sari Sakayo, Lyse Naomi Momo Wamba, Daud Abolade, Simbiat Ajao, Tosin Adewumi, Iyanuoluwa Shode, Ricky Sambo Macharm, Ruqayya Nasir Iro, Saheed Salahudeen Abdullahi, Stephen Edward Moore, Bernard Opoku, Zainab Akinjobi, Abeeb Afolabi, Nnaemeka Casmir Obiefuna, Onyekachi Ogbu, Sam Brian Ochieng, Verrah Akinyi Otiende, Chinedu Emmanuel Mbonu, Yao Lu, Pontus Stenetorp

  • TableLlama: Towards Open Large Generalist Models for Tables Tianshu Zhang, Xiang Yue, Yifei Li, Huan Sun

  • PEMA: An Offsite-Tunable Plug-in External Memory Adaptation for Language Models HyunJin Kim, Young Jin Kim, JinYeong Bak

  • Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection Jun Yan, Vikas Yadav, Shiyang Li, Lichang Chen, Zheng Tang, Hai Wang, Vijay Srinivasan, Xiang Ren, Hongxia Jin

  • Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models Shuaijie She, Shujian Huang, Xingyun Wang, Yanke Zhou, Jiajun Chen

  • Multilingual Pretraining and Instruction Tuning Improve Cross-Lingual Knowledge Alignment, But Only Shallowly Changjiang Gao, Hongda Hu, Peng Hu, Jiajun Chen, Jixing Li, Shujian Huang

  • A Study on the Calibration of In-context Learning Hanlin Zhang, YiFan Zhang, Yaodong Yu, Dhruv Madeka, Dean Foster, Eric P. Xing, Himabindu Lakkaraju, Sham M. Kakade

  • DialogBench: Evaluating LLMs as Human-like Dialogue Systems Jiao Ou, Junda Lu, Che Liu, Yihong Tang, Fuzheng Zhang, Di Zhang, Kun Gai

  • GINopic: Topic Modeling with Graph Isomorphism Network Suman Adhya, Debarshi Kumar Sanyal

  • CMB: A Comprehensive Medical Benchmark in Chinese Xidong Wang, Guiming Hardy Chen, Song Dingjie, Zhang Zhiyi, Zhihong Chen, Qingying Xiao, Junying Chen, Feng Jiang, Jianquan Li, Xiang Wan, Benyou Wang, Haizhou Li

  • Massive End-to-end Speech Recognition Models with Time Reduction Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N Sainath, Pedro J Moreno Mengibar

  • SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics Arash Ardakani, Altan Haan, Shangyin Tan, Doru Thom Popovici, Alvin Cheung, Costin Iancu, Koushik Sen

  • Effective Large Language Model Adaptation for Improved Grounding and Citation Generation Xi Ye, Ruoxi Sun, Sercan O Arik, Tomas Pfister

  • Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models Yijia Shao, Yucheng Jiang, Theodore A Kanell, Peter Xu, Omar Khattab, Monica Lam

  • Grounding Gaps in Language Model Generations Omar Shaikh, Kristina Gligoric, Ashna Khetan, Matthias Gerstgrasser, Diyi Yang, Dan Jurafsky

  • When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale Christos Baziotis, Biao Zhang, Alexandra Birch, Barry Haddow

  • ContraSim – Analyzing Neural Representations Based on Contrastive Learning Adir Rahamim, Yonatan Belinkov

  • Universal Prompt Optimizer for Safe Text-to-Image Generation Zongyu Wu, Hongcheng Gao, Yueze Wang, Xiang Zhang, Suhang Wang

  • Language Model Based Unsupervised Dependency Parsing with Conditional Mutual Information and Grammatical Constraints Junjie Chen, Xiangheng He, Yusuke Miyao

  • The Bias Amplification Paradox in Text-to-Image Generation Preethi Seshadri, Sameer Singh, Yanai Elazar

  • Grammar-based Data Augmentation for Low-Resource Languages: The Case of Guarani-Spanish Neural Machine Translation Agustín Lucas, Alexis Baladón Ferreira de Araujo, Victoria Pardiñas, Marvin M. Agüero-Torales, Santiago Góngora, Luis Chiruzzo

  • Global Gallery: The Fine Art of Painting Culture Portraits through Multilingual Instruction Tuning Anjishnu Mukherjee, Aylin Caliskan, Ziwei Zhu, Antonios Anastasopoulos

  • Toward Interactive Regional Understanding in Vision-Large Language Models Jungbeom Lee, Sanghyuk Chun, Sangdoo Yun

  • Script-mix: Mixing Scripts for Low-resource Language Parsing Jaeseong Lee, Dohyeon Lee, seung-won hwang

  • MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation Jiahuan Li, Shanbo Cheng, Shujian Huang, Jiajun Chen

  • ToXCL: A Unified Framework for Toxic Speech Detection and Explanation Nhat M. Hoang, Do Xuan Long, Duc Anh Do, Duc Anh Vu, Anh Tuan Luu

  • LinkPrompt: Natural and Universal Adversarial Attacks on Prompt-based Language Models Yue Xu, Wenjie Wang

  • CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions Hanchong Zhang, Ruisheng Cao, Hongshen Xu, Lu Chen, Kai Yu

  • ContraDoc: Understanding Self-Contradictions in Documents with Large Language Models Jierui Li, Vipul Raheja, Dhruv Kumar

  • Entity Disambiguation via Fusion Entity Decoding Junxiong Wang, Ali Mousavi, Omar Attia, Saloni Potdar, Alexander M Rush, Umar Farooq Minhas, Yunyao Li

  • PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers Myeonghwa Lee, Seonho An, Min-Soo Kim

  • GPTScore: Evaluate as You Desire Jinlan Fu, See-Kiong Ng, Zhengbao Jiang, Pengfei Liu

  • A Survey of Confidence Estimation and Calibration in Large Language Models Jiahui Geng, Fengyu Cai, Yuxia Wang, Heinz Koeppl, Preslav Nakov, Iryna Gurevych

  • Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying References Tianyi Tang, Hongyuan Lu, Yuchen Eleanor Jiang, Haoyang Huang, Dongdong Zhang, Xin Zhao, Tom Kocmi, Furu Wei

  • Separation and Fusion: A Novel Multiple Token Linking Model for Event Argument Extraction Jing Xu, Dandan Song, Siu Cheung Hui, Zhijing Wu, Meihuizi Jia, Hao Wang, Yanru Zhou, Changzhi Zhou, Ziyi Yang

  • The Integration of Semantic and Structural Knowledge in Knowledge Graph Entity Typing Muzhi Li, Minda Hu, Irwin King, Ho-fung Leung

  • ComCLIP: Training-Free Compositional Image and Text Matching Kenan Jiang, Xuehai He, Ruize Xu, Xin Eric Wang

  • ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications Sotaro Takeshita, Tommaso Green, Ines Reinig, Kai Eckert, Simone Paolo Ponzetto

  • XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners Yun Luo, Zhen Yang, Fandong Meng, Yingjie Li, Fang Guo, Qinglin Qi, Jie Zhou, Yue Zhang

  • LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation? Yuchi Wang, Shuhuai Ren, Rundong Gao, Linli Yao, Qingyan Guo, Kaikai An, Jianhong Bai, Xu Sun

  • Intent-conditioned and Non-toxic Counterspeech Generation using Multi-Task Instruction Tuning with RLAIF Amey Hengle, Aswini Kumar Padhi, Sahajpreet Singh, Anil Bandhakavi, Md Shad Akhtar, Tanmoy Chakraborty

  • Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey Zhichen Dong, Zhanhui Zhou, Chao Yang, Jing Shao, Yu Qiao

  • Mind’s Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models Weize Liu, Guocong Li, Kai Zhang, Bang Du, Qiyuan Chen, Xuming Hu, Hongxia Xu, Jintai Chen, Jian Wu

  • Divergent Token Metrics: Measuring degradation to prune away LLM components – and optimize quantization Björn Deiseroth, Max Meuer, Nikolas Gritsch, Constantin Eichenberg, Patrick Schramowski, Matthias Aßenmacher, Kristian Kersting

  • Beyond Performance: Quantifying and Mitigating Label Bias in LLMs Yuval Reif, Roy Schwartz

  • Instructing Large Language Models to Identify and Ignore Irrelevant Conditions Zhenyu Wu, Chao Shen, Meng Jiang

  • On the Relationship Between Non-deterministic FSLMs and RNN LMs Anej Svete, Anisha Mohamed Sahabdeen, Franz Nowak, Ryan Cotterell

  • Transformer LMs Can Represent Probabilistic Strictly Local Languages Anej Svete, Ryan Cotterell

  • The Role of n-gram Smoothing in the Age of Neural Networks Luca Malagutti, Anej Svete, Clara Meister, Andrius Buinovskij, Afra Amini, Ryan Cotterell

  • Reliability Estimation of News Media Sources: Birds of a Feather Flock Together Sergio Burdisso, Dairazalia Sanchez-Cortes, Esaú Villatoro-Tello, Petr Motlicek

  • On the Multilingual Ability of Decoder-based Pre-trained Language Models: Finding and Controlling Language-Specific Neurons Takeshi Kojima, Itsuki Okimura, Yusuke Iwasawa, Hitomi Yanaka, Yutaka Matsuo

  • NLP Progress in Indigenous Latin American Languages Atnafu Lambebo Tonja, Fazlourrahman Balouchzahi, Sabur Butt, Olga Kolesnikova, Hector Ceballos, Alexander Gelbukh, Thamar Solorio

  • On the Effectiveness of Adversarial Robustness for Abuse Mitigation with Counterspeech Yi-Ling Chung, Jonathan Bright

  • Leveraging the Structure of Pre-trained Embeddings to Minimize Annotation Effort Cesar Gonzalez-Gutierrez, Ariadna Quattoni

  • UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing Yijun Yang, Jie He, Pinzhen Chen, Victor Gutierrez Basulto, Jeff Z. Pan

  • Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity Soyeong Jeong, Jinheon Baek, Sukmin Cho, Sung Ju Hwang, Jong C. Park

  • Knowing What LLMs DO NOT Know: A Simple Yet Effective Self-Detection Method Yukun Zhao, Lingyong Yan, Weiwei Sun, Guoliang Xing, Chong Meng, Shuaiqiang Wang, Zhicong Cheng, Zhaochun Ren, Dawei Yin

  • Are Large Language Models Temporally Grounded? Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo Ponti, Shay B Cohen

  • Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling Yupu Liang, Yaping Zhang, Cong Ma, Zhiyang Zhang, Yang Zhao, Lu Xiang, Chengqing Zong, Yu Zhou

  • Elastic Weight Removal for Faithful and Abstractive Dialogue Generation Nico Daheim, Nouha Dziri, Mrinmaya Sachan, Iryna Gurevych, Edoardo Ponti

  • R-Tuning: Instructing Large Language Models to Say ‘I Don’t Know’ Hanning Zhang, Shizhe Diao, Yong Lin, Yi Fung, Qing Lian, Xingyao Wang, Yangyi Chen, Heng Ji, Tong Zhang

  • Bridging the Gap between Different Vocabularies for LLM Ensemble Yangyifan Xu, Jinliang Lu, Jiajun Zhang

  • KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation Xindi Luo, Zequn Sun, Jing Zhao, Zhe Zhao, Wei Hu

  • Extremely Weakly-supervised Text Classification with Wordsets Mining and Sync-Denoising Lysa Xiao

  • F-MALLOC: Feed-forward Memory Allocation for Continual Learning in Neural Machine Translation Junhong Wu, Yuchen Liu, Chengqing Zong

  • Towards Reducing Diagnostic Errors with Interpretable Risk Prediction Denis Jered McInerney, William Dickinson, Lucy C. Flynn, Andrea C Young, Geoffrey Young, Jan-Willem van de Meent, Byron C Wallace

  • Generalizable Multilingual Hate Speech Detection on Low Resource Indian Languages using Fair Selection in Federated Learning Akshay Singh, Rahul Thakur

  • Key ingredients for effective zero-shot cross-lingual knowledge transfer in generative tasks Nadezhda Chirkova, Vassilina Nikoulina

  • The Impact of Depth on Compositional Generalization in Transformer Language Models Jackson Petty, Sjoerd van Steenkiste, Ishita Dasgupta, Fei Sha, Dan Garrette, Tal Linzen

  • Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering Neha Srikanth, Rupak Sarkar, Heran Mane, Elizabeth Aparicio, Quynh Nguyen, Rachel Rudinger, Jordan Lee Boyd-Graber

  • Towards Explainability in Legal Outcome Prediction Models Josef Valvoda, Ryan Cotterell

  • The steerability of large language models toward data-driven personas Junyi Li, Charith Peris, Ninareh Mehrabi, Palash Goyal, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta

  • CCSum: A Large-Scale and High-Quality Dataset for Abstractive News Summarization Xiang Jiang, Markus Dreyer

  • Capturing Perspectives of Crowdsourced Annotators in Subjective Learning Tasks Negar Mokhberian, Myrl G Marmarelis, Frederic Rene Hopp, Valerio Basile, Fred Morstatter, Kristina Lerman

  • Improving Factual Accuracy of Neural Table-to-Text Output by Addressing Input Problems in ToTTo Barkavi Sundararajan, Yaji Sripada, Ehud Reiter

  • CERET: Cost-Effective Extrinsic Refinement for Text Generation Jason Cai, Hang Su, Monica Sunkara, Igor Shalyminov, Saab Mansour

  • Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling Subhendu Khatuya, Rajdeep Mukherjee, Akash Ghosh, Manjunath Hegde, Koustuv Dasgupta, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

  • Analysis of State-Level Legislative Process in Enhanced Linguistic and Nationwide Network Contexts Maryam Davoodi, Dan Goldwasser

  • DeMuX: Data-efficient Multilingual Learning Simran Khanuja, Srinivas Gowriraj, Lucio M. Dery, Graham Neubig

  • DUQGen: Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation Ramraj Chandradevan, Kaustubh Dhole, Eugene Agichtein

  • How did we get here? Summarizing conversation dynamics Yilun Hua, Nick Chernogor, Yuzhe Gu, Seoyeon Julie Jeong, Miranda Luo, Cristian Danescu-Niculescu-Mizil

  • Can Language Model Moderators Improve the Health of Online Discourse? Hyundong Justin Cho, Shuai Liu, Taiwei Shi, Darpan Jain, Basem Rizk, Yuyang Huang, Zixun Lu, Nuan Wen, Jonathan Gratch, Emilio Ferrara, Jonathan May

  • LeanReasoner: Boosting Complex Logical Reasoning with Lean Dongwei Jiang, Marcio Fonseca, Shay B Cohen

  • UICoder: Finetuning Large Language Models to Generate User Interface Code through Automated Feedback Jason Wu

  • Measuring Cross-lingual Transfer in Bytes Leandro Rodrigues De Souza, Thales Sales Almeida, Roberto Lotufo, Rodrigo Frassetto Nogueira

  • MisgenderMender: A Community-Informed Approach to Interventions for Misgendering Tamanna Hossain, Sunipa Dev, Sameer Singh

  • Interplay of Machine Translation, Diacritics, and Diacritization Wei-Rui Chen, Ife Adebara, Muhammad Abdul-Mageed

  • From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning Ming Li, Yong Zhang, Zhitao Li, Jiuhai Chen, Lichang Chen, Ning Cheng, Jianzong Wang, Tianyi Zhou, Jing Xiao

  • Safer-Instruct: Aligning Language Models with Automated Preference Data Taiwei Shi, Kai Chen, Jieyu Zhao

  • PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization Joseph J Peper, Wenzhao Qiu, Lu Wang

  • Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination? Bangzheng Li, Ben Zhou, Fei Wang, Xingyu Fu, Dan Roth, Muhao Chen

  • IndiSentiment140: Sentiment Analysis Dataset for Indian Languages with Emphasis on Low-Resource Languages using Machine Translation Saurabh Kumar, Ranbir Singh Sanasam, Sukumar Nandi

  • Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval Nandan Thakur, Jianmo Ni, Gustavo Hernandez Abrego, John Frederick Wieting, Jimmy Lin, Daniel Cer

  • SCANNER: Knowledge-Enhanced Approach for Robust Multi-modal Named Entity Recognition of Unseen Entities Hyunjong Ok, Taeho Kil, Sukmin Seo, Jaeho Lee

  • A Theory Guided Scaffolding Instruction Framework for LLM-Enabled Metaphor Reasoning Yuan Tian, Nan Xu, Wenji Mao

  • Learning to Compress Prompt in Natural Language Formats Yu-Neng Chuang, Tianwei Xing, Chia-Yuan Chang, Zirui Liu, Xun Chen, Xia Hu

  • Automatic, Meta and Human Evaluation for Multimodal Summarization with Multimodal Output Haojie Zhuang, Wei Emma Zhang, Leon Xie, Weitong Chen, Jian Yang, Quan Z. Sheng

  • Naive Bayes-based Context Extension for Large Language Models Jianlin Su, Murtadha Ahmed, Bo Wen, Luo Ao, Mingren Zhu, Yunfeng Liu

  • Leitner-Guided Memory Replay for Cross-lingual Continual Learning Meryem Mhamdi, Jonathan May

  • Multilingual Nonce Dependency Treebanks: Understanding how Language Models Represent and Process Syntactic Structure David Arps, Laura Kallmeyer, Younes Samih, Hassan Sajjad

  • Actively Learn from LLMs with Uncertainty Propagation for Generalized Category Discovery Jinggui Liang, Lizi Liao, Hao Fei, Bobo Li, Jing Jiang

  • Explaining Text Similarity in Transformer Models Oliver Eberle, Alexandros Vasileiou

  • Large Language Models can Contrastively Refine their Generation for Better Sentence Representation Learning Huiming Wang, Zhaodonghui Li, Liying Cheng, De Wen Soh, Lidong Bing

  • HIL: Hybrid Isotropy Learning for Zero-shot Performance in Dense retrieval JaeYoung Kim, Dohyeon Lee, Seung-won Hwang

  • SuperGLEBer: German Language Understanding Evaluation Benchmark Jan Pfister, Andreas Hotho

  • You are an expert annotator: Automatic Best–Worst-Scaling Annotations for Emotion Intensity Modeling Christopher Bagdon, Prathamesh Karmalkar, Harsha Gurulingappa, Roman Klinger

  • What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? Yan Zeng, Hanbo Zhang, Jiani Zheng, Jiangnan Xia, Guoqiang Wei, Yang Wei, Yuchen Zhang, Tao Kong, Ruihua Song

  • Defining and Detecting Vulnerability in Human Evaluation Guidelines: A Preliminary Study Towards Reliable NLG Evaluation Jie Ruan, Wang Wenqing, Xiaojun Wan

  • MOSAICo: a Multilingual Open-text Semantically Annotated Interlinked Corpus Simone Conia, Edoardo Barba, Abelardo Carlos Martinez Lorenzo, Pere-Lluís Huguet Cabot, Riccardo Orlando, Luigi Procopio, Roberto Navigli

  • SemRoDe: Macro Adversarial Training to Learn Representations that are Robust to Word-Level Attacks Brian Formento, Wenjie Feng, Chuan-Sheng Foo, Anh Tuan Luu, See-Kiong Ng

  • BUST: Benchmark for the evaluation of detectors of LLM-Generated Text Joseph Cornelius, Oscar William Lithgow-Serrano, Sandra Mitrovic, Ljiljana Dolamic, Fabio Rinaldi

  • Improving In-context Learning of Multilingual Generative Language Models with Cross-lingual Alignment Chong Li, Shaonan Wang, Jiajun Zhang, Chengqing Zong

  • MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration Xianwei Zhuang, Zhichang Wang, Xuxin Cheng, Yuxin Xie, Liming Liang, Yuexian Zou

  • Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion? Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

  • Discovering Lobby-Parliamentarian Alignments through NLP Aswin Suresh, Lazar Radojević, Francesco Salvi, Antoine Magron, Victor Kristof, Matthias Grossglauser

  • IterCQR: Iterative Conversational Query Reformulation with Retrieval Guidance Yunah Jang, Kang-il Lee, Hyunkyung Bae, Hwanhee Lee, Kyomin Jung

  • AceGPT, Localizing Large Language Models in Arabic Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Song Dingjie, Zhihong Chen, Mosen Alharthi, Bang An, Juncai He, Ziche Liu, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu

  • Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model Zhiwei He, Xing Wang, Wenxiang Jiao, Zhuosheng Zhang, Rui Wang, Shuming Shi, Zhaopeng Tu

  • Depression Detection in Clinical Interviews with LLM-Empowered Structural Element Graph Zhuang Chen, Jiawen Deng, Jinfeng Zhou, Jincenzi Wu, Tieyun Qian, Minlie Huang

  • SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU Evgeniia Razumovskaia, Goran Glavaš, Anna Korhonen, Ivan Vulić

  • Enhancing Argument Summarization: Prioritizing Exhaustiveness in Key Point Generation and Introducing an Automatic Coverage Evaluation Metric Mohammad Khosravani, Chenyang Huang, Amine Trabelsi

  • ARM: Alignment with Residual Energy-Based Model Bo Pang, Caiming Xiong, Yingbo Zhou

  • HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants Milan Gritta, Gerasimos Lampouras, Ignacio Iacobacci

  • FAMuS: Frames Across Multiple Sources Siddharth Vashishtha, Alexander Martin, William Gantt, Benjamin Van Durme, Aaron Steven White

  • Rationale-based Opinion Summarization Haoyuan Li, Snigdha Chaturvedi

  • Mustango: Toward Controllable Text-to-Music Generation Jan Melechovsky, Zixun Guo, Deepanway Ghosal, Navonil Majumder, Dorien Herremans, Soujanya Poria

  • Adaptive Cross-lingual Text Classification through In-Context One-Shot Demonstrations Emilio Villa Cueva, Adrian Pastor Lopez Monroy, Fernando Sánchez-Vega, Thamar Solorio

  • CNER: Concept and Named Entity Recognition Giuliano Martinelli, Francesco Maria Molfese, Simone Tedeschi, Alberte Fernández-Castro, Roberto Navigli

  • Branch-Solve-Merge Improves Large Language Model Evaluation and Generation Swarnadeep Saha, Omer Levy, Asli Celikyilmaz, Mohit Bansal, Jason E Weston, Xian Li

  • REPLUG: Retrieval-Augmented Black-Box Language Models Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Richard James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih

  • David helps Goliath: Inference-Time Collaboration Between Small Specialized and Large General Diffusion LMs Xiaochuang Han, Sachin Kumar, Yulia Tsvetkov, Marjan Ghazvininejad

  • Efficient End-to-End Visual Document Understanding with Rationale Distillation Wang Zhu, Alekh Agarwal, Mandar Joshi, Robin Jia, Jesse Thomason, Kristina Toutanova

  • A Systematic Comparison of Syllogistic Reasoning in Humans and Language Models Tiwalayo Eisape, Michael Henry Tessler, Ishita Dasgupta, Fei Sha, Sjoerd van Steenkiste, Tal Linzen

  • Computationally Efficient Active Learning for Large Imbalanced Datasets Pietro Lesci, Andreas Vlachos

  • ICLE++: Modeling Fine-Grained Traits for Holistic Essay Scoring Shengjie Li, Vincent Ng

  • UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations Wenting Zhao, Justin T Chiu, Jena D. Hwang, Faeze Brahman, Jack Hessel, Sanjiban Choudhury, Yejin Choi, Xiang Lorraine Li, Alane Suhr

  • To Tell The Truth: Language of Deception and Language Models Sanchaita Hazra, Bodhisattwa Prasad Majumder

  • Multilingual Models for ASR in Chibchan Languages Rolando Coto-Solano, Tai Wan Kim, Alexander Jones, Sharid Loáiciga

  • LegalDiscourse: Interpreting When Laws Apply and To Whom Alexander Spangher, Zihan Xue, Te-Lin Wu, Mark Hansen, Jonathan May

  • X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects Minqian Liu, Ying Shen, Zhiyang Xu, Yixin Cao, Eunah Cho, Vaibhav Kumar, Reza Ghanadan, Lifu Huang

  • Is Reference Necessary in the Evaluation of NLG Systems? When and Where? Shuqian Sheng, Yi Xu, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xinbing Wang, Chenghu Zhou

  • Semi-Structured Chain-of-Thought: Integrating Multiple Sources of Knowledge for Improved Language Model Reasoning Xin Su, Tiep Le, Steven Bethard, Phillip Howard

  • Evaluating the Deductive Competence of Large Language Models S M Seals, Valerie Shalin

  • Large Human Language Models: A Need and the Challenges Nikita Soni, H. Schwartz, João Sedoc, Niranjan Balasubramanian

  • On Learning to Summarize with Large Language Models as References Yixin Liu, Kejian Shi, Katherine S He, Longtian Ye, Alexander Fabbri, Pengfei Liu, Dragomir Radev, Arman Cohan

  • Hallucination Diversity-Aware Active Learning for Text Summarization Yu Xia, Xu Liu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Anup Rao, Tung Mai, Shuai Li

  • Keep it Private: Unsupervised Privatization of Online Text Calvin Bao, Marine Carpuat

  • Tied-LoRA: Enhancing parameter efficiency of LoRA with Weight Tying Adithya Renduchintala, Tugrul Konuk, Oleksii Kuchaiev

  • Investigating Data Contamination in Modern Benchmarks for Large Language Models Chunyuan Deng, Yilun Zhao, Xiangru Tang, Mark Gerstein, Arman Cohan

  • Pre-trained Language Models for Entity Blocking: A Reproducibility Study Runhui Wang, Yongfeng Zhang

  • RE²: Region-Aware Relation Extraction from Visually Rich Documents Pritika Ramu, Sijia Wang, Lalla Mouatadid, Joy Rimchala, Lifu Huang

  • Mix-Initiative Response Generation with Dynamic Prefix Tuning Yuxiang Nie, Heyan Huang, Xian-Ling Mao, Lizi Liao

  • Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Value Jing Yao, Xiaoyuan Yi, Yifan Gong, Xiting Wang, Xing Xie

  • IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context Nihar Ranjan Sahoo, Pranamya Prashant Kulkarni, Arif Ahmad, Tanu Goyal, Narjis Asad, Aparna Garimella, Pushpak Bhattacharyya

Main Conference - Short Papers

  • Revisiting Zero-Shot Abstractive Summarization in the Era of Large Language Models from the Perspective of Position Bias Anshuman Chhabra, Hadi Askari, Prasant Mohapatra

  • Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data? Xiangru Tang, Yiming Zong, Jason Phang, Yilun Zhao, Wangchunshu Zhou, Arman Cohan, Mark Gerstein

  • Improving Toponym Resolution by Predicting Attributes to Constrain Geographical Ontology Entries Zeyu Zhang, Egoitz Laparra, Steven Bethard

  • Advancing Regular Language Reasoning in Linear Recurrent Neural Networks Ting-Han Fan, Ta-Chung Chi, Alexander Rudnicky

  • Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers Roy Xie, Orevaoghene Ahia, Yulia Tsvetkov, Antonios Anastasopoulos

  • Clear Up Confusion: Advancing Cross-Domain Few-Shot Relation Extraction through Relation-Aware Prompt Learning Ge Bai, Chenji Lu, Daichi Guo, Shilong Li, Ying Liu, Zhang Zhang, Guanting Dong, Ruifang Liu, Sun Yong

  • Fusion Makes Perfection: An Efficient Multi-Grained Matching Approach for Zero-Shot Relation Extraction Shilong Li, Ge Bai, Zhang Zhang, Ying Liu, Chenji Lu, Daichi Guo, Ruifang Liu, Sun Yong

  • Personalized Review Recommendation based on Implicit dimension mining Bei Xu, Yifan Xu

  • Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

  • Returning to the Start: Generating Narratives with Related Endpoints Anneliese Brei, Chao Zhao, Snigdha Chaturvedi

  • Unified Examination of Entity Linking in Absence of Candidate Sets Nicolas Ong, Hassan Shavarani, Anoop Sarkar

  • MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages Daryna Dementieva, Nikolay Babakov, Alexander Panchenko

  • SKICSE: Sentence Knowable Information Prompted by LLMs Improves Contrastive Sentence Embeddings Fangwei Ou, Jinan Xu

  • A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models Jaylen Jones, Lingbo Mo, Eric Fosler-Lussier, Huan Sun

  • How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes Harmon Bhasin, Timothy Ossowski, Yiqiao Zhong, Junjie Hu

  • CELI: Simple yet Effective Approach to Enhance Out-of-Domain Generalization of Cross-Encoders. Xinyu Zhang, Minghan Li, Jimmy Lin

  • ContrastiveMix: Overcoming Code-Mixing Dilemma in Cross-Lingual Transfer for Information Retrieval Junggeun Do, Jaeseong Lee, Seung-won Hwang

  • SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window Vikas Raunak, Tom Kocmi, Matt Post

  • Separately Parameterizing Singleton Detection Improves End-to-end Neural Coreference Resolution Xiyuan Zou, Yiran Li, Ian Porada, Jackie CK Cheung

  • Unveiling Divergent Inductive Biases of LLMs on Temporal Data Sindhu Kishore, Hangfeng He

  • On Retrieval Augmentation and the Limitations of Language Model Training Ting-Rui Chiang, Xinyan Velocity Yu, Joshua Robinson, Ollie Liu, Isabelle Lee, Dani Yogatama

  • GenDecider: Integrating “None of the Candidates” Judgments in Zero-Shot Entity Linking Re-ranking Kang Zhou, Yuepei Li, Qing Wang, Qiao Qiao, Qi Li

  • Advancing the Robustness of Large Language Models through Self-Denoised Smoothing Jiabao Ji, Bairu Hou, Zhen Zhang, Guanhua Zhang, Wenqi Fan, Qing Li, Yang Zhang, Gaowen Liu, Sijia Liu, Shiyu Chang

  • Can LLM’s Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis Vishnu Sashank Dorbala, Sanjoy Chowdhury, Dinesh Manocha

  • On the Role of Summary Content Units in Text Summarization Evaluation Marcel Nawrath, Agnieszka Wiktoria Nowak, Tristan Ratz, Danilo Constantin Walenta, Juri Opitz, Leonardo F. R. Ribeiro, João Sedoc, Daniel Deutsch, Simon Mille, Yixin Liu, Sebastian Gehrmann, Lining Zhang, Saad Mahamood, Miruna Clinciu, Khyathi Chandu, Yufang Hou

  • More room for language: Investigating the effect of retrieval on language models David Samuel, Lucas Georges Gabriel Charpentier, Sondre Wold

  • Discourse-Aware In-Context Learning for Temporal Expression Normalization Akash Kumar Gautam, Lukas Lange, Jannik Strötgen

  • Contextualizing Argument Quality Assessment with Relevant Knowledge Darshan Girish Deshpande, Zhivar Sourati, Filip Ilievski, Fred Morstatter

  • Selective Perception: Learning Concise State Descriptions for Language Model Actors Kolby Nottingham, Yasaman Razeghi, Kyungmin Kim, JB Lanier, Pierre Baldi, Roy Fox, Sameer Singh

  • ALOHa: A New Measure for Hallucination in Captioning Models Suzanne Petryk, David Chan, Anish Kachinthaya, Haodi Zou, John Canny, Joseph E. Gonzalez, Trevor Darrell

  • Beyond Yes and No: Improving Zero-Shot LLM Rankers via Scoring Fine-Grained Relevance Labels Honglei Zhuang, Zhen Qin, Kai Hui, Junru Wu, Le Yan, Xuanhui Wang, Michael Bendersky

  • LLM-Driven Knowledge Injection Advances Zero-Shot and Cross-Target Stance Detection Zhao Zhang, Yiming Li, Jade Chang, Hui Xu

  • Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information Shadi Iskander, Kira Radinsky, Yonatan Belinkov

  • Direct Preference Optimization for Neural Machine Translation with Minimum Bayes Risk Decoding Guangyu Yang, Jinghong Chen, Weizhe Lin, Bill Byrne

  • EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning Raja Sekhar Reddy Mekala, Yasaman Razeghi, Sameer Singh

  • LEAF: Language Learners’ English Essays and Feedback Corpus Shabnam Behzad, Omid Kashefi, Swapna Somasundaran

  • Zero-Shot vs. Translation-Based Cross-Lingual Transfer: The Case of Lexical Gaps Abteen Ebrahimi, Katharina von der Wense

  • On the True Distribution Approximation of Minimum Bayes-Risk Decoding Atsumoto Ohashi, Ukyo Honda, Tetsuro Morimura, Yuu Jinnai

  • Rehearsal-Free Modular and Compositional Continual Learning for Language Models Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, Hinrich Schuetze

  • Llama meets EU: Investigating the European political spectrum through the lens of LLMs Ilias Chalkidis, Stephanie Brandl

  • M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation Benjamin Hsu, Xiaoyu Liu, Huayang Li, Yoshinari Fujinuma, Maria Nadejde, Xing Niu, Ron Litman, Yair Kittenplon, Raghavendra Pappagari

  • Control-DAG: Constrained Decoding for Non-Autoregressive Directed Acyclic T5 using Weighted Finite State Automata Jinghong Chen, Weizhe Lin, Jingbiao Mei, Bill Byrne

  • Do Vision-Language Models Understand Compound Nouns? Sonal Kumar, Sreyan Ghosh, S Sakshi, Utkarsh Tyagi, Dinesh Manocha

  • Is Prompt Transfer Always Effective? An Empirical Study of Prompt Transfer for Question Answering Minji Jung, Soyeon Park, Jeewoo Sul, Yong Suk Choi

  • Lost in Space: Probing Fine-grained Spatial Understanding in Vision and Language Resamplers Georgios Pantazopoulos, Alessandro Suglia, Oliver Lemon, Arash Eshghi

  • Do Multilingual Language Models Think Better in English? Julen Etxaniz, Gorka Azkune, Aitor Soroa, Oier Lopez de Lacalle, Mikel Artetxe

  • A Continued Pretrained LLM Approach for Automatic Medical Note Generation Dong Yuan, Eti Rastogi, Gautam Naik, Sree Prasanna Rajagopal, Sagar Goyal, Fen Zhao, Bharath Chintagunta, Jeffrey M. Ward

  • Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts Michael Saxon, Yiran Lawrence Luo, Sharon Levy, Chitta Baral, Yezhou Yang, William Yang Wang

  • Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models Tingyu Xie, Qi Li, Yan Zhang, Zuozhu Liu, Hongwei Wang

  • Lifelong Event Detection with Embedding Space Separation and Compaction Chengwei Qin, Ruirui Chen, Ruochen Zhao, Wenhan Xia, Shafiq Joty

  • Language Models (Mostly) Do Not Consider Emotion Triggers When Predicting Emotion Smriti Singh, Cornelia Caragea, Junyi Jessy Li

  • CPopQA: Ranking Cultural Concept Popularity by LLMs Ming Jiang, Mansi Joshi

  • The Impact of Language on Arithmetic Proficiency: A Multilingual Investigation with Cross-Agent Checking Computation Chung-Chi Chen, Hiroya Takamura, Ichiro Kobayashi, Yusuke Miyao

  • Efficient Information Extraction in Few-Shot Relation Classification through Contrastive Representation Learning Philipp Borchert, Jochen De Weerdt, Marie-Francine Moens

  • A diverse Multilingual News Headlines Dataset from around the World Felix Leeb, Bernhard Schölkopf

  • The Unreasonable Effectiveness of Random Target Embeddings for Continuous-Output Neural Machine Translation Evgeniia Tokarchuk, Vlad Niculae

  • Efficient Sample-Specific Encoder Perturbations Yassir Fathullah, Mark Gales

  • Diverse Perspectives, Divergent Models: Cross-Cultural Evaluation of Depression Detection on Twitter Nuredin Ali Abdelkadir, Charles Chuankai Zhang, Ned Mayo, Stevie Chancellor

  • Removing RLHF Protections in GPT-4 via Fine-Tuning Qiusi Zhan, Richard Fang, Rohan Bindu, Akul Gupta, Tatsunori Hashimoto, Daniel Kang

  • LifeTox: Unveiling Implicit Toxicity in Life Advice Minbeom Kim, Jahyun Koo, Hwanhee Lee, Joonsuk Park, Hwaran Lee, Kyomin Jung

  • Arithmetic Reasoning with LLM: Prolog Generation & Permutation Xiaocheng Yang, Bingsen Chen, Yik-Cheung Tam

  • Verifying Claims About Metaphors with Large-Scale Automatic Metaphor Identification Kotaro Aono, Ryohei Sasano, Koichi Takeda

  • InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra, Chitta Baral

  • MEMORY-VQ: Compression for Tractable Internet-Scale Memory Yury Zemlyanskiy, Michiel de Jong, Luke Vilnis, Santiago Ontanon, William W. Cohen, Sumit Sanghai, Joshua Ainslie

  • Unveiling the Magic: Investigating Attention Distillation in Retrieval-Augmented Generation Zizhong Li, Haopeng Zhang, Jiawei Zhang

  • Improving Factuality in Clinical Abstractive Multi-Document Summarization by Guided Continued Pre-training Ahmed Salem Mohamed Elhady Mohamed, Khaled Mostafa Elsayed, Eneko Agirre, Mikel Artetxe

  • MuLan: A Study of Fact Mutability in Language Models Constanza Fierro, Nicolas Garneau, Emanuele Bugliarello, Yova Kementchedjhieva, Anders Søgaard

  • Language-Independent Representations Improve Zero-Shot Summarization Vladimir Solovyev, Danni Liu, Jan Niehues

  • Trusting Your Evidence: Hallucinate Less with Context-aware Decoding Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, Wen-tau Yih

  • GuyLingo: The Republic of Guyana Creole Corpora Christopher Clarke, Roland Daynauth, Jason Mars, Charlene Wilkinson, Hubert Devonish

  • DoubleLingo: Causal Estimation with Large Language Models Marko Veljanovski, Zach Wood-Doughty

  • Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification Michail Mitsios, Georgios Vamvoukakis, Georgia Maniati, Nikolaos Ellinas, Georgios Dimitriou, Konstantinos Markopoulos, Panos Kakoulidis, Alexandra Vioni, Myrsini Christidou, Junkwang Oh, Gunu Jho, Inchul Hwang, Georgios Vardaxoglou, Aimilios Chalamandaris, Pirros Tsiakoulis, Spyros Raptis

  • On Narrative Question Answering Skills Emil Kalbaliyev, Kairit Sirts

  • Order-Based Pre-training Strategies for Procedural Text Understanding Abhilash Nandy, Yash Kulkarni, Pawan Goyal, Niloy Ganguly

  • Breaking the Language Barrier: Can Direct Inference Outperform Pre-Translation in Multilingual LLM Applications? Yotam Intrator, Matan Halfon, Roman Goldenberg, Reut Tsarfaty, Matan Eyal, Ehud Rivlin, Yossi Matias, Natalia Aizenberg

Findings Papers

  • Structured Pruning for Large Language Models Using Coupled Components Elimination and Minor Fine-tuning Honghe Zhang, Xiaolong Shi, Jingwei Sun, Guangzhong Sun

  • Weight-Inherited Distillation for Task-Agnostic BERT Compression Taiqiang Wu, Cheng Hou, Shanshan Lao, Jiayi Li, Ngai Wong, Zhe Zhao, Yujiu Yang

  • Ignore Me But Don’t Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain Eugene Jang, Jian Cui, Dayeon Yim, Youngjin Jin, Jin-Woo Chung, Seungwon Shin, Yongjae Lee

  • Extremely efficient online query encoding for dense retrieval Nachshon Cohen, Yaron Fairstein, Guy Kushilevitz

  • DIVKNOWQA: Assessing the Reasoning Ability of LLMs via Open-Domain Question Answering over Knowledge Base and Text Wenting Zhao, Ye Liu, Tong Niu, Yao Wan, Philip S. Yu, Shafiq Joty, Yingbo Zhou, Semih Yavuz

  • SpeedE: Euclidean Geometric Knowledge Graph Embedding Strikes Back Aleksandar Pavlović, Emanuel Sallinger

  • Language Guided Exploration for RL Agents in Text Environments Hitesh Golchha, Sahil Yerawar, Dhruvesh Patel, Soham Dan, Keerthiram Murugesan

  • GPT-who: An Information Density-based Machine-Generated Text Detector Saranya Venkatraman, Adaku Uchendu, Dongwon Lee

  • DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models Peng Tang, Pengkai Zhu, Tian Li, Srikar Appalaraju, Vijay Mahadevan, R. Manmatha

  • Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation Ta-Chung Chi, Ting-Han Fan, Alexander Rudnicky

  • Automatic Pair Construction for Contrastive Post-training Canwen Xu, Corby Rosset, Ethan C. Chau, Luciano Del Corro, Shweti Mahajan, Julian McAuley, Jennifer Neville, Ahmed Hassan Awadallah, Nikhil Rao

  • Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models Miaoran Li, Baolin Peng, Michel Galley, Jianfeng Gao, Zhu Zhang

  • Low-resource neural machine translation with morphological modeling Antoine Nzeyimana

  • Self-Cleaning: Improving a Named Entity Recognizer Trained on Noisy Data with a Few Clean Instances Zhendong Chu, Ruiyi Zhang, Tong Yu, Rajiv Jain, Vlad I Morariu, Jiuxiang Gu, Ani Nenkova

  • VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding Phong Nguyen-Thuan Do, Son Quoc Tran, Phu Gia Hoang, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

  • LETI: Learning to Generate from Textual Interactions Xingyao Wang, Hao Peng, Reyhaneh Jabbarvand, Heng Ji

  • Bilateral Masking with prompt for Knowledge Graph Completion Yonghui Kong, Cunhang Fan, Yujie Chen, Shuai Zhang, Zhao Lv, Jianhua Tao

  • MiLe Loss: a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models Zhenpeng Su, Zijia Lin, Baixue, Hui Chen, Songlin Hu, Wei Zhou, Guiguang Ding, Xing W

  • GOLD: Geometry Problem Solver with Natural Language Description Jiaxin Zhang, Yashar Moshfeghi

  • RoDia: A New Dataset for Romanian Dialect Identification from Speech Rotaru Codruț, Nicolae Catalin Ristea, Radu Tudor Ionescu

  • Examining Modularity in Multilingual LMs via Language-Specialized Subnetworks Rochelle Choenni, Ekaterina Shutova, Dan Garrette

  • Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning Yinger Zhang, Hui Cai, Xierui Song, Yicheng Chen, Rui Sun, Jing Zheng

  • Incorporating Exponential Smoothing into MLP: a Simple but Effective Sequence Model Jiqun Chu, Zuoquan Lin

  • OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models Yuxuan Kuang, Hai Lin, Meng Jiang

  • Comparing Two Model Designs for Clinical Note Generation; Is an LLM a Useful Evaluator of Consistency? Nathan Brake, Thomas Schaaf

  • VOLTA: Improving Generative Diversity by Variational Mutual Information Maximizing Autoencoder Yueen Ma, DaFeng Chi, Jingjing Li, Kai Song, Yuzheng Zhuang, Irwin King

  • EcoSpeak: Cost-Efficient Bias Mitigation for Partially Cross-Lingual Speaker Verification Divya V Sharma

  • Leveraging Contextual Information for Effective Entity Salience Detection Rajarshi Bhowmik, Marco Ponza, Atharva Tendle, Anant Gupta, Rebecca Jiang, Xingyu Lu, Qian Zhao, Daniel Preotiuc-Pietro

  • LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected? Qihui Zhang, Chujie Gao, Dongping Chen, Yue Huang, Yixin Huang, Zhenyang Sun, Shilin Zhang, Weiye Li, Zhengyan Fu, Yao Wan, Lichao Sun

  • A (More) Realistic Evaluation Setup for Generalisation of Community Models on Malicious Content Detection Ivo Verhoeven, Pushkar Mishra, Rahel Beloch, Helen Yannakoudakis, Ekaterina Shutova

  • Citation: A Key to Building Responsible and Accountable Large Language Models Jie Huang, Kevin Chang

  • Graph-Induced Syntactic-Semantic Spaces in Transformer-Based Variational AutoEncoders Yingji Zhang, Marco Valentino, Danilo Carvalho, Ian Pratt-Hartmann, Andre Freitas

  • Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles Weiting Tan, Haoran Xu, Lingfeng Shen, Shuyue Stella Li, Kenton Murray, Philipp Koehn, Benjamin Van Durme, Yunmo Chen

  • Which Modality should I use - Text, Motif, or Image?: Understanding Graphs with Large Language Models Debarati Das, Ishaan Gupta, Jaideep Srivastava, Dongyeop Kang

  • On-the-Fly Fusion of Large Language Models and Machine Translation Hieu Hoang, Huda Khayrallah, Marcin Junczys-Dowmunt

  • READ: Improving Relation Extraction from an ADversarial Perspective Dawei Li, William P Hogan, Jingbo Shang

  • REQUAL-LM: Reliability and Equity through Aggregation in Large Language Models Sana Ebrahimi, Nima Shahbazi, Abolfazl Asudeh

  • Addressing Both Statistical and Causal Gender Fairness in NLP Models Hannah Chen, Yangfeng Ji, David Evans

  • LLM-Rec: Personalized Recommendation via Prompting Large Language Models Hanjia Lyu, Song Jiang, Hanqing Zeng, Yinglong Xia, Qifan Wang, Si Zhang, Ren Chen, Chris Leung, Jiajie Tang, Jiebo Luo

  • A Robust Semantics-based Watermark for Large Language Model against Paraphrasing Jie Ren, Han Xu, Yiding Liu, Yingqian Cui, Shuaiqiang Wang, Dawei Yin, Jiliang Tang

  • Solving Data-centric Tasks using Large Language Models Shraddha Barke, Christian Poelitz, Carina Suzana Negreanu, Benjamin Zorn, José Cambronero, Andrew D. Gordon, Vu Le, Elnaz Nouri, Nadia Polikarpova, Advait Sarkar, Brian Slininger, Neil Toronto, Jack Williams

  • A Novel Paradigm Boosting Translation Capabilities of Large Language Models Jiaxin Guo, Hao Yang, Zongyao Li, Daimeng Wei, Hengchao Shang, Xiaoyu Chen

  • Measuring Social Norms of Large Language Models Ye Yuan, Kexin Tang, Jianhao Shen, Ming Zhang, Chenguang Wang

  • Source-Free Unsupervised Domain Adaptation for Question Answering via Prompt-Assisted Self-learning Maxwell Juncheng Yin, Boyu Wang, Charles Ling

  • Hierarchical Attention Graph for Scientific Document Summarization in Global and Local Level Chenlong Zhao, Xiwen Zhou, Xiaopeng Xie, Yong Zhang

  • LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systems Nalin Kumar, Ondrej Dusek

  • Efficient Dependency Tree Sampling Without Replacement Bogdan Mihai Dobre

  • Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization Zixuan Zhang, Revanth Gangi Reddy, Kevin Small, Tong Zhang, Heng Ji

  • GEE! Grammar Error Explanation with Large Language Models Yixiao Song, Kalpesh Krishna, Rajesh Bhatt, Kevin Gimpel, Mohit Iyyer

  • AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback Wanpeng Zhang, Zongqing Lu

  • DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations Weihao Zeng, Dayuan Fu, Keqing He, Yejie Wang, Yukai Xu, Weiran Xu

  • Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training Pavel Denisov, Thang Vu

  • CLEAN–EVAL: Clean Evaluation on Contaminated Large Language Models Wenhong Zhu, Hongkun Hao, Zhiwei He, Yun-Ze Song, Jiao Yueyang, Yumeng Zhang, Hanxu Hu, Yiran Wei, Rui Wang, Hongyuan Lu

  • R-BASS: Relevance-aided Block-wise Adaptation for Speech Summarization Roshan Sharma, Ruchira Sharma, Hira Dhamyal, Rita Singh, Bhiksha Raj

  • OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning Fei Yu, Anningzhe Gao, Benyou Wang

  • The Whole is Better than the Sum: Using Aggregated Demonstrations in In-Context Learning for Sequential Recommendation Lei Wang, Ee-Peng Lim

  • Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA Dhruv Agarwal, Rajarshi Das, Sopan Khosla, Rashmi Gangadharaiah

  • GraSAME: Injecting Token-Level Structural Information to Pretrained Language Models via Graph-guided Self-Attention Mechanism Shuzhou Yuan, Michael Färber

  • Can Public Large Language Models Help Private Cross-device Federated Learning? Boxin Wang, Yibo Jacky Zhang, Yuan Cao, Bo Li, Hugh Brendan McMahan, Sewoong Oh, Zheng Xu, Manzil Zaheer

  • LangNav: Language as a Perceptual Representation for Navigation Bowen Pan, Rameswar Panda, SouYoung Jin, Rogerio Feris, Aude Oliva, Phillip Isola, Yoon Kim

  • Planning and Editing What You Retrieve for Enhanced Tool Learning Tenghao Huang, Dongwon Jung, Vaibhav Kumar, Mohammad Kachuee, Xiang Li, Puyang Xu, Muhao Chen

  • Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs Victor Carbune, Hassan Mansoor, Fangyu Liu, Rahul Aralikatte, Gilles Baechler, Jindong Chen, Abhanshu Sharma

  • SLiM: Speculative Decoding with Hypothesis Reduction Chi-Heng Lin, Shikhar Tuli, James Seale Smith, Yen-Chang Hsu, Yilin Shen, Hongxia Jin

  • REMATCH: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity Zoher Kachwala, Jisun An, Haewoon Kwak, Filippo Menczer

  • Modeling the Sacred: Considerations when Using Religious Texts in Natural Language Processing Ben Hutchinson

  • Testing the Effect of Code Documentation on Large Language Model Code Understanding William Macke, Michael Dean Doyle

  • Aligning Large Language Models with Recommendation Knowledge Yuwei Cao, Nikhil Mehta, Xinyang Yi, Raghunandan Hulikal Keshavan, Lukasz Heldt, Lichan Hong, Ed H. Chi, Maheswaran Sathiamoorthy

  • OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining Yihong Liu, Peiqin Lin, Mingyang Wang, Hinrich Schuetze

  • SELF-EXPERTISE: Knowledge-based Instruction Dataset Augmentation for a Legal Expert Language Model Minju Kim, Haein Jung, Myoung-Wan Koo

  • Re-evaluating the Need for Visual Signals in Unsupervised Grammar Induction Boyi Li, Rodolfo Corona, Karttikeya Mangalam, Catherine Chen, Daniel Flaherty, Serge Belongie, Kilian Q Weinberger, Jitendra Malik, Trevor Darrell, Dan Klein

  • EDEntail: An Entailment-based Few-shot Text Classification with Extensional Definition Zixiao Zhu, Junlang Qian, Zijian Feng, Hanzhang Zhou, Kezhi Mao

  • What Makes Math Word Problems Challenging for LLMs? KV Aditya Srivatsa, Ekaterina Kochmar

  • SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models Lee Hyun, Kim Sung-Bin, Seungju Han, Youngjae Yu, Tae-Hyun Oh

  • T3M: Text Guided 3D Human Motion Synthesis from Speech Wenshuo Peng, Kaipeng Zhang, Sai Qian Zhang

  • Deja vu: Contrastive Historical Modeling with Prefix-tuning for Temporal Knowledge Graph Reasoning Miao Peng, Ben Liu, Wenjie Xu, Zihao Jiang, Jiahui Zhu, Min Peng

  • Explanation Extraction from Hierarchical Classification Frameworks for Long Legal Documents Nishchal Prasad, Taoufiq Dkaki, Mohand Boughanem

  • Low-Rank Adaptation for Multilingual Summarization: An Empirical Study Chenxi Whitehouse, Fantine Huot, Jasmijn Bastings, Mostafa Dehghani, Chu-Cheng Lin, Mirella Lapata

  • Empowering Multi-step Reasoning across Languages via Tree-of-Thoughts Leonardo Ranaldi, Giulia Pucci, Federico Ranaldi, Elena Sofia Ruzzetti, Fabio Massimo Zanzotto

  • Emergent Abilities in Reduced-Scale Generative Language Models Sherin Muckatira, Vijeta Deshpande, Vladislav Lialin, Anna Rumshisky

  • Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems Clemencia Siro, Mohammad Aliannejadi, Maarten de Rijke

  • Matching Varying-Length Texts via Topic-Informed and Decoupled Sentence Embeddings Xixi Zhou, Chunbin Gu, Xin Jie, Jiajun Bu, Haishuai Wang

  • Instruction Tuning with Human Curriculum Bruce W Lee, Hyunsoo Cho, Kang Min Yoo

  • Natural Language-based State Representation in Deep Reinforcement Learning Md Masudur Rahman, Yexiang Xue

  • Learning Cross-Architecture Instruction Embeddings for Binary Code Analysis in Low-Resource Architectures Junzhe Wang, Qiang Zeng, Lannan Luo

  • ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks Xiaodong Yu, Hao Cheng, Xiaodong Liu, Dan Roth, Jianfeng Gao

  • An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution Tien-Hong Lo, Fu-An Chao, Tzu-I Wu, Yao-Ting Sung, Berlin Chen

  • GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond Shen Zheng, Yuyu Zhang, Yijie Zhu, Chenguang Xi, Pengyang Gao, Zhou Xun, Kevin Chang

  • Subword Attention and Post-Processing for Rare and Unknown Contextualized Embeddings Raj Patel, Carlotta Domeniconi

  • UGIF-DataSet: A New Dataset for Cross-lingual, Cross-modal Sequential actions on the UI Sagar Gubbi Venkatesh, Partha Talukdar, Srini Narayanan

  • SimSCOOD: Systematic Analysis of Out-of-Distribution Generalization in Fine-tuned Source Code Models Hossein Hajipour, Ning Yu, Cristian-Alexandru Staicu, Mario Fritz

  • Pruning as a Domain-specific LLM Extractor Nan Zhang, Yanchi Liu, Xujiang Zhao, Wei Cheng, Runxue Bao, Rui Zhang, Prasenjit Mitra, Haifeng Chen

  • LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback Wenda Xu, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Biao Zhang, Zhongtao Liu, William Yang Wang, Lei Li, Markus Freitag

  • Noisy Multi-Label Text Classification via Instance-Label Pair Correction Pengyu Xu, Mingyang Song, Linkaida Liu, Bing Liu, Hongjian Sun, Liping Jing, Jian Yu

  • Composite Backdoor Attacks Against Large Language Models Hai Huang, Zhengyu Zhao, Michael Backes, Yun Shen, Yang Zhang

  • Adapting Fake News Detection to the Era of Large Language Models Jinyan Su, Claire Cardie, Preslav Nakov

  • MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval Youbo Lei, Feifei He, Chen Chen, Yingbin Mo, Si Jia Li, Defeng Xie, Haonan Lu

  • Large Language Models are Effective Text Rankers with Pairwise Ranking Prompting Zhen Qin, Rolf Jagerman, Kai Hui, Honglei Zhuang, Junru Wu, Le Yan, Jiaming Shen, Tianqi Liu, Jialu Liu, Donald Metzler, Xuanhui Wang, Michael Bendersky

  • FedLFC: Towards Efficient Federated Multilingual Modeling with LoRA-based Language Family Clustering Zhihan Guo, Yifei Zhang, Zhuo Zhang, Zenglin Xu, Irwin King

  • Gaussian Process Optimization for Adaptable Multi-Objective Text Generation using Linearly-Weighted Language Models Mohammad Mahdi Abdollah Pour, Ali Pesaranghader, Eldan Cohen, Scott Sanner

  • Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study Alessandro Stolfo

  • TagDebias: Entity and Concept Tagging for Social Bias Mitigation in Pretrained Language Models Mehrnaz Moslemi, Amal Zouaq

  • Improving Absent Keyphrase Generation with Diversity Heads Edwin Thomas, Sowmya Vajjala

  • mOthello: When Do Cross-Lingual Representation Alignment and Cross-Lingual Transfer Emerge in Multilingual Models? Tianze Hua, Tian Yun, Ellie Pavlick

  • Discovering and Mitigating Indirect Bias in Attention-Based Model Explanations Farsheed Haque, Depeng Xu, Shuhan Yuan

  • i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data Ziyi Yang, Mahmoud Khademi, Yichong Xu, Reid Pryzant, Yuwei Fang, Chenguang Zhu, Dongdong Chen, Yao Qian, Xuemei Gao, Yi-Ling Chen, Robert Gmyr, Naoyuki Kanda, Noel C Codella, Bin Xiao, Yu Shi, Lu Yuan, Takuya Yoshioka, Michael Zeng, Xuedong Huang

  • Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text Generation Yifu Qiu, Varun R. Embar, Shay B Cohen, Benjamin Han

  • It’s All Relative! – A Synthetic Query Generation Approach for Improving Zero-Shot Relevance Prediction Aditi Chaudhary, Karthik Raman, Michael Bendersky

  • RS-DPO: A Hybrid Rejection Sampling and Direct Preference Optimization Method for Alignment of Large Language Models Saeed Khaki, JinJin Li, Lan Ma, Liu Yang, Prathap Ramachandra

  • Hypernetwork-Assisted Parameter-Efficient Fine-Tuning with Meta-Knowledge Distillation for Domain Knowledge Disentanglement Changqun Li, Linlin Wang, Xin Alex Lin, Shizhou Huang, Liang He

  • MICo: Preventative Detoxification of Large Language Models through Inhibition Control Roy Siegelmann, Ninareh Mehrabi, Palash Goyal, Prasoon Goyal, Lisa Bauer, Jwala Dhamala, Aram Galstyan, Rahul Gupta, Reza Ghanadan

  • Reinforcement Learning with Token-level Feedback for Controllable Text Generation Wendi Li, Wei Wei, Kaihe Xu, Wenfeng Xie, Dangyang Chen, Yu Cheng

  • CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving Pei Chen, Shuai Zhang, Boran Han

  • Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies Anaelia Ovalle, Ninareh Mehrabi, Palash Goyal, Jwala Dhamala, Kai-Wei Chang, Richard Zemel, Aram Galstyan, Yuval Pinter, Rahul Gupta

  • AdaPT: A Set of Guidelines for Hyperbolic Multimodal Multilingual NLP Ramit Sawhney, Shrey Pandit, Vishwa Shah, Megh Thakkar, Shafiq Joty

  • More Samples or More Prompts? Exploring Effective Few-Shot In-Context Learning for LLMs with In-Context Sampling Bingsheng Yao, Guiming Hardy Chen, Ruishi Zou, Yuxuan Lu, Jiachen Li, Shao Zhang, Yisi Sang, Sijia Liu, James Hendler, Dakuo Wang

  • ZSEE: A Dataset based on Zeolite Synthesis Event Extraction for Automated Synthesis Platform Song He, Xin Peng, Yihan Cai, Xin Li, Zhiqing Yuan, WenLi Du, Weimin Yang

  • Mitigating Hallucination in Abstractive Summarization with Domain-Conditional Mutual Information Kyubyung Chae, Jaepill Choi, Yohan Jo, Taesup Kim

  • Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents San Kim, Gary Lee

  • Prompt Space Optimizing Few-shot Reasoning Success with Large Language Models Fobo Shi, Peijun Qing, Dong Yang, Nan Wang, Youbo Lei, Haonan Lu, Xiaodong Lin, Duantengchuan Li

  • DAGCN: Distance-based and Aspect-oriented Graph Convolutional Network for Aspect-based Sentiment Analysis Zhihao Wang, Bo Zhang, Ru Yang, Chang Guo, Maozhen Li

  • Connecting the Dots: Inferring Patent Phrase Similarity with Retrieved Phrase Graphs Zhuoyi Peng, Yi Yang

  • Self-Regulated Sample Diversity in Large Language Models Mingyue Liu, Jonathan Frawley, Sarah Wyer, Hubert P. H. Shum, Sara L. Uckelman, Sue Black, Chris G. Willcocks

  • Methods, Applications, and Directions of Learning-to-Rank in NLP Research Justin Lee, Gabriel Bernier-Colborne, Tegan Maharaj, Sowmya Vajjala

  • When Quantization Affects Confidence of Large Language Models? Irina Proskurina, Luc Brun, Guillaume Metzler, Julien Velcin

  • MedCycle: Unpaired Medical Report Generation via Cycle-Consistency Elad Hirsch, Gefen Dawidowicz, Ayellet Tal

  • Beta-LR: Interpretable Logical Reasoning based on Beta Distribution Yizhuo Ma, Ke Qin, Shuang Liang

  • Applications of BERT Models Towards Automation of Clinical Coding in Icelandic Haraldur Orri Hauksson, Hafsteinn Einarsson

  • Tell me who you are and I tell you how you argue: Predicting Stances and Arguments for Stakeholder Groups Philipp Heinisch, Lorik Dumani, Philipp Cimiano, Ralf Schenkel

  • Psychometric Predictive Power of Large Language Models Tatsuki Kuribayashi, Yohei Oseki, Timothy Baldwin

  • Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions Pouya Pezeshkpour, Estevam Hruschka

  • PEEB: Part-based Image Classifiers with an Explainable and Editable Language Bottleneck Thang M. Pham, Peijie Chen, Tin Nguyen, Seunghyun Yoon, Trung Bui, Anh Nguyen

  • Ethos: Rectifying Language Models in Orthogonal Parameter Space Lei Gao, Yue Niu, Tingting Tang, Salman Avestimehr, Murali Annavaram

  • Crafting In-context Examples according to LMs’ Parametric Knowledge Yoonsang Lee, Pranav Atreya, Xi Ye, Eunsol Choi

  • ICXML: An In-Context Learning Framework for Zero-Shot Extreme Multi-Label Classification Yaxin Zhu, Hamed Zamani

  • CLGSI: A Multimodal Sentiment Analysis Framework based on Contrastive Learning Guided by Sentiment Intensity Yang Yang, Xunde Dong, Yupeng Qiang

  • Interpreting Answers to Yes-No Questions in Dialogues from Multiple Domains Zijie Wang, Farzana Rashid, Eduardo Blanco

  • Enhancing Perception: Refining Explanations of News Claims with LLM Conversations Yi-Li Hsu, Jui-Ning Chen, Yang Fan Chiang, Shang-Chien Liu, Aiping Xiong, Lun-Wei Ku

  • How Interpretable are Reasoning Explanations from Prompting Large Language Models? Yeo Wei Jie, Ranjan Satapathy, Rick Siow Mong Goh, Erik Cambria

  • Plug-in Language Model: Controlling Text Generation with a Simple Regression Model Nai-Chi Yang, Wei-Yun Ma, Pu-Jen Cheng

  • Signer Diversity-driven Data Augmentation for Signer-Independent Sign Language Translation Honghao Fu, Liang Zhang, Biao Fu, Rui Zhao, Jinsong Su, Xiaodong Shi, Yidong Chen

  • A Systematic Analysis of Subwords and Cross-Lingual Transfer in Multilingual Translation Francois Meyer, Jan Buys

  • Multi-Granularity Guided Fusion-in-Decoder Eunseong Choi, Hyeri Lee, Jongwuk Lee

  • Group Fairness in Multilingual Speech Recognition Models Anna Katrine van Zee, Marc van Zee, Anders Søgaard

  • Rethinking Machine Ethics – Can LLMs Perform Moral Reasoning through the Lens of Moral Theories? Jingyan Zhou, Minda Hu, Junan Li, Xiaoying Zhang, Xixin Wu, Irwin King, Helen M. Meng

  • Role Prompting Guided Domain Adaptation with General Capability Preserve for Large Language Models Rui Wang, Fei Mi, Yi Chen, Boyang Xue, Hongru Wang, Qi Zhu, Kam-Fai Wong, Ruifeng Xu

  • BERTweet’s TACO Fiesta: Contrasting Flavors On The Path Of Inference And Information-Driven Argument Mining On Twitter Marc Feger, Stefan Dietze

  • Testing the limits of logical reasoning in neural and hybrid models Manuel Vargas Guzman, Jakub Szymanik, Maciej Malicki

  • METAL: Towards Multilingual Meta-Evaluation Rishav Hada, Varun Gumma, Mohamed Ahmed, Kalika Bali, Sunayana Sitaram

  • AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models Wanjun Zhong, Ruixiang Cui, Yiduo Guo, Yaobo Liang, Shuai Lu, Yanlin Wang, Amin Saied, Weizhu Chen, Nan Duan

  • Product Description and QA Assisted Self-Supervised Opinion Summarization Tejpalsingh Siledar, Rupasai Rangaraju, Sankara Sri Raghava Ravindra Muddu, Suman Banerjee, Amey Patil, Sudhanshu Shekhar Singh, Muthusamy Chelliah, Nikesh Garera, Swaprava Nath, Pushpak Bhattacharyya

  • COMEM: In-Context Retrieval-Augmented Mass-Editing Memory in Large Language Models Shanbao Qiao, Xuebing Liu, Seung-Hoon Na

  • Content-Specific Humorous Image Captioning Using Incongruity Resolution Chain-of-Thought Kohtaro Tanaka, Kohei Uehara, Lin Gu, Yusuke Mukuta, Tatsuya Harada

  • Denoising Attention for Query-aware User Modeling Elias Bassani, Pranav Kasela, Gabriella Pasi

  • A Lightweight Mixture-of-Experts Neural Machine Translation Model With Stage-wise Training Strategy Fan Zhang, Mei Tu, Song Liu, Jinyao Yan

  • BEAR: A Unified Framework for Evaluating Relational Knowledge in Causal and Masked Language Models Jacek Wiland, Max Ploner, Alan Akbik

  • Conformal Intent Classification and Clarification for Fast and Accurate Intent Recognition Floris den Hengst, Ralf Wolter, Patrick Altmeyer, Arda Kaygan

  • Anonymity at Risk? Assessing Re-Identification Capabilities of Large Language Models in Court Decisions Alex Nyffenegger, Matthias Stürmer, Joel Niklaus

  • X-LLaVA: Optimizing Bilingual Large Vision-Language Alignment DongJae Shin, HyeonSeok Lim, Inho Won, ChangSu Choi, Minjun Kim, SeungWoo Song, HanGyeol Yoo, SangMin Kim, KyungTae Lim

  • Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models against Counterfactual Noise Giwon Hong, Jeonghwan Kim, Junmo Kang, Sung-Hyon Myaeng, Joyce Jiyoung Whang

  • Heterogeneity over Homogeneity: Investigating Multilingual Speech Pre-Trained Models for Detecting Audio Deepfake Orchid Chetia Phukan, Gautam Siddharth Kashyap, Arun Balaji Buduru, Rajesh Sharma

  • Identifying Self-Disclosures of Use, Misuse and Addiction in Community-based Social Media Posts Chenghao Yang, Tuhin Chakrabarty, Karli R Hochstatter, Melissa N Slavin, Nabila El-Bassel, Smaranda Muresan

  • Self-Adaptive Sampling for Accurate Video Question Answering on Image Text Models Wei Han, Hui Chen, Min-Yen Kan, Soujanya Poria

  • Towards an On-device Agent for Text Rewriting Yun Zhu, Yinxiao Liu, Felix Stahlberg, Shankar Kumar, Yu-Hui Chen, Liangchen Luo, Lei Shu, Renjie Liu, Jindong Chen, Lei Meng

  • Tailoring Vaccine Messaging with Common-Ground Opinions Rickard Stureborg, Sanxing Chen, Roy Xie, Aayushi Kunjal Patel, Christopher Li, Chloe Zhu, Tingnan Hu, Jun Yang, Bhuwan Dhingra

  • Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification Robert Vacareanu, Fahmida Alam, Md Asiful Islam, Haris Riaz, Mihai Surdeanu

  • Q-Tuning: Queue-based Prompt Tuning for Lifelong Few-shot Language Learning Yanhui Guo, Shaoyuan Xu, Jinmiao Fu, Jia Liu, Chaosheng Dong, Bryan Wang

  • In-Context Example Ordering Guided by Label Distributions Zhichao Xu, Daniel Cohen, Bei Wang, Vivek Srikumar

  • Beyond Surface Similarity: Detecting Subtle Semantic Shifts in Financial Narratives Jiaxin Liu, Yi Yang, Kar Yan Tam

  • Laying Anchors: Semantically Priming Numerals in Language Modeling Mandar Sharma, Rutuja Taware, Pravesh Koirala, Nikhil Muralidhar, Naren Ramakrishnan

  • UEGP: Unified Expert-Guided Pre-training for Knowledge Rekindle Yutao Mou, Kexiang Wang, Jianhe Lin, Dehong Ma, Jun Fan, Daiting Shi, Zhicong Cheng, Gu Simiu, Dawei Yin, Weiran Xu

  • LatticeGen: Hiding Generated Text in a Lattice for Privacy-Aware Large Language Model Generation on Cloud Mengke Zhang, Tianxing He, Tianle Wang, Lu Mi, Niloofar Mireshghallah, Binyi Chen, Hao Wang, Yulia Tsvetkov

  • HateModerate: Testing Hate Speech Detectors against Content Moderation Policies Jiangrui Zheng, Xueqing Liu, Mirazul Haque, Xing Qian, Guanqun Yang, Wei Yang

  • Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other Yifei Gao, Jie Ou, Lei Wang, Yuting Xiao, Xiang Zhiyuan, Dai Ruiting, Jun Cheng

  • Contrastive Preference Learning for Neural Machine Translation Jianfei He, Shichao Sun, Sen Peng, Jie Xu, Xiaohua Jia, Wenjie Li

  • SocREval: Large Language Models with the Socratic Method for Reference-free Reasoning Evaluation Hangfeng He, Hongming Zhang, Dan Roth

  • Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis Wenhao Zhu, Hongyi Liu, Qingxiu Dong, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li

  • Unleashing the Power of LLMs in Court View Generation by Stimulating Internal Knowledge and Incorporating External Knowledge Yifei Liu, Yiquan Wu, Ang Li, Yating Zhang, Changlong Sun, Weiming Lu, Fei Wu, Kun Kuang

  • Prompting Vision-Language Models For Aspect-Controlled Generation of Referring Expressions Danfeng Guo, Sanchit Agarwal, Arpit Gupta, Jiun-Yu Kao, Emre Barut, Tagyoung Chung, Jing Huang, Mohit Bansal

  • Task-Agnostic Detector for Insertion-Based Backdoor Attacks Weimin Lyu, Xiao Lin, Songzhu Zheng, Lu Pang, Haibin Ling, Susmit Jha, Chao Chen

  • Uncertainty Estimation on Sequential Labeling via Uncertainty Transmission Jianfeng He, Linlin Yu, Shuo Lei, Chang-Tien Lu, Feng Chen

  • Exploring Language Model’s Code Generation Ability with Auxiliary Functions Seonghyeon Lee, Sanghwan Jang, Seongbo Jang, Dongha Lee, Hwanjo Yu

  • Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models Sang T. Truong, Duc Quang Nguyen, Toan Nguyen, Dong Le, Nhi Ngoc Truong, Tho Quan, Sanmi Koyejo

  • GoT: Effective Graph-of-Thought Reasoning in Language Models Yao Yao, Zuchao Li, Hai Zhao

  • Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning Qinhao Zhou, Zihan Zhang, Xiang Xiang, Ke Wang, Yuchuan Wu, Yongbin Li

  • MuMath: Multi-perspective Data Augmentation for Mathematical Reasoning in Large Language Models Weihao You, Shuo Yin, Xudong Zhao, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai

  • Tram: A Token-level Retrieval-augmented Mechanism for Source Code Summarization Tong Ye, Lingfei Wu, Tengfei Ma, Xuhong Zhang, Yangkai Du, Peiyu Liu, Shouling Ji, Wenhai Wang

  • UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking Chuang Li, Yan Zhang, Min-Yen Kan, Haizhou Li

  • Evaluating Step-by-Step Reasoning through Symbolic Verification YiFan Zhang, Hanlin Zhang, Li Erran Li, Eric P. Xing

  • Multi-Review Fusion-in-Context Aviv Slobodkin, Ori Shapira, Ran Levy, Ido Dagan

  • Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison Maxime Bouthors, Josep Crego, François Yvon

  • Extending Input Contexts of Language Models through Training on Segmented Sequences Petros Karypis, Julian McAuley, George Karypis

  • Reason from Fallacy: Enhancing Large Language Models’ Logical Reasoning through Logical Fallacy Understanding Yanda Li, Dixuan Wang, Jiaqing Liang, Guochao Jiang, Qianyu He, Yanghua Xiao, Deqing Yang

  • Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models Wanyong Feng, Jaewook Lee, Hunter McNichols, Alexander Scarlatos, Digory Smith, Simon Woodhead, Nancy Otero Ornelas, Andrew Lan

  • Aspect-based Sentiment Analysis with Context Denoising Yuanhe Tian, Chang Liu, Yan Song, Fei Xia, Yongdong Zhang

  • IruMozhi: Automatically classifying diglossia in Tamil Kabilan Prasanna, Aryaman Arora

  • RENOVI: A Benchmark Towards Remediating Norm Violations in Socio-Cultural Conversations Haolan Zhan, Zhuang Li, Xiaoxi Kang, Tao Feng, Yuncheng Hua, Lizhen Qu, Yi Ying, Mei Rianto Chandra, Kelly Rosalin, Jureynolds Jureynolds, Suraj Sharma, Shilin Qu, Linhao Luo, Ingrid Zukerman, Lay-Ki Soon, Zhaleh Semnani Azad, Reza Haf

  • Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking Hong Jin Kang, Fabrice Y Harel-Canada, Muhammad Ali Gulzar, Nanyun Peng, Miryung Kim

  • COMMIT: Code-Mixing English-Centric Large Language Model for Multilingual Instruction Tuning Jaeseong Lee, YeonJoon Jung, Seung-won Hwang

  • DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation Aru Maekawa, Satoshi Kosugi, Kotaro Funakoshi, Manabu Okumura

  • MindAgent: Emergent Gaming Interaction Ran Gong, Qiuyuan Huang, Xiaojian Ma, Hoi Vo, Zane Durante, Yusuke Noda, Zilong Zheng, Demetri Terzopoulos, Li Fei-Fei, Jianfeng Gao

  • BotChat: Evaluating LLMs’ Capabilities of Having Multi-Turn Dialogues Haodong Duan, Jueqi Wei, Chonghua Wang, Hongwei Liu, Yixiao Fang, Songyang Zhang, Dahua Lin, Kai Chen

  • Learning Mutually Informed Representations for Characters and Subwords Yilin Wang, Xinyi Hu, Matthew R. Gormley

  • A Novel Two-step Fine-tuning Framework for Transfer Learning in Low-Resource Neural Machine Translation Yuan Gao, Feng Hou, Ruili Wang

  • Enhancing Cross-lingual Sentence Embedding for Low-resource Languages with Word Alignment Zhongtao Miao, Qiyu Wu, Kaiyan Zhao, Zilong Wu, Yoshimasa Tsuruoka

  • C$^{3}$LPGCN: Integrating Contrastive Learning and Cooperative Learning with Prompt into Graph Convolutional Network for Aspect-based Sentiment Analysis Ye He, Shihao Zou, Yuzhe Chen, Xianying Huang

  • Visual Enhanced Entity-Level Interaction Network for Multimodal Summarization Haolong Yan, Binghao Tang, Boda Lin, Gang Zhao, Si Li

  • Knowledgeable In-Context Tuning: Exploring and Exploiting Factual Knowledge for In-Context Learning Jianing Wang, Chengyu Wang, Chuanqi Tan, Jun Huang, Ming Gao

  • Time Machine GPT Felix Drinkall, Eghbal Rahimikia, Janet B. Pierrehumbert, Stefan Zohren

  • An End-to-End Submodular Framework for Data-Efficient In-Context Learning Lilly Kumari, Shengjie Wang, Arnav Mohanty Das, Tianyi Zhou, Jeff Bilmes

  • Teaching Llama a New Language Through Cross-Lingual Knowledge Transfer Hele-Andra Kuulmets, Taido Purason, Agnes Luhtaru, Mark Fishel

  • Simulating Opinion Dynamics with Networks of LLM-based Agents Yun-Shiuan Chuang, Agam Goyal, Nikunj Harlalka, Siddharth Suresh, Robert D. Hawkins, Sijia Yang, Dhavan V. Shah, Junjie Hu, Timothy T. Rogers

  • Probing the Category of Verbal Aspect in Transformer Language Models Anisia Katinskaia, Roman Yangarber

  • A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets Tanja Samardzic, Ximena Gutierrez, Christian Bentz, Steven Moran, Olga Pelloni

  • Beyond Read-Only: Crafting a Comprehensive Chinese Text-to-SQL Dataset for Database Manipulation and Query Xi Chen, Jinguo You, Likun, Xiang Li

  • Normalizing without Modernizing: Keeping Historical Wordforms of Middle French while Reducing Spelling Variants Raphael Rubino, Johanna Gerlach, Jonathan David Mutal, Pierrette Bouillon

  • Anti-LM Decoding for Zero-shot In-context Machine Translation Suzanna Sia, Alexandra DeLucia, Kevin Duh

  • Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning Shuai Zhao, Leilei Gan, Anh Tuan Luu, Jie Fu, Lingjuan Lyu, Meihuizi Jia, Jinming Wen

  • Select and Summarize: Scene Saliency for Movie Script Summarization Rohit Saxena, Frank Keller

  • Don’t be a Fool: Pooling Strategies in Offensive Language Detection from User-Intended Adversarial Attacks Seunguk Yu, Juhwan Choi, YoungBin Kim

  • Z-GMOT: Zero-shot Generic Multiple Object Tracking Kim Hoang Tran, Anh Duy Le Dinh, Tien-Phat Nguyen, Thinh Phan, Pha Nguyen, Khoa Luu, Donald Adjeroh, Gianfranco Doretto, Ngan Hoang Le

  • NLP for Counterspeech against Hate: A Survey and How-To Guide Helena Bonaldi, Yi-Ling Chung, Gavin Abercrombie, Marco Guerini

  • PRODIGy: a PROfile-based DIalogue Generation dataset Daniela Occhipinti, Serra Sinem Tekiroglu, Marco Guerini

  • WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models Piotr Molenda, Adian Liusie, Mark Gales

  • Cognitive Overload: Jailbreaking Large Language Models with Overloaded Logical Thinking Nan Xu, Fei Wang, Ben Zhou, Bangzheng Li, Chaowei Xiao, Muhao Chen

  • PAELLA: Parameter-Efficient Lightweight Language-Agnostic Captioning Model Rita Ramos, Emanuele Bugliarello, Bruno Martins, Desmond Elliott

  • OSCaR: Object State Captioning and State Change Representation Nguyen Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu

  • SumCSE: Summary as a transformation for Contrastive Learning Raghuveer Thirukovalluru, Xiaolan Wang, Jun Chen, Shuyang Li, Jie Lei, Rong Jin, Bhuwan Dhingra

  • The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text Yanzhu Guo, Guokan Shang, Michalis Vazirgiannis, Chloé Clavel

  • PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits Hang Jiang, Xiajie Zhang, Xubo Cao, Cynthia Breazeal, Deb Roy, Jad Kabbara

  • MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts Xuxin Cheng, Zhihong Zhu, Xianwei Zhuang, Zhanpeng Chen, Zhiqi Huang, Yuexian Zou

  • FIRE: A Dataset for Financial Relation Extraction Hassan Hamad, Abhinav Kumar Thakur, Nijil Kolleri, Sujith Pulikodan, Keith M. Chugg

  • MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response Zihao Deng, Yinghao Ma, Yudong Liu, Rongchen Guo, Ge Zhang, Wenhu Chen, Wenhao Huang, Emmanouil Benetos

  • Investigating Acceleration of LLaMA Inference by Enabling Intermediate Layer Decoding via Instruction Tuning with ‘LITE’ Neeraj Varshney, Agneet Chatterjee, Mihir Parmar, Chitta Baral

  • Instruction-following Evaluation through Verbalizer Manipulation Shiyang Li, Jun Yan, Hai Wang, Zheng Tang, Xiang Ren, Vijay Srinivasan, Hongxia Jin

  • WebWISE: Unlocking Web Interface Control for LLMs via Sequential Exploration Heyi Tao, Sethuraman T V, Michal Shlapentokh-Rothman, Tanmay Gupta, Heng Ji, Derek Hoiem

  • CodecLM: Aligning Language Models with Tailored Synthetic Data Zifeng Wang, Chun-Liang Li, Vincent Perot, Long Le, Jin Miao, Zizhao Zhang, Chen-Yu Lee, Tomas Pfister

  • Prompting Few-shot Multi-hop Question Generation via Comprehending Type-aware Semantics Zefeng Lin, Weidong Chen, Yan Song, Yongdong Zhang

  • When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models Yanhong Li, Chenghao Yang, Allyson Ettinger

  • CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP Chandra Kiran Reddy Evuru, Sreyan Ghosh, Sonal Kumar, Ramaneswaran S, Utkarsh Tyagi, Dinesh Manocha

  • Synonym relations affect object detection learned on vision-language data Giacomo Nebbia, Adriana Kovashka

  • CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models Xiang Li, Fan Bu, Ambuj Mehrish, Yingting Li, Jiale Han, Bo Cheng, Soujanya Poria

  • RobustSentEmbed: Robust Sentence Embeddings Using Adversarial Self-Supervised Contrastive Learning Javad Rafiei Asl, Prajwal Panzade, Eduardo Blanco, Daniel Takabi, Zhipeng Cai

  • Characterizing Human and Zero-Shot GPT-3.5 Object-Similarity Judgments D Estes McKnight, Alona Fyshe

  • Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models Wei He, Shichun Liu, Jun Zhao, Yiwen Ding, Yi Lu, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang

  • Getting Sick After Seeing a Doctor? Diagnosing and Mitigating Knowledge Conflicts in Event Temporal Reasoning Tianqing Fang, Zhaowei Wang, Wenxuan Zhou, Hongming Zhang, Yangqiu Song, Muhao Chen

  • MCECR: A Novel Dataset for Multilingual Cross-Document Event Coreference Resolution Amir Pouran Ben Veyseh, Viet Dac Lai, Chien Van Nguyen, Franck Dernoncourt, Thien Huu Nguyen

  • Sentiment Analysis in the Era of Large Language Models: A Reality Check Wenxuan Zhang, Yue Deng, Bing Liu, Sinno Jialin Pan, Lidong Bing

  • Tokenizer Choice For LLM Training: Negligible or Crucial? Mehdi Ali, Michael Fromm, Klaudia Thellmann, Richard Rutmann, Max Lübbering, Johannes Leveling, Katrin Klug, Jan Ebert, Niclas Doll, Jasper Schulze Buschhoff, Charvi Jain, Alexander Arno Weber, Lena Jurkschat, Hammam Abdelwahab, Chelsea Maria John, Pedro Ortiz Suarez, Malte Ostendorff, Samuel Weinbach, Rafet Sifa, Stefan Kesselheim, Nicolas Flores-Herr

  • Think Before You Speak: Cultivating Communication Skills of Large Language Models via Inner Monologue Junkai Zhou, Liang Pang, Huawei Shen, Xueqi Cheng

  • The Impact of Differential Privacy on Group Disparity Mitigation Victor Petrén Bach Hansen, Atula Tejaswi Neerkaje, Ramit Sawhney, Lucie Flek, Anders Søgaard

  • Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning Shivam Ratnakant Mhaskar, Nirmesh J. Shah, Mohammadi Zaki, Ashishkumar Prabhakar Gudmalwar, Pankaj Wasnik, Rajiv Ratn Shah

  • Read between the lines - Functionality Extraction From READMEs Prince Kumar, Srikanth G. Tamilselvam, Dinesh Garg

  • AbsPyramid: Benchmarking the Abstraction Ability of Language Models with a Unified Entailment Graph Zhaowei Wang, Haochen Shi, Weiqi Wang, Tianqing Fang, Hongming Zhang, Sehyun Choi, Xin Liu, Yangqiu Song

  • Few-TK: A Dataset for Few-shot Scientific Typed Keyphrase Recognition Avishek Lahiri, Pratyay Sarkar, Medha Sen, Debarshi Kumar Sanyal, Imon Mukherjee

  • Language Models can be Deductive Solvers Jiazhan Feng, Ruochen Xu, Junheng Hao, Hiteshi Sharma, Yelong Shen, Dongyan Zhao, Weizhu Chen

  • Interpreting User Requests in the Context of Natural Language Standing Instructions Nikita Moghe, Patrick Xia, Jacob Andreas, Jason Eisner, Benjamin Van Durme, Harsh Jhamtani

  • Secure Your Model: An Effective Key Prompt Protection Mechanism for Large Language Models Ruixiang Tang, Yu-Neng Chuang, Xuanting Cai, Mengnan Du, Xia Hu

  • Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models Jiashuo Sun, Yi Luo, Yeyun Gong, Chen Lin, Yelong Shen, Jian Guo, Nan Duan

  • Do Prompt Positions Really Matter? Junyu Mao, Stuart E. Middleton, Mahesan Niranjan

  • Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Yoon Kim, Xixin Wu, Helen M. Meng, James R. Glass

  • A Study on Scaling Up Multilingual News Framing Analysis Syeda Sabrina Akter, Antonios Anastasopoulos

  • ViGLUE: A Vietnamese General Language Understanding Benchmark and Analysis of Vietnamese Language Models Minh-Nam Tran, Nguyễn Phú Vinh, Long HB Nguyen, Dien Dinh

  • Exploring the Trade-off Between Model Performance and Explanation Plausibility of Text Classifiers Using Human Rationales Lucas Emanuel Resck, Marcos M. Raimundo, Jorge Poco

  • Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation Tong Su, Xin Peng, Sarubi Thillainathan, David Guzmán, Surangika Ranathunga, En-Shiun Annie Lee

  • ADaPT: As-Needed Decomposition and Planning with Language Models Archiki Prasad, Alexander Koller, Mareike Hartmann, Peter Clark, Ashish Sabharwal, Mohit Bansal, Tushar Khot

  • Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations Dayeon Ki, Marine Carpuat

  • Non-contrastive sentence representations via self-supervision Duccio Pappadopulo, Marco Farina

  • Semantically-Prompted Language Models Improve Visual Descriptions Michael Ogezi, Bradley Hauer, Grzegorz Kondrak

  • GenTKG: Generative Forecasting on Temporal Knowledge Graph Ruotong Liao, Xu Jia, Yunpu Ma, Yangzhe Li, Volker Tresp

  • A Transformer Language Model with Stack Attention Jiaoda Li, Jennifer C. White, Mrinmaya Sachan, Ryan Cotterell

  • InstructEval: Systematic Evaluation of Instruction Selection Methods Anirudh Ajith, Chris Pan, Mengzhou Xia, Ameet Deshpande, Karthik R Narasimhan

  • RecMind: Large Language Model Powered Agent For Recommendation Yancheng Wang, Ziyan Jiang, Zheng Chen, Fan Yang, Yingxue Zhou, Eunah Cho, Xing Fan, Yanbin Lu, Xiaojiang Huang, Yingzhen Yang

  • GOLD: Generalized Knowledge Distillation via Out-of-Distribution-Guided Language Data Generation Mohsen Gholami, Mohammad Akbari, Tianxi Hu, Vaden Masrani, Z. Jane Wang, Yong Zhang

  • How Lexical is Bilingual Lexicon Induction? Harsh Kohli, Helian Feng, Nicholas Andrew Dronen, Calvin McCarter, Sina Moeini, Ali Kebarighotbi

  • Fumbling in Babel: An Investigation into ChatGPT’s Language Identification Ability Wei-Rui Chen, Ife Adebara, Khai Duy Doan, Qisheng Liao, Muhammad Abdul-Mageed

  • Targeted Augmentation for Low-Resource Event Extraction Sijia Wang, Lifu Huang

  • Asking More Informative Questions for Grounded Retrieval Sedrick Scott Keh, Justin T Chiu, Daniel Fried

  • Efficient Citer: Tuning Large Language Models for Enhanced Answer Quality and Verification Marzieh S. Tahaei, Aref Jafari, Ahmad Rashid, David Alfonso-Hermelo, Khalil Bibi, Yimeng Wu, Ali Ghodsi, Boxing Chen, Mehdi Rezagholizadeh

  • Addressing Healthcare-related Racial and LGBTQ+ Biases in Pretrained Language Models Sean Xie, Saeed Hassanpour, Soroush Vosoughi

  • ATG: Benchmarking Automated Theorem Generation for Generative Language Models Xiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang

  • Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization Yixin Liu, Alexander Fabbri, Jiawen Chen, Yilun Zhao, Simeng Han, Shafiq Joty, Pengfei Liu, Dragomir Radev, Chien-Sheng Wu, Arman Cohan

  • NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge Phillip Howard, Junlin Wang, Vasudev Lal, Gadi Singer, Yejin Choi, Swabha Swayamdipta

  • Emotion-Anchored Contrastive Learning Framework for Emotion Recognition in Conversation Fangxu Yu, Junjie Guo, Zhen Wu, Xinyu Dai

  • SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models Shicheng Liu, Jialiang Xu, Wesley Tjangnaka, Sina Semnani, Chen Jie Yu, Monica Lam

  • On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering Linyong Nan, Ellen Zhang, Weijin Zou, Yilun Zhao, Wenfei Zhou, Arman Cohan

  • CARE: Extracting Experimental Findings From Clinical Literature Aakanksha Naik, Bailey Kuehl, Erin Bransom, Doug Downey, Tom Hope

  • Personalized Federated Learning for Text Classification with Gradient-Free Prompt Tuning Rui Wang, Tong Yu, Ruiyi Zhang, Sungchul Kim, Ryan A. Rossi, Handong Zhao, Junda Wu, Subrata Mitra, Lina Yao, Ricardo Henao

  • SGSH: Stimulate Large Language Models with Skeleton Heuristics for Knowledge Base Question Generation Shasha Guo, Lizi Liao, Jing Zhang, Yanling Wang, Cuiping Li, Hong Chen

  • Biomedical Entity Representation with Graph-Augmented Multi-Objective Transformer Andrey Sakhovskiy, Natalia Semenova, Artur Kadurin, Elena Tutubalina

  • Cross-Lingual Summarization with Pseudo-Label Regularization Thang Le

  • On the Way to Gentle AI Counselor: Politeness Cause Elicitation and Intensity Tagging in Code-mixed Hinglish Conversations for Social Good Priyanshu Priya, Gopendra Vikram Singh, Mauajama Firdaus, Jyotsna Agrawal, Asif Ekbal

  • Leveraging Summarization for Unsupervised Dialogue Topic Segmentation Aleksei Artemiev, Daniil Parinov, Alexey Grishanov, Borisov Ivan, Alexey Vasilev, Daniil Muravetskii, Aleksey Rezvykh, Aleksei Goncharov, Andrey Savchenko

  • LLaMA-Rider: Spurring Large Language Models to Explore the Open World Yicheng Feng, Yuxuan Wang, Jiazheng Liu, Sipeng Zheng, Zongqing Lu

  • Contrastive Learning as a Polarizer: Mitigating Gender Bias by Fair and Biased sentences Kyungmin Park, Sihyun Oh, Daehyun Kim, Juae Kim

  • PoLLMgraph: Unraveling Hallucinations in Large Language Models via State Transition Dynamics Derui Zhu, Dingfan Chen, Qing Li, Zongxiong Chen, Lei Ma, Jens Grossklags, Mario Fritz

  • Improving Open-Domain Health Question Answering with Evidence Quality Assessment Juraj Vladika, Florian Matthes

  • DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers Anna Langedijk, Hosein Mohebbi, Gabriele Sarti, Willem Zuidema, Jaap Jumelet