• HPipe: Large Language Model Pipeline Parallelism for Long Context on Heterogeneous Cost-effective Devices

    Ruilong Ma, Xiang Yang, Jingyu Wang, Qi Qi, Haifeng Sun, Jing Wang, Zirui Zhuang, Jianxin Liao

  • Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

    Jie Ou, Yueming Chen, Prof. Wenhong Tian

  • SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

    Sanghoon Kim, Dahyun Kim, Chanjun Park, Wonsung Lee, Wonho Song, Yunsu Kim, Hyeonwoo Kim, Yungi Kim, Hyeonju Lee, Jihoo Kim, Changbae Ahn, Seonghoon Yang, Sukyung Lee, HYUNBYUNG PARK, Gyoungjin Gim, Mikyoung Cha, Hwalsuk Lee, Sunghun Kim

  • UINav: A Practical Approach to Train On-Device Automation Agents

    Wei Li, Fu-Lin Hsu, Will Bishop, Folawiyo Campbell-Ajala, Max Lin, Oriana Riva

  • Efficiently Distilling LLMs for Edge Applications

    Achintya Kundu, Yu Chin Fabian Lim, Aaron Chew, Laura Wynter, Penny Chong, Rhui Dih Lee

  • Modeling and Detecting Company Risks from News

    Jiaxin Pei, Soumya Vadlamannati, Liang-Kang Huang, Daniel Preotiuc-Pietro, Xinyu Hua

  • Multiple-Question Multiple-Answer Text-VQA

    Peng Tang, Srikar Appalaraju, R. Manmatha, Yusheng Xie, Vijay Mahadevan

  • An NLP-Focused Pilot Training Agent for Safe and Efficient Aviation Communication

    Xiaochen Liu, Bowei Zou, AiTi Aw

  • Visual Grounding for User Interfaces

    Yijun Qian, Yujie Lu, Alexander G Hauptmann, Oriana Riva

  • Prompt Tuned Embedding Classification for Industry Sector Allocation

    Valentin Leonhard Buchner, Lele Cao, Jan-Christoph Kalo, Vilhelm von Ehrenheim

  • REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking

    Nacime Bouziani, Shubhi Tyagi, Joseph Fisher, Jens Lehmann, Andrea Pierleoni

  • Conformer-Based Speech Recognition On Extreme Edge-Computing Devices

    Mingbin Xu, Alex Jin, Sicheng Wang, Mu Su, Tim Ng, Henry Mason, Shiyi Han, Zhihong Lei, Yaqiao Deng, Zhen Huang, Mahesh Krishnamoorthy

  • Generating Signed Language Instructions in Large-Scale Dialogue Systems

    Mert Inan, Katherine Atwell, Anthony Sicilia, Lorna Quandt, Malihe Alikhani

  • Leveraging Natural Language Processing and Large Language Models for Assisting Due Diligence in the Legal Domain

    Myeongjun Erik Jang, Gábor Stikkel

  • AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators

    Xingwei He, Zhenghao Lin, Yeyun Gong, A-Long Jin, Hang Zhang, Chen Lin, Jian Jiao, Siu Ming Yiu, Nan Duan, Weizhu Chen

  • An Automatic Prompt Generation System for Tabular Data Tasks

    Ashlesha Akella, Abhijit Manatkar, Brijkumar Chavda, Hima Patel

  • Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data

    Haitham Hammami, Louis Baligand, Bojan Petrovski

  • Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain

    Brian H Hu, Bill Ray, Alice Leung, Amy Summerville, David Joy, Christopher Funk, Arslan Basharat

  • Reducing hallucination in structured outputs via Retrieval-Augmented Generation

    Orlando Marquez Ayala, Patrice Bechard

  • Towards Translating Objective Product Attributes Into Customer Language

    Ram Yazdi, Oren Kalinsky, Alexander Libov, Dafna Shahaf

  • Automating the Generation of a Functional Semantic Types Ontology with Foundational Models

    Sachin G Konan, Larry Rudolph, Scott Affens

  • Leveraging Customer Feedback for Multi-modal Insight Extraction

    Sandeep Sricharan Mukku, Abinesh Kanagarajan, Pushpendu Ghosh, Chetan Aggarwal

  • Optimizing LLM Based Retrieval Augmented Generation Pipelines in the Financial Domain

    Yiyun Zhao, Prateek Singh, Hanoz Bhathena, Bernardo Ramos, Aviral Joshi, Swaroop Gadiyaram, Saket Sharma

  • Scaling Up Authorship Attribution

    Jacob Striebel, Abishek Edikala, Ethan Irby, Alex Rosenfeld, J. Blake Gage, Daniel Dakota, Sandra Kübler

  • Multimodal Contextual Dialogue Breakdown Detection for Conversational AI Models

    Md Messal Monem Miah, Ulie Schnaithmann, Arushi Raghuvanshi, Youngseo Son

  • Deferred NAM: Low-latency Top-K Context Injection via Deferred Context Encoding for Non-Streaming ASR

    Zelin Wu, Gan Song, Christopher Li, Pat Rondon, Zhong Meng, Xavier Velez, Weiran Wang, Diamantino Caseiro, Golan Pundak, Tsendsuren Munkhdalai, Angad Chandorkar, Rohit Prabhavalkar

  • Less is More for Improving Automatic Evaluation of Factual Consistency

    Tong Wang, Ninad Kulkarni, Yanjun Qi

  • DriftWatch: A Tool that Automatically Detects Data Drift and Extracts Representative Examples Affected by Drift

    Myeongjun Erik Jang, Antonios Georgiadis, Yiyun Zhao, Fran Silavong

  • Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls

    Amin Hosseiny Marani, Ulie Schnaithmann, Youngseo Son, Akil Iyer, Manas Paldhe, Arushi Raghuvanshi

  • Leveraging LLMs for Dialogue Quality Measurement

    Jinghan Jia, Abi Komma, Timothy Leffel, Xujun Peng, Ajay Nagesh, Tamer Soliman, Aram Galstyan, Anoop Kumar

  • Uncertainty Estimation in Large Language Models to Support Biodiversity Conservation

    Maria Mora-Cross, Saul Calderon-Ramirez

  • AMA-LSTM: Pioneering Robust and Fair Financial Audio Analysis for Stock Volatility Prediction

    Shengkun Wang, Taoran Ji, Jianfeng He, Mariam Almutairi, Dan Wang, Linhan Wang, Min Zhang, Chang-Tien Lu

  • Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

    Xue-Yong Fu, Md Tahmid Rahman Laskar, Elena Khasanova, Cheng Chen, Shashi Bhushan TN

  • Shears: Unstructured Sparsity with Neural Low-rank Adapter Search

    Juan Pablo Munoz, Jinjie Yuan, Nilesh Jain

  • Tree-of-Question: Structured Retrieval Framework for Korean Question Answering Systems

    Dongyub Lee, Younghun Jeong, Hwa-Yeon Kim, Hongyeon Yu, Seunghyun Han, Taesun Whang, Seungwoo Cho, Chanhee Lee, Gunsu Lee, Youngbum Kim

  • LLM-based Frameworks for API Argument Filling in Task-Oriented Conversational Systems

    Jisoo Mok, Mohammad Kachuee, Shuyang Dai, Shayan Ray, Tara Taghavi, Sungroh Yoon

  • Large Language Models Encode the Practice of Medicine

    Teja Kanchinadam, Gauher Shaheen

  • Leveraging Interesting Facts to Enhance User Engagement with Conversational Interfaces

    Nikhita Vedula, Giuseppe Castellucci, Eugene Agichtein, Oleg Rokhlenko, Shervin Malmasi

  • Search Query Refinement for Japanese Named Entity Recognition in E-commerce Domain

    Yuki Nakayama, Ryutaro Tatsushima, Erick Mendieta, Koji Murakami, Keiji Shinzato

  • EIVEN: Efficient Implicit Attribute Value Extraction using Multimodal LLM

    Henry Peng Zou, Gavin Heqing Yu, Ziwei Fan, Dan Bu, Han Liu, Peng Dai, Dongmei Jia, Cornelia Caragea

  • Exploring the Impact of Table-to-Text Methods on Augmenting LLM-based Question Answering with Domain Hybrid Data

    Dehai Min, Nan Hu, Rihui Jin, Nuo Lin, Jiaoyan Chen, Yongrui Chen, Yu Li, Guilin Qi, Yun Li, Nijun Li, Qianren Wang

  • Solving General Natural-Language-Description Optimization Problems with Large Language Models

    Jihai Zhang, Wei Wang, Siyan Guo, Li Wang, Fangquan Lin, Cheng Yang, Wotao Yin

  • Self-Regulated Data-Free Knowledge Amalgamation for Text Classification

    Prashanth Vijayaraghavan, Hongzhi Wang, Luyao Shi, Tyler Baldwin, David Beymer, Ehsan Degan