Akari Asai

Ph.D. student @ Paul G. Allen School of Computer Science & Engineering, University of Washington
Visiting Student Researcher @ Meta AI


I am in the final year of my Ph.D. in NLP at the Paul G. Allen School of Computer Science & Engineering, University of Washington, where I am fortunate to be advised by Prof. Hannaneh Hajishirzi. I am also a visiting student researcher at Meta AI Research, under the supervision of Dr. Wen-tau Yih. Prior to joining UW, I obtained a B.E. in Electrical Engineering and Computer Science from The University of Tokyo, Japan.

I am on the academic job market this year! Please feel free to reach out if you’d like to discuss opportunities. I am attending NeurIPS!

My research focuses on natural language processing and machine learning, with a particular emphasis on large language models (LLMs). I investigate core limitations of LLMs, such as hallucinations, that cannot be overcome by scaling alone. To address these challenges, I pioneered Retrieval-Augmented LMs, a novel class of LLMs that integrate large-scale text data via retrieval during inference.

My work has received multiple paper awards at venues such as ACL and a NeurIPS workshop, and has been featured in major media outlets such as Forbes and MIT Technology Review. I’m honored to be named among the EECS Rising Stars (2022), the IBM Global Ph.D. Fellows (2022-2023), and the MIT Technology Review Innovators Under 35 from Japan (2024). My work is now integrated into major libraries such as Hugging Face, LlamaIndex, and LangChain, and is used in multiple real-world systems, such as COVID-19 Research Search. Most recently, we released the Ai2 OpenScholar Public Demo, which helps scientists synthesize scientific literature more effectively and efficiently.

I am also passionate about teaching and mentoring, and about helping students learn to do research, especially students from underrepresented groups. I have been the Head TA for CSE473: Intro to AI (undergrad) and CSE599J: Data-centric ML (graduate) at UW. To reduce barriers to starting research or pursuing a Ph.D. in this area, I host weekly office hours open to everyone (please sign up via Calendly!), and I was a mentor for the UW CSE Ph.D. Pre-Application Mentorship Service (PAMS).

news

Nov 18, 2024 I’m SUPER excited to release OpenScholar, my latest collaboration with amazing co-authors from UW, Ai2, Meta, CMU, Stanford, UIUC, and UNC. Try out our public demo, and learn more about the project in the paper and Ai2 blog.
Oct 31, 2024 I’m honored to be chosen as one of the MIT Technology Review Innovators Under 35 from Japan! See the MIT Technology Review article about my work on retrieval-augmented LMs for building more reliable LM-based systems.
Oct 22, 2024 We released Pangea, a new state-of-the-art multilingual and multimodal LLM! Check out our demo!
Sep 25, 2024 Our paper on scaling retrieval datastores has been accepted at NeurIPS!
Sep 19, 2024 CopyBench has been accepted at EMNLP as a main conference paper!

selected publications

See my full publications at the publication page!

  1. OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs
    Akari Asai, Jacqueline He, Rulin Shao, Weijia Shi, Amanpreet Singh, and 20 more authors
    arXiv preprint, 2024
  2. Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
    Rulin Shao, Jacqueline He, Akari Asai, Weijia Shi, Tim Dettmers, and 3 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), 2024
  3. Fine-grained Hallucination Detection and Editing for Language Models
    Abhika Mishra, Akari Asai, Yizhong Wang, Vidhisha Balachandran, Graham Neubig, and 2 more authors
    In Conference on Language Modeling (COLM), 2024
  4. Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
    Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi
    In The Twelfth International Conference on Learning Representations (ICLR; Oral, Top 1%), 2024
  5. Reliable, Adaptable, and Attributable Language Models with Retrieval
    Akari Asai, Zexuan Zhong, Danqi Chen, Pang Wei Koh, Luke Zettlemoyer, and 2 more authors
    arXiv preprint, 2024
  6. When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories
    Alex Mallen*, Akari Asai*, Victor Zhong, Rajarshi Das, Daniel Khashabi, and 1 more author
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL; Oral, Best Video Paper Award – Most Viewed), 2023
  7. Task-aware Retrieval with Instructions
    Akari Asai, Timo Schick, Patrick Lewis, Xilun Chen, Gautier Izacard, and 3 more authors
    In Findings of the Association for Computational Linguistics: ACL 2023 (Findings Spotlight), 2023
  8. Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks
    Akari Asai, Matt Gardner, and Hannaneh Hajishirzi
    In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL; Oral), 2022
  9. One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval
    Akari Asai, Xinyan Yu, Jungo Kasai, and Hanna Hajishirzi
    In Advances in Neural Information Processing Systems (NeurIPS), 2021
  10. XOR QA: Cross-lingual Open-Retrieval Question Answering
    Akari Asai, Jungo Kasai, Jonathan Clark, Kenton Lee, Eunsol Choi, and 1 more author
    In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL; Oral), 2021
  11. LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
    Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, and Yuji Matsumoto
    In Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
  12. Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering
    Akari Asai, Kazuma Hashimoto, Hannaneh Hajishirzi, Richard Socher, and Caiming Xiong
    In International Conference on Learning Representations (ICLR), 2020