S Ghosh, Yoon Kim, Ramón Fernández Astudillo, Tahira Naseem, Jacob Andreas
2023
412
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
Zhen Wang, Rameswar Panda, Leonid Karlinsky, Rogério Feris, Huan Sun, Yoon Kim
arXiv (Cornell University), 2023
413
Grammar Prompting for Domain-Specific Language Generation with Large Language Models
Bailin Wang, Zi Wang, Xuezhi Wang, Yuan Cao, Rif A. Saurous, Yoon Kim
arXiv (Cornell University), 2023
414
Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models
Sarah Zhang, Samuel Florin, Ariel N. Lee, Eamon Niknafs, Andrei Marginean, Annie Wang, Keith Tyser, Zad Chin, Yann Hicke, Nikhil Singh, Madeleine Udell, Yoon Kim, Tonio Buonassisi, Armando Solar-Lezama, Iddo Drori
arXiv (Cornell University), 2023
415
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Yung-Sung Chuang, Yujia Xie, Hongyin Luo, Yoon Kim, James Glass, Pengcheng He
arXiv (Cornell University), 2023
416
Learning to Grow Pretrained Models for Efficient Transformer Training
Peihao Wang, Rameswar Panda, Lucas Torroba Hennigen, Philip Greengard, Leonid Karlinsky, Rogério Feris, David Cox, Zhangyang Wang, Yoon Kim
arXiv (Cornell University), 2023
417
SAIL: Search-Augmented Instruction Learning
Hongyin Luo, Yung-Sung Chuang, Yuan Gong, Tianhua Zhang, Yoon Kim, Xixin Wu, Danny G. Fox, Helen Meng, James Glass
arXiv (Cornell University), 2023
418
Gated Linear Attention Transformers with Hardware-Efficient Training
Songlin Yang, Bailin Wang, Yikang Shen, Rameswar Panda, Yoon Kim
arXiv (Cornell University), 2023
419
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement