Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Page not found. Your pixels are in another canvas.
About me
This is a page not in th emain menu
Published:
Is LoRA an adapter? Historically no. But should we call it one?
Random Forests are often the textbook example of highly parallel algorithms that are unsuitable for GPUs. In this project, we propose and evaluate our CUDA implementation of Extremely Randomized Trees (or ExtraTrees), a variant of Random Forests which are more amenable to GPU-driven parallel programming.
[Code] [Report]
Structured Nested Dictionaries. This module provides extensions to dicts in the python standard library, providing fast and clean manipulation of nested dictionary structures.
[Code]
[Paper] [Code] [Poster] [NYU Blog Post] - Abstract
Citation: Thibault Févry and Jason Phang. Unsupervised Sentence Compression using Denoising Auto-Encoders. Proceedings of CoNLL, 2018.
[Paper] [Code] - Abstract
Citation: Jason Phang, Thibault Févry, Samuel R. Bowman. Sentence Encoders on STILTs: Supplementary Training on Intermediate Labeled-data Tasks. Preprint, 2018.
[Paper] - Abstract
Citation: Thibault Févry, Jason Phang, Nan Wu, S. Gene Kim, Linda Moy, Kyunghyun Cho, Krzysztof J. Geras. Improving localization-based approaches for breast cancer screening exam classification. Extended Abstract, MIDL, 2019.
[Paper] - Abstract
Citation: Jungkyu Park, Jason Phang, Yiqiu Shen, Nan Wu, S. Gene Kim, Linda Moy, Kyunghyun Cho, Krzysztof J. Geras. Screening Mammogram Classification with Prior Exams. Extended Abstract, MIDL, 2019.
[Paper] - Abstract
Citation: Yiqiu Shen, Nan Wu, Jason Phang, Jungkyu Park, Gene Kim, Linda Moy, Kyunghyun Cho, Krzysztof J. Geras. Globally-Aware Multiple Instance Classifier for Breast Cancer Screening. Extended Abstract, MIDL, 2019.
[Paper] [Code] - Abstract
Citation: Alex Warstadt, Yu Cao, Ioana Grosu, Wei Peng, Hagen Blix, Yining Nie, Anna Alsop, Shikha Bordia, Haokun Liu, Alicia Parrish, Sheng-Fu Wang, Jason Phang, Anhad Mohananey, Phu Mon Htut, Paloma Jeretič, Samuel R. Bowman. Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs. Proceedings of EMNLP, 2019.
[Paper] [arXiv] [Code] [Data Report] [Medium Post] - Abstract
Citation: Nan Wu, Jason Phang, Jungkyu Park, Yiqiu Shen, Zhe Huang, Masha Zorin, Stanisław Jastrzębski, Thibault Févry, Joe Katsnelson, Eric Kim, Stacey Wolfson, Ujas Parikh, Sushma Gaddam, Leng Leng Young Lin, Kara Ho, Joshua D. Weinstein, Beatriu Reig, Yiming Gao, Hildegard Toth, Kristine Pysarenko, Alana Lewin, Jiyon Lee, Krystal Airola, Eralda Mema, Stephanie Chung, Esther Hwang, Naziya Samreen, S. Gene Kim, Laura Heacock, Linda Moy, Kyunghyun Cho, Krzysztof J. Geras. Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening. IEEE Transactions on Medical Imaging, 2019.
[Paper] [Code] - Abstract
Citation: Yada Pruksachatkun, Phil Yeres, Haokun Liu, Jason Phang, Phu Mon Htut, Alex Wang, Ian Tenney, Samuel R. Bowman. jiant: A Software Toolkit for Research on General-Purpose Text Understanding Models. Proceedings of ACL (demonstration track), 2020.
[Paper] - Abstract
Citation: Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, Samuel R. Bowman. Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work?. Proceedings of ACL, 2020.
[Paper] - Abstract
Citation: Jason Phang, Iacer Calixto, Phu Mon Htut, Yada Pruksachatkun, Haokun Liu, Clara Vania, Katharina Kann, Samuel R. Bowman. English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too. Proceedings of AACL, 2020.
[Paper] - Abstract
Citation: Nan Wu, Zhe Huang, Yiqiu Shen, Jungkyu Park, Jason Phang, Taro Makino, S. Gene Kim, Kyunghyun Cho, Laura Heacock, Linda Moy, Krzysztof J. Geras. Reducing false-positive biopsies with deep neural networks that utilize local and global information in screening mammograms. Preprint, 2020.
[Paper] [Code]- Abstract
Citation: Jason Phang, Jungkyu Park, Krzysztof J. Geras. Investigating and Simplifying Masking-based Saliency Methods for Model Interpretability. Preprint, 2020.
[Paper] [Code] - Abstract
Citation: Yiqiu Shen, Nan Wu, Jason Phang, Jungkyu Park, Yiqiu Shen, Kangning Liu, Sudarshini Tyagi, Laura Heacock, S. Gene Kim, Linda Moy, Kyunghyun Cho, Krzysztof J. Geras. An interpretable classifier for high-resolution breast cancer screening images utilizing weakly supervised localization. Medical Image Analysis, Vol. 68, 2021
[Paper] [Website] - Abstract
Citation: Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy. The Pile: An 800GB Dataset of Diverse Text for Language Modeling. Preprint, 2021.
[Paper] - Abstract
Citation: Clara Vania, Phu Mon Htut, William Huang, Dhara Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, Kyunghyun Cho, Samuel R. Bowman. Comparing Test Sets with Item Response Theory. Preprint, 2021.
[Paper] - Abstract
Citation: Jason Phang, Haokun Liu, Samuel R. Bowman. Fine-Tuned Transformers Show Clusters of Similar Representations Across Layers. Blackbox NLP, 2021.
[Paper] - Abstract
Citation: Jason Phang, Angelica Chen, William Huang, Samuel R. Bowman Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair. Preprint, 2021.
[Paper] - Abstract
Citation: Richard Yuanzhe Pang, Alicia Parrish, Nitish Joshi, Nikita Nangia, Jason Phang, Angelica Chen, Vishakh Padmakumar, Johnny Ma, Jana Thompson, He He, Samuel R. Bowman QuALITY: Question Answering with Long Input Texts, Yes!. Proceedings of NAACL, 2022.
[Paper] - Abstract
Citation: Alicia Parrish, Harsh Trivedi, Ethan Perez, Angelica Chen, Nikita Nangia, Jason Phang, Samuel Bowman Single-Turn Debate Does Not Help Humans Answer Hard Reading-Comprehension Questions. Proceedings of the First Workshop on Learning with Natural Language Supervision.
[Paper] - Abstract
Citation: Sidney Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle McDonell, Jason Phang, Michael Pieler, Usvsn Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach GPT-NeoX-20B: An Open-Source Autoregressive Language Model. Preprint.
[Paper] - Abstract
Citation: Alex Wang, Richard Yuanzhe Pang, Angelica Chen, Jason Phang, Samuel R. Bowman SQuALITY: Building a Long-Document Summarization Dataset the Hard Way. Preprint.
[Paper] - Abstract
Citation: Big Bench Collaboration (incl. Jason Phang) Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Preprint.
[Paper] - Abstract
Citation: Jason Phang, Yao Zhao, Peter J. Liu Investigating Efficiently Extending Transformers for Long Input Summarization. Preprint.
[Paper] - Abstract
Citation: Julian Michael, Ari Holtzman, Alicia Parrish, Aaron Mueller, Alex Wang, Angelica Chen, Divyam Madaan, Nikita Nangia, Richard Yuanzhe Pang, Jason Phang, Samuel R. Bowman What Do NLP Researchers Believe? Results of the NLP Community Metasurvey. Preprint.
[Paper] - Abstract
Citation: Jason Phang, Herbie Bradley, Leo Gao, Louis Castricato, Stella Biderman EleutherAI: Going Beyond "Open Science" to "Science in the Open". Workshop on Broadening Research Collaborations in ML, NeurIPS 2022.
[Paper] - Abstract
Citation: Alicia Parrish, Harsh Trivedi, Nikita Nangia, Vishakh Padmakumar, Jason Phang, Amanpreet Singh Saimbhi, Samuel R. Bowman Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions. Workshop on Machine Learning Safety, NeurIPS 2022.
[Paper] - Abstract
Citation: Teven Le Scao, Thomas Wang, Daniel Hesslow, Lucile Saulnier, Stas Bekman, M Saiful Bari, Stella Biderman, Hady Elsahar, Niklas Muennighoff, Jason Phang, Ofir Press, Colin Raffel, Victor Sanh, Sheng Shen, Lintang Sutawika, Jaesung Tae, Zheng Xin Yong, Julien Launay, Iz Beltagy What Language Model to Train if You Have One Million GPU Hours?. Findings of EMNLP 2022.
[Paper] - Abstract
Citation: BLOOM Authors (incl. Jason Phang) BLOOM: A 176B-Parameter Open-Access Multilingual Language Model. Preprint.
[Paper] - Abstract
Citation: Jason Phang, Yi Mao, Pengcheng He, Weizhu Chen HyperTuning: Toward Adapting Large Language Models without Back-propagation. Preprint.
[Paper] - Abstract
Citation: Tomasz Korbak, Kejian Shi, Angelica Chen, Rasika Bhalerao, Christopher L. Buckley, Jason Phang, Samuel R. Bowman, Ethan Perez Pretraining Language Models with Human Preferences. Preprint.
[Paper] - Abstract
Citation: Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu, Zhenning Dai, Lan Yan, Xin Cong, Yaxi Lu, Weilin Zhao, Yuxiang Huang, Junxi Yan, Xu Han, Xian Sun, Dahai Li, Jason Phang, Cheng Yang, Tongshuang Wu, Heng Ji, Zhiyuan Liu, Maosong Sun Tool Learning with Foundation Models. Preprint.
Graduate course, Teaching Assistant, New York University, Center for Data Science, 2018
Graduate course, Grader, New York University, Center for Data Science, 2019
Graduate course, Teaching Assistant, New York University, Center for Data Science, 2019
Teaching Assistant, New York University, Center for Data Science, 2020