Canwen Xu
2020 – today
- 2024
- [b1] Canwen Xu: Efficient Natural Language Processing for Language Models. University of California, San Diego, USA, 2024
- [c29] Canwen Xu, Yichong Xu, Shuohang Wang, Yang Liu, Chenguang Zhu, Julian J. McAuley: Small Models are Valuable Plug-ins for Large Language Models. ACL (Findings) 2024: 283-294
- [c28] Tianyang Liu, Canwen Xu, Julian J. McAuley: RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems. ICLR 2024
- [c27] Canwen Xu, Corby Rosset, Ethan C. Chau, Luciano Del Corro, Shweti Mahajan, Julian J. McAuley, Jennifer Neville, Ahmed Awadallah, Nikhil Rao: Automatic Pair Construction for Contrastive Post-training. NAACL-HLT (Findings) 2024: 149-162
- [i28] Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, Evgenii Zheltonozhskii, Nii Osae Osae Dade, Wenhao Yu, Lucas Krauß, Naman Jain, Yixuan Su, Xuanli He, Manan Dey, Edoardo Abati, Yekun Chai, Niklas Muennighoff, Xiangru Tang, Muhtasham Oblokulov, Christopher Akiki, Marc Marone, Chenghao Mou, Mayank Mishra, Alex Gu, Binyuan Hui, Tri Dao, Armel Zebaze, Olivier Dehaene, Nicolas Patry, Canwen Xu, Julian J. McAuley, Han Hu, Torsten Scholak, Sébastien Paquet, Jennifer Robinson, Carolyn Jane Anderson, Nicolas Chapados, et al.: StarCoder 2 and The Stack v2: The Next Generation. CoRR abs/2402.19173 (2024)
- 2023
- [c26] Canwen Xu, Julian J. McAuley: A Survey on Model Compression and Acceleration for Pretrained Language Models. AAAI 2023: 10566-10575
- [c25] Canwen Xu, Julian J. McAuley: A Survey on Dynamic Neural Networks for Natural Language Processing. EACL (Findings) 2023: 2325-2336
- [c24] Ryan Tran, Canwen Xu, Julian J. McAuley: Spoiler Detection as Semantic Text Matching. EMNLP 2023: 6109-6113
- [c23] Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley: Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data. EMNLP 2023: 6268-6278
- [c22] Daya Guo, Canwen Xu, Nan Duan, Jian Yin, Julian J. McAuley: LongCoder: A Long-Range Pre-trained Language Model for Code Completion. ICML 2023: 12098-12107
- [c21] Canwen Xu, Julian J. McAuley, Penghan Wang: Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization. WWW (Companion Volume) 2023: 49-52
- [i27] Canwen Xu, Julian J. McAuley, Penghan Wang: Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization. CoRR abs/2303.08697 (2023)
- [i26] Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley: Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data. CoRR abs/2304.01196 (2023)
- [i25] Canwen Xu, Yichong Xu, Shuohang Wang, Yang Liu, Chenguang Zhu, Julian J. McAuley: Small Models are Valuable Plug-ins for Large Language Models. CoRR abs/2305.08848 (2023)
- [i24] Tianyang Liu, Canwen Xu, Julian J. McAuley: RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems. CoRR abs/2306.03091 (2023)
- [i23] Daya Guo, Canwen Xu, Nan Duan, Jian Yin, Julian J. McAuley: LongCoder: A Long-Range Pre-trained Language Model for Code Completion. CoRR abs/2306.14893 (2023)
- [i22] Canwen Xu, Corby Rosset, Luciano Del Corro, Shweti Mahajan, Julian J. McAuley, Jennifer Neville, Ahmed Hassan Awadallah, Nikhil Rao: Contrastive Post-training Large Language Models on Data Curriculum. CoRR abs/2310.02263 (2023)
- 2022
- [c20] Canwen Xu, Zexue He, Zhankui He, Julian J. McAuley: Leashing the Inner Demons: Self-Detoxification for Language Models. AAAI 2022: 11530-11537
- [c19] Stephen H. Bach, Victor Sanh, Zheng Xin Yong, Albert Webson, Colin Raffel, Nihal V. Nayak, Abheesht Sharma, Taewoon Kim, M. Saiful Bari, Thibault Févry, Zaid Alyafeai, Manan Dey, Andrea Santilli, Zhiqing Sun, Srulik Ben-David, Canwen Xu, Gunjan Chhablani, Han Wang, Jason Alan Fries, Maged Saeed AlShaibani, Shanya Sharma, Urmish Thakker, Khalid Almubarak, Xiangru Tang, Dragomir R. Radev, Mike Tian-Jian Jiang, Alexander M. Rush: PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts. ACL (demo) 2022: 93-104
- [c18] Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley: LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval. ACL (Findings) 2022: 3557-3569
- [c17] Wangchunshu Zhou, Canwen Xu, Julian J. McAuley: BERT Learns to Teach: Knowledge Distillation with Meta Learning. ACL (1) 2022: 7037-7049
- [c16] Wangchunshu Zhou, Canwen Xu, Julian J. McAuley: Efficiently Tuned Parameters Are Task Embeddings. EMNLP 2022: 5007-5014
- [c15] Nafis Sadeq, Canwen Xu, Julian J. McAuley: InforMask: Unsupervised Informative Masking for Language Model Pretraining. EMNLP 2022: 5866-5878
- [c14] Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal V. Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Févry, Jason Alan Fries, Ryan Teehan, Teven Le Scao, Stella Biderman, Leo Gao, Thomas Wolf, Alexander M. Rush: Multitask Prompted Training Enables Zero-Shot Task Generalization. ICLR 2022
- [c13] Han Wang, Canwen Xu, Julian J. McAuley: Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification. NAACL-HLT 2022: 5483-5492
- [i21] Stephen H. Bach, Victor Sanh, Zheng Xin Yong, Albert Webson, Colin Raffel, Nihal V. Nayak, Abheesht Sharma, Taewoon Kim, M. Saiful Bari, Thibault Févry, Zaid Alyafeai, Manan Dey, Andrea Santilli, Zhiqing Sun, Srulik Ben-David, Canwen Xu, Gunjan Chhablani, Han Wang, Jason Alan Fries, Maged Saeed AlShaibani, Shanya Sharma, Urmish Thakker, Khalid Almubarak, Xiangru Tang, Mike Tian-Jian Jiang, Alexander M. Rush: PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts. CoRR abs/2202.01279 (2022)
- [i20] Canwen Xu, Julian J. McAuley: A Survey on Dynamic Neural Networks for Natural Language Processing. CoRR abs/2202.07101 (2022)
- [i19] Canwen Xu, Julian J. McAuley: A Survey on Model Compression for Natural Language Processing. CoRR abs/2202.07105 (2022)
- [i18] Canwen Xu, Zexue He, Zhankui He, Julian J. McAuley: Leashing the Inner Demons: Self-Detoxification for Language Models. CoRR abs/2203.03072 (2022)
- [i17] Canwen Xu, Daya Guo, Nan Duan, Julian J. McAuley: LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval. CoRR abs/2203.06169 (2022)
- [i16] Han Wang, Canwen Xu, Julian J. McAuley: Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification. CoRR abs/2204.06305 (2022)
- [i15] Wangchunshu Zhou, Canwen Xu, Julian J. McAuley: Efficiently Tuned Parameters are Task Embeddings. CoRR abs/2210.11705 (2022)
- [i14] Nafis Sadeq, Canwen Xu, Julian J. McAuley: InforMask: Unsupervised Informative Masking for Language Model Pretraining. CoRR abs/2210.11771 (2022)
- [i13] Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilic, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel van Strien, David Ifeoluwa Adelani, et al.: BLOOM: A 176B-Parameter Open-Access Multilingual Language Model. CoRR abs/2211.05100 (2022)
- 2021
- [c12] Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario Sasko, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Clément Delangue, Théo Matussière, Lysandre Debut, Stas Bekman, Pierric Cistac, Thibault Goehringer, Victor Mustar, François Lagunas, Alexander M. Rush, Thomas Wolf: Datasets: A Community Library for Natural Language Processing. EMNLP (Demos) 2021: 175-184
- [c11] Wangchunshu Zhou, Tao Ge, Canwen Xu, Ke Xu, Furu Wei: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting. EMNLP (1) 2021: 571-582
- [c10] Canwen Xu, Wangchunshu Zhou, Tao Ge, Ke Xu, Julian J. McAuley, Furu Wei: Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression. EMNLP (1) 2021: 10653-10659
- [c9] Canwen Xu, Wangchunshu Zhou, Tao Ge, Ke Xu, Julian J. McAuley, Furu Wei: Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge. NAACL-HLT 2021: 2139-2145
- [i12] Canwen Xu, Wangchunshu Zhou, Tao Ge, Ke Xu, Julian J. McAuley, Furu Wei: Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge. CoRR abs/2104.02704 (2021)
- [i11] Wangchunshu Zhou, Canwen Xu, Julian J. McAuley: Meta Learning for Knowledge Distillation. CoRR abs/2106.04570 (2021)
- [i10] Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario Sasko, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Clement Delangue, Théo Matussière, Lysandre Debut, Stas Bekman, Pierric Cistac, Thibault Goehringer, Victor Mustar, François Lagunas, Alexander M. Rush, Thomas Wolf: Datasets: A Community Library for Natural Language Processing. CoRR abs/2109.02846 (2021)
- [i9] Canwen Xu, Wangchunshu Zhou, Tao Ge, Ke Xu, Julian J. McAuley, Furu Wei: Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression. CoRR abs/2109.03228 (2021)
- [i8] Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M. Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal V. Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, Zheng Xin Yong, Harshit Pandey, Rachel Bawden, Thomas Wang, Trishala Neeraj, Jos Rozen, Abheesht Sharma, Andrea Santilli, Thibault Févry, Jason Alan Fries, Ryan Teehan, Stella Biderman, Leo Gao, Tali Bers, Thomas Wolf, Alexander M. Rush: Multitask Prompted Training Enables Zero-Shot Task Generalization. CoRR abs/2110.08207 (2021)
- 2020
- [c8] Yu Duan, Canwen Xu, Jiaxin Pei, Jialong Han, Chenliang Li: Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders. ACL 2020: 253-262
- [c7] Canwen Xu, Jiaxin Pei, Hongtao Wu, Yiyu Liu, Chenliang Li: MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization. ACL 2020: 3586-3596
- [c6] Thomas Wolf, Lysandre Debut, Victor Sanh, Julien Chaumond, Clement Delangue, Anthony Moi, Pierric Cistac, Tim Rault, Rémi Louf, Morgan Funtowicz, Joe Davison, Sam Shleifer, Patrick von Platen, Clara Ma, Yacine Jernite, Julien Plu, Canwen Xu, Teven Le Scao, Sylvain Gugger, Mariama Drame, Quentin Lhoest, Alexander M. Rush: Transformers: State-of-the-Art Natural Language Processing. EMNLP (Demos) 2020: 38-45
- [c5] Canwen Xu, Wangchunshu Zhou, Tao Ge, Furu Wei, Ming Zhou: BERT-of-Theseus: Compressing BERT by Progressive Module Replacing. EMNLP (1) 2020: 7859-7869
- [c4] Canwen Xu, Tao Ge, Chenliang Li, Furu Wei: UnihanLM: Coarse-to-Fine Chinese-Japanese Language Model Pretraining with the Unihan Database. AACL/IJCNLP 2020: 201-211
- [c3] Wangchunshu Zhou, Canwen Xu, Tao Ge, Julian J. McAuley, Ke Xu, Furu Wei: BERT Loses Patience: Fast and Robust Inference with Early Exit. NeurIPS 2020
- [i7] Canwen Xu, Wangchunshu Zhou, Tao Ge, Furu Wei, Ming Zhou: BERT-of-Theseus: Compressing BERT by Progressive Module Replacing. CoRR abs/2002.02925 (2020)
- [i6] Canwen Xu, Jiaxin Pei, Hongtao Wu, Yiyu Liu, Chenliang Li: MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization. CoRR abs/2004.12302 (2020)
- [i5] Wangchunshu Zhou, Canwen Xu, Tao Ge, Julian J. McAuley, Ke Xu, Furu Wei: BERT Loses Patience: Fast and Robust Inference with Early Exit. CoRR abs/2006.04152 (2020)
2010 – 2019
- 2019
- [c2] Canwen Xu, Feiyang Wang, Jialong Han, Chenliang Li: Exploiting Multiple Embeddings for Chinese Named Entity Recognition. CIKM 2019: 2269-2272
- [c1] Canwen Xu, Jing Li, Xiangyang Luo, Jiaxin Pei, Chenliang Li, Donghong Ji: DLocRL: A Deep Learning Pipeline for Fine-Grained Location Recognition and Linking in Tweets. WWW 2019: 3391-3397
- [i4] Canwen Xu, Jing Li, Xiangyang Luo, Jiaxin Pei, Chenliang Li, Donghong Ji: DLocRL: A Deep Learning Pipeline for Fine-Grained Location Recognition and Linking in Tweets. CoRR abs/1901.07005 (2019)
- [i3] Canwen Xu, Zhenzhong Chen, Chenliang Li: Obj-GloVe: Scene-Based Contextual Object Embedding. CoRR abs/1907.01478 (2019)
- [i2] Canwen Xu, Feiyang Wang, Jialong Han, Chenliang Li: Exploiting Multiple Embeddings for Chinese Named Entity Recognition. CoRR abs/1908.10657 (2019)
- [i1] Yu Duan, Jiaxin Pei, Canwen Xu, Chenliang Li: Pre-train and Plug-in: Flexible Conditional Text Generation with Variational Auto-Encoders. CoRR abs/1911.03882 (2019)
last updated on 2024-10-07 21:25 CEST by the dblp team
all metadata released as open data under CC0 1.0 license