default search action

combined dblp search
author search
venue search
publication search

ask others

Chenjun Xiao

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HaoHXLLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HaoHXLLZ24
Xiaotian Hao, Jianye Hao, Chenjun Xiao, Kai Li, Dong Li, Yan Zheng:
Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces. AAAI 2024: 12304-12312
[c20]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/CheXM0GRHMS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/CheXM0GRHMS24
Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A. Ramirez, Christopher K. Harris, A. Rupam Mahmood, Dale Schuurmans:
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation. ICML 2024
[c19]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/Ma0FX0H0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Ma0FX0H0L24
Haoyu Ma, Jialong Wu, Ningya Feng, Chenjun Xiao, Dong Li, Jianye Hao, Jianmin Wang, Mingsheng Long:
HarmonyDream: Task Harmonization Inside World Models. ICML 2024
[c18]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/MaHLX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MaHLX24
Yi Ma, Jianye Hao, Hebin Liang, Chenjun Xiao:
Rethinking Decision Transformer via Hierarchical Reinforcement Learning. ICML 2024
[c17]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ZhangRXS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhangRXS024
Hongming Zhang, Tongzheng Ren, Chenjun Xiao, Dale Schuurmans, Bo Dai:
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning. ICML 2024
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-15518
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-15518
Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr:
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models. CoRR abs/2404.15518 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-21043
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-21043
Fengdi Che, Chenjun Xiao, Jincheng Mei, Bo Dai, Ramki Gummadi, Oscar A. Ramirez, Christopher K. Harris, A. Rupam Mahmood, Dale Schuurmans:
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation. CoRR abs/2405.21043 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-16121
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-16121
Dmitry Shribak, Chen-Xiao Gao, Yitong Li, Chenjun Xiao, Bo Dai:
Diffusion Spectral Representation for Reinforcement Learning. CoRR abs/2406.16121 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-04451
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04451
Chen-Xiao Gao, Shengjun Fang, Chenjun Xiao, Yang Yu, Zongzhang Zhang:
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning. CoRR abs/2407.04451 (2024)
2023
[c16]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/RenXZ0WSSD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/RenXZ0WSSD23
Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai:
Latent Variable Representation for Reinforcement Learning. ICLR 2023
[c15]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/XiaoWP0W23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/XiaoWP0W23
Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White:
The In-Sample Softmax for Offline Reinforcement Learning. ICLR 2023
[c14]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/ZhangXW00023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhangXW00023
Hongming Zhang, Chenjun Xiao, Han Wang, Jun Jin, Bo Xu, Martin Müller:
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay. ICLR 2023
[c13]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/uai/ZhangRXXGS023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/ZhangRXXGS023
Tianjun Zhang, Tongzheng Ren, Chenjun Xiao, Wenli Xiao, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai:
Energy-based Predictive Representations for Partially Observed Reinforcement Learning. UAI 2023: 2477-2487
[c12]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/uai/ZhaoPXCR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/ZhaoPXCR23
Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran:
Conditionally optimistic exploration for cooperative deep multi-agent reinforcement learning. UAI 2023: 2529-2540
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-14372
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-14372
Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White:
The In-Sample Softmax for Offline Reinforcement Learning. CoRR abs/2302.14372 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-09032
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-09032
Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran:
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning. CoRR abs/2303.09032 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-05726
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-05726
Xiaohan Hu, Yi Ma, Chenjun Xiao, Yan Zheng, Zhaopeng Meng:
In-Sample Policy Iteration for Offline Reinforcement Learning. CoRR abs/2306.05726 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-00267
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-00267
Yi Ma, Chenjun Xiao, Hebin Liang, Jianye Hao:
Rethinking Decision Transformer via Hierarchical Reinforcement Learning. CoRR abs/2311.00267 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-12244
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-12244
Hongming Zhang, Tongzheng Ren, Chenjun Xiao, Dale Schuurmans, Bo Dai:
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning. CoRR abs/2311.12244 (2023)
2022
[c11]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/aistats/XiaoLDSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/XiaoLDSS22
Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvári:
The Curse of Passive Data Collection in Batch Reinforcement Learning. AISTATS 2022: 8413-8438
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/XiaoDMRGHS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/XiaoDMRGHS22
Chenjun Xiao, Bo Dai, Jincheng Mei, Oscar A. Ramirez, Ramki Gummadi, Chris Harris, Dale Schuurmans:
Understanding and Leveraging Overparameterization in Recursive Value Estimation. ICLR 2022
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-08765
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08765
Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai:
Latent Variable Representation for Reinforcement Learning. CoRR abs/2212.08765 (2022)
2021
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/XiaoWMDL0SS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/XiaoWMDL0SS21
Chenjun Xiao, Yifan Wu, Jincheng Mei, Bo Dai, Tor Lattimore, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
On the Optimality of Batch Policy Optimization Algorithms. ICML 2021: 11362-11371
[c8]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/MeiDXSS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MeiDXSS21
Jincheng Mei, Bo Dai, Chenjun Xiao, Csaba Szepesvári, Dale Schuurmans:
Understanding the Effect of Stochasticity in Policy Optimization. NeurIPS 2021: 19339-19351
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02293
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02293
Chenjun Xiao, Yifan Wu, Tor Lattimore, Bo Dai, Jincheng Mei, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
On the Optimality of Batch Policy Optimization Algorithms. CoRR abs/2104.02293 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-09973
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-09973
Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvári:
On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data. CoRR abs/2106.09973 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-15572
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-15572
Jincheng Mei, Bo Dai, Chenjun Xiao, Csaba Szepesvári, Dale Schuurmans:
Understanding the Effect of Stochasticity in Policy Optimization. CoRR abs/2110.15572 (2021)
2020
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/MeiXSS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MeiXSS20
Jincheng Mei, Chenjun Xiao, Csaba Szepesvári, Dale Schuurmans:
On the Global Convergence Rates of Softmax Policy Gradient Methods. ICML 2020: 6820-6829
[c6]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/MeiXD0SS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MeiXD0SS20
Jincheng Mei, Chenjun Xiao, Bo Dai, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
Escaping the Gravitational Pull of Softmax. NeurIPS 2020
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-06392
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-06392
Jincheng Mei, Chenjun Xiao, Csaba Szepesvári, Dale Schuurmans:
On the Global Convergence Rates of Softmax Policy Gradient Methods. CoRR abs/2005.06392 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/MeiXHS019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/MeiXHS019
Jincheng Mei, Chenjun Xiao, Ruitong Huang, Dale Schuurmans, Martin Müller:
On Principled Entropy Exploration in Policy Optimization. IJCAI 2019: 3130-3136
[c4]
- view
- export record
  dblp key:
  - conf/nips/XiaoHMS019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XiaoHMS019
Chenjun Xiao, Ruitong Huang, Jincheng Mei, Dale Schuurmans, Martin Müller:
Maximum Entropy Monte-Carlo Planning. NeurIPS 2019: 9516-9524
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1912-11206
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-11206
Chenjun Xiao, Yifan Wu, Chen Ma, Dale Schuurmans, Martin Müller:
Learning to Combat Compounding-Error in Model-Based Reinforcement Learning. CoRR abs/1912.11206 (2019)
2018
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XiaoM018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XiaoM018
Chenjun Xiao, Jincheng Mei, Martin Müller:
Memory-Augmented Monte Carlo Tree Search. AAAI 2018: 1455-1462
2017
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tciaig/WangXZHTW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tciaig/WangXZHTW17
Jiao Wang, Chenjun Xiao, Tan Zhu, Chu-Hsuan Hsueh, Wen-Jie Tseng, I-Chen Wu:
Only-One-Victor Pattern Learning in Computer Go. IEEE Trans. Comput. Intell. AI Games 9(1): 88-102 (2017)
2016
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XiaoM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XiaoM16
Chenjun Xiao, Martin Müller:
Factorization Ranking Model for Move Prediction in the Game of Go. AAAI 2016: 1359-1365
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcai/Xiao016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/Xiao016
Chenjun Xiao, Martin Müller:
Integrating Factorization Ranked Features in MCTS: An Experimental Study. CGW@IJCAI 2016: 34-43
2012
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/icga/WangLXX12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/icga/WangLXX12
Jiao Wang, Hongye Li, Chenjun Xiao, Xin-He Xu:
The 2^nd National University Student Computer-Games Tournaments. J. Int. Comput. Games Assoc. 35(3): 182-185 (2012)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.