Dhawal Gupta

2020 – today

2024
[j7]
  persistent URL:
  - https://dblp.org/rec/conf/rlc/AyoubSZCGSS24
Alex Ayoub, David Szepesvari, Francesco Zanini, Bryan Chan, Dhawal Gupta, Bruno Castro da Silva, Dale Schuurmans:
Mitigating the Curse of Horizon in Monte-Carlo Returns. RLJ 2: 563-572 (2024)
[j6]
  persistent URL:
  - https://dblp.org/rec/conf/rlc/ChoudharyGT24
Kartik Choudhary, Dhawal Gupta, Philip S. Thomas:
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data. RLJ 4: 1546-1566 (2024)
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/access/WeqarMGU24
Mehwash Weqar, Shabana Mehfuz, Dhawal Gupta, Shabana Urooj:
Adaptive Switching Based Data-Communication Model for Internet of Healthcare Things Networks. IEEE Access 12: 11530-11548 (2024)
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/GuptaJCLTS24
Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva:
From Past to Future: Rethinking Eligibility Traces. AAAI 2024: 12253-12260
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05646
Kartik Choudhary, Dhawal Gupta, Philip S. Thomas:
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data. CoRR abs/2406.05646 (2024)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-00997
Erfan Entezami, Mahsa Sahebdel, Dhawal Gupta:
A Safe Exploration Strategy for Model-free Task Adaptation in Safety-constrained Grid Environments. CoRR abs/2408.00997 (2024)
2023
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChowTNGRGB23
Yinlam Chow, Aza Tulepbergenov, Ofir Nachum, Dhawal Gupta, Moonkyung Ryu, Mohammad Ghavamzadeh, Craig Boutilier:
A Mixture-of-Expert Approach to RL-based Dialogue Management. ICLR 2023
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuptaCJT023
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno C. da Silva:
Behavior Alignment via Reward Function Optimization. NeurIPS 2023
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuptaCTGB23
Dhawal Gupta, Yinlam Chow, Azamat Tulepbergenov, Mohammad Ghavamzadeh, Craig Boutilier:
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management. NeurIPS 2023
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-10850
Dhawal Gupta, Yinlam Chow, Mohammad Ghavamzadeh, Craig Boutilier:
Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management. CoRR abs/2302.10850 (2023)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-09838
James E. Kostas, Scott M. Jordan, Yash Chandak, Georgios Theocharous, Dhawal Gupta, Martha White, Bruno Castro da Silva, Philip S. Thomas:
Coagent Networks: Generalized and Scaled. CoRR abs/2305.09838 (2023)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09055
Simeng Sun, Dhawal Gupta, Mohit Iyyer:
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF. CoRR abs/2309.09055 (2023)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-19007
Dhawal Gupta, Yash Chandak, Scott M. Jordan, Philip S. Thomas, Bruno Castro da Silva:
Behavior Alignment via Reward Function Optimization. CoRR abs/2310.19007 (2023)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-12972
Dhawal Gupta, Scott M. Jordan, Shreyas Chaudhari, Bo Liu, Philip S. Thomas, Bruno Castro da Silva:
From Past to Future: Rethinking Eligibility Traces. CoRR abs/2312.12972 (2023)
2021
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/cogcom/SahaGSB21
Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya:
Emotion Aided Dialogue Act Classification for Task-Independent Conversations in a Multi-modal Framework. Cogn. Comput. 13(2): 277-289 (2021)
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/mta/SahaGSB21
Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya:
A hierarchical approach for efficient multi-intent dialogue policy learning. Multim. Tools Appl. 80(28-29): 35025-35050 (2021)
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/talip/SahaGSB21
Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya:
A Unified Dialogue Management Strategy for Multi-intent Dialogue Conversations in Multiple Languages. ACM Trans. Asian Low Resour. Lang. Inf. Process. 20(6): 99:1-99:22 (2021)
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuptaMSKTW21
Dhawal Gupta, Gabor Mihucz, Matthew Schlegel, James E. Kostas, Philip S. Thomas, Martha White:
Structural Credit Assignment in Neural Networks using Reinforcement Learning. NeurIPS 2021: 30257-30270
2020
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/eswa/SahaGSB20
Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya:
Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning. Expert Syst. Appl. 162: 113650 (2020)
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/icml/GhiassianP0GWW20
Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, Martha White:
Gradient Temporal-Difference Learning with Regularized Corrections. ICML 2020: 3524-3534
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-00611
Sina Ghiassian, Andrew Patterson, Shivam Garg, Dhawal Gupta, Adam White, Martha White:
Gradient Temporal-Difference Learning with Regularized Corrections. CoRR abs/2007.00611 (2020)

2010 – 2019

2018
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/iconip/SahaGSB18
Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya:
Reinforcement Learning Based Dialogue Management Strategy. ICONIP (3) 2018: 359-372
