Hadi Daneshmand
2020 – today
- 2024
  - [c15] Alexandru Meterez, Amir Joudaki, Francesco Orabona, Alexander Immer, Gunnar Rätsch, Hadi Daneshmand: Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion. ICLR 2024
  - [i19] Jiuqi Wang, Ethan Blaser, Hadi Daneshmand, Shangtong Zhang: Transformers Learn Temporal Difference Methods for In-Context Reinforcement Learning. CoRR abs/2405.13861 (2024)
  - [i18] Hadi Daneshmand: Provable optimal transport with transformers: The essence of depth and prompt engineering. CoRR abs/2410.19931 (2024)
- 2023
  - [c14] Hadi Daneshmand, Jason D. Lee, Chi Jin: Efficient displacement convex optimization with particle gradient descent. ICML 2023: 6836-6854
  - [c13] Amir Joudaki, Hadi Daneshmand, Francis R. Bach: On Bridging the Gap between Mean Field and Finite Width Deep Random Multilayer Perceptron with Batch Normalization. ICML 2023: 15388-15400
  - [c12] Kwangjun Ahn, Xiang Cheng, Hadi Daneshmand, Suvrit Sra: Transformers learn to implement preconditioned gradient descent for in-context learning. NeurIPS 2023
  - [c11] Amir Joudaki, Hadi Daneshmand, Francis R. Bach: On the impact of activation and normalization in obtaining isometric embeddings at initialization. NeurIPS 2023
  - [i17] Hadi Daneshmand, Jason D. Lee, Chi Jin: Efficient displacement convex optimization with particle gradient descent. CoRR abs/2302.04753 (2023)
  - [i16] Amir Joudaki, Hadi Daneshmand, Francis R. Bach: On the impact of activation and normalization in obtaining isometric embeddings at initialization. CoRR abs/2305.18399 (2023)
  - [i15] Kwangjun Ahn, Xiang Cheng, Hadi Daneshmand, Suvrit Sra: Transformers learn to implement preconditioned gradient descent for in-context learning. CoRR abs/2306.00297 (2023)
  - [i14] Alexandru Meterez, Amir Joudaki, Francesco Orabona, Alexander Immer, Gunnar Rätsch, Hadi Daneshmand: Towards Training Without Depth Limits: Batch Normalization Without Gradient Explosion. CoRR abs/2310.02012 (2023)
- 2022
  - [i13] Hadi Daneshmand, Francis R. Bach: Polynomial-time sparse measure recovery. CoRR abs/2204.07879 (2022)
  - [i12] Amir Joudaki, Hadi Daneshmand, Francis R. Bach: Entropy Maximization with Depth: A Variational Principle for Random Neural Networks. CoRR abs/2205.13076 (2022)
- 2021
  - [c10] Peiyuan Zhang, Antonio Orvieto, Hadi Daneshmand, Thomas Hofmann, Roy S. Smith: Revisiting the Role of Euler Numerical Integration on Acceleration and Stability in Convex Optimization. AISTATS 2021: 3979-3987
  - [c9] Hadi Daneshmand, Amir Joudaki, Francis R. Bach: Batch Normalization Orthogonalizes Representations in Deep Random Networks. NeurIPS 2021: 4896-4906
  - [c8] Peiyuan Zhang, Antonio Orvieto, Hadi Daneshmand: Rethinking the Variational Interpretation of Accelerated Optimization Methods. NeurIPS 2021: 14396-14406
  - [i11] Peiyuan Zhang, Antonio Orvieto, Hadi Daneshmand, Thomas Hofmann, Roy S. Smith: Revisiting the Role of Euler Numerical Integration on Acceleration and Stability in Convex Optimization. CoRR abs/2102.11537 (2021)
  - [i10] Hadi Daneshmand, Amir Joudaki, Francis R. Bach: Batch Normalization Orthogonalizes Representations in Deep Random Networks. CoRR abs/2106.03970 (2021)
- 2020
  - [b1] Hadi Daneshmand: Optimization for Neural Networks: Quest for Theoretical Understandings. ETH Zurich, Zürich, Switzerland, 2020
  - [c7] Hadi Daneshmand, Jonas Moritz Kohler, Francis R. Bach, Thomas Hofmann, Aurélien Lucchi: Batch normalization provably avoids ranks collapse for randomly initialised deep networks. NeurIPS 2020
  - [i9] Hadi Daneshmand, Jonas Moritz Kohler, Francis R. Bach, Thomas Hofmann, Aurélien Lucchi: Theoretical Understanding of Batch-normalization: A Markov Chain Perspective. CoRR abs/2003.01652 (2020)
2010 – 2019
- 2019
  - [c6] Leonard Adolphs, Hadi Daneshmand, Aurélien Lucchi, Thomas Hofmann: Local Saddle Point Optimization: A Curvature Exploitation Approach. AISTATS 2019: 486-495
  - [c5] Jonas Moritz Kohler, Hadi Daneshmand, Aurélien Lucchi, Thomas Hofmann, Ming Zhou, Klaus Neymeyr: Exponential convergence rates for Batch Normalization: The power of length-direction decoupling in non-convex optimization. AISTATS 2019: 806-815
  - [i8] Peiyuan Zhang, Hadi Daneshmand, Thomas Hofmann: Mixing of Stochastic Accelerated Gradient Descent. CoRR abs/1910.14616 (2019)
- 2018
  - [c4] Hadi Daneshmand, Jonas Moritz Kohler, Aurélien Lucchi, Thomas Hofmann: Escaping Saddles with Stochastic Gradients. ICML 2018: 1163-1172
  - [i7] Hadi Daneshmand, Jonas Moritz Kohler, Aurélien Lucchi, Thomas Hofmann: Escaping Saddles with Stochastic Gradients. CoRR abs/1803.05999 (2018)
  - [i6] Leonard Adolphs, Hadi Daneshmand, Aurélien Lucchi, Thomas Hofmann: Local Saddle Point Optimization: A Curvature Exploitation Approach. CoRR abs/1805.05751 (2018)
  - [i5] Jonas Moritz Kohler, Hadi Daneshmand, Aurélien Lucchi, Ming Zhou, Klaus Neymeyr, Thomas Hofmann: Towards a Theoretical Understanding of Batch Normalization. CoRR abs/1805.10694 (2018)
- 2017
  - [i4] Hadi Daneshmand, Hamed Hassani, Thomas Hofmann: Accelerated Dual Learning by Homotopic Initialization. CoRR abs/1706.03958 (2017)
- 2016
  - [j1] Manuel Gomez-Rodriguez, Le Song, Hadi Daneshmand, Bernhard Schölkopf: Estimating Diffusion Networks: Recovery Conditions, Sample Complexity and Soft-thresholding Algorithm. J. Mach. Learn. Res. 17: 90:1-90:29 (2016)
  - [c3] Hadi Daneshmand, Aurélien Lucchi, Thomas Hofmann: Starting Small - Learning with Adaptive Sample Sizes. ICML 2016: 1463-1471
  - [c2] Aryan Mokhtari, Hadi Daneshmand, Aurélien Lucchi, Thomas Hofmann, Alejandro Ribeiro: Adaptive Newton Method for Empirical Risk Minimization to Statistical Accuracy. NIPS 2016: 4062-4070
  - [i3] Hadi Daneshmand, Aurélien Lucchi, Thomas Hofmann: Starting Small - Learning with Adaptive Sample Sizes. CoRR abs/1603.02839 (2016)
  - [i2] Hadi Daneshmand, Aurélien Lucchi, Thomas Hofmann: DynaNewton - Accelerating Newton's Method for Machine Learning. CoRR abs/1605.06561 (2016)
- 2014
  - [c1] Hadi Daneshmand, Manuel Gomez-Rodriguez, Le Song, Bernhard Schölkopf: Estimating Diffusion Network Structures: Recovery Conditions, Sample Complexity & Soft-thresholding Algorithm. ICML 2014: 793-801
  - [i1] Hadi Daneshmand, Manuel Gomez-Rodriguez, Le Song, Bernhard Schölkopf: Estimating Diffusion Network Structures: Recovery Conditions, Sample Complexity & Soft-thresholding Algorithm. CoRR abs/1405.2936 (2014)
last updated on 2024-11-30 00:13 CET by the dblp team
all metadata released as open data under CC0 1.0 license