Update TH, THC, THNN, THCUNN by soumith · Pull Request #1097 · pytorch/pytorch · GitHub


Merged

soumith merged 9 commits into master from THSUpdate on Mar 24, 2017

Conversation

soumith
Member
@soumith soumith commented Mar 24, 2017

contbuilds

ngimel and others added 9 commits March 23, 2017 17:22
Currently, in-place and out-of-place updateGradOutput produce different results when input=max_val or input=min_val: the in-place version does not backprop the gradient at those boundary values, while the out-of-place version does.
in-place and out-of-place updateGradOutput results differ where input=min_val or input=max_val
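The discrepancy described above can be sketched in NumPy (a hedged illustration assuming a HardTanh-style clamp, with the min_val/max_val names taken from the commit message; the real kernels are THNN C code, not this):

```python
import numpy as np

min_val, max_val = -1.0, 1.0
x = np.array([-2.0, -1.0, 0.0, 1.0, 2.0])
grad_out = np.ones_like(x)

# Out-of-place backward keeps the original input around, so the gradient
# passes through at the boundaries input == min_val and input == max_val:
grad_oop = grad_out * ((x >= min_val) & (x <= max_val))

# In-place forward overwrites the input with clamp(input), so the backward
# pass only sees values strictly inside (min_val, max_val) and zeroes the
# gradient at the boundaries:
y = np.clip(x, min_val, max_val)
grad_inplace = grad_out * ((y > min_val) & (y < max_val))

print(grad_oop.tolist())      # boundary entries keep their gradient
print(grad_inplace.tolist())  # boundary entries are zeroed
```

At x = ±1.0 the two rules disagree, which is exactly the inconsistency this commit addresses.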
@soumith soumith merged commit cce0307 into master Mar 24, 2017
@soumith soumith deleted the THSUpdate branch March 24, 2017 23:49
jjsjann123 pushed a commit to jjsjann123/pytorch that referenced this pull request Nov 5, 2021
rraminen pushed a commit to rraminen/pytorch that referenced this pull request Sep 29, 2022
hubertlu-tw pushed a commit to hubertlu-tw/pytorch that referenced this pull request Nov 1, 2022
Take-over of pytorch#1097

* Add fast CUDA focal loss implementation.

* Enable fast math for CUDA focal loss.

* Correct typo.

* replace deprecated macros

* TORCH_CUDA_CHECK -> AT_CUDA_CHECK

The former is defined only in torch/csrc/profiler/cuda.cpp, so it is usually not available.
The latter, however, is defined in ATen/cuda/Exceptions.h as an alias of C10_CUDA_CHECK.

* add test

* clean up

* guard for torchvision

Co-authored-by: Wil Kong <alpha0422@gmail.com>
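For context on the focal loss the commits above refer to, a reference (unfused) formulation is sketched below in NumPy. This is the standard binary focal loss formula, not the fused CUDA kernel this PR adds; the function name and defaults are illustrative assumptions:

```python
import numpy as np

def focal_loss(logits, targets, gamma=2.0, alpha=0.25):
    """Reference binary focal loss: FL(p_t) = -alpha_t * (1 - p_t)**gamma * log(p_t)."""
    p = 1.0 / (1.0 + np.exp(-logits))             # sigmoid probability
    p_t = np.where(targets == 1, p, 1.0 - p)      # probability of the true class
    alpha_t = np.where(targets == 1, alpha, 1.0 - alpha)
    return -alpha_t * (1.0 - p_t) ** gamma * np.log(p_t)
```

With gamma=0 and alpha=0.5 this reduces to half the binary cross-entropy; the CUDA implementation fuses these elementwise steps into one kernel for speed.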
akashveramd pushed a commit to akashveramd/pytorch that referenced this pull request Apr 9, 2025
* Squashed 'src/composable_kernel/' content from commit f6edda6

git-subtree-dir: src/composable_kernel
git-subtree-split: f6edda6

* add solver ConvIgemmFwdV6r1DlopsNchwKcyxNkhw; rename static ck source files

* Squashed 'src/composable_kernel/' changes from f6edda6..5781adf

5781adf Update develop (pytorch#5) (pytorch#6)
97e6d51 Merge pull request pytorch#4 from ROCmSoftwarePlatform/separate_online_compile
7b1ec41 refactor
49c33aa refactor
54b3e73 rename

git-subtree-dir: src/composable_kernel
git-subtree-split: 5781adf

* fix

* refactor

* remove online compilation from CK

* refactor

* fix

* add ctest

* add c-style pointer cast

* vector/scalar pointer cast use c-style pointer cast instead of reinterpret_cast

* fix clang warning suppression

* tidy

* suppress cppcheck

* fix enum issue

* revert changes to hip build

* fix kernel filename

* update CK build script

* rename

* rename

* make inner product compatible on gfx900

* Update src/include/miopen/solver/ck_utility_common.hpp

Co-authored-by: JD <Jehandad.Khan@amd.com>

* compiler parameter use stream

* use int instead of index_t in kernel wrapper

* DynamicBuffer, StaticBuffer, amd_buffer_load support customized value for invalid element

* refactor

* refactor

* change cmakelist

* change ck common utility

* fix

Co-authored-by: JD <Jehandad.Khan@amd.com>