Implemented SGD optimizer with momentum and added tests. #217

joshbrown901 · 2025-03-31T03:34:40Z

I implemented the SGD optimizer with momentum and updated local repository at the kernel, starpu and tensor levels. I also provided tests for the sgd with momentum on the home directory to verify and check that the optimizer works correctly

Muxas · 2025-03-31T07:57:23Z

include/nntile/kernel.hh

@@ -34,6 +34,7 @@
 #include <nntile/kernel/prod_inplace.hh>
 #include <nntile/kernel/randn.hh>
 #include <nntile/kernel/relu.hh>
+#include <nntile/kernel/sgd_momentum.hh>


rename as sgd_step

Muxas · 2025-03-31T07:57:37Z

include/nntile/kernel/sgd_momentum.hh

+ * distributed-memory heterogeneous systems based on StarPU runtime system.
+ *
+ * @file include/nntile/kernel/sgd_momentum.hh
+ * SGD_MOMENTUM low-level kernels


Muxas · 2025-03-31T07:59:03Z

include/nntile/kernel/sgd_momentum/cpu.hh

+
+// SGD_MOMENTUM operation on a buffer
+template<typename T>
+void cpu(Index nelems, T *data)


Totally incorrect API, where are the gradients, parameters like learning rate?

Muxas · 2025-03-31T07:59:26Z

include/nntile/kernel/sgd_momentum/cuda.hh

+{
+
+template<typename T>
+void cuda(cudaStream_t stream, Index nelems, T *data)


Parameters of the function are incorrect

Muxas · 2025-03-31T07:59:48Z

include/nntile/starpu.hh

@@ -127,10 +128,11 @@ void init()
    gemm::init();
    hypot::init();
    hypot_scalar_inverse::init();
-    nrm2::init();
+   nrm2::init();


why remove space?

Muxas · 2025-03-31T08:15:16Z

tests/kernel/pow

do not put binary files into git repo

Muxas · 2025-03-31T08:15:25Z

tests/kernel/sqrt

do not put binary files into git repo

Muxas · 2025-03-31T08:15:50Z

test_sgd_momentum1.cpp

tests shall appear in test directory

Muxas · 2025-03-31T08:18:01Z

wrappers/python/nntile/optimizer/sgd_momentum.py

+# NNTile is software framework for fast training of big neural networks on
+# distributed-memory heterogeneous systems based on StarPU runtime system.
+#
+# @file wrappers/python/nntile/optimizer/sgd.py


take a look at wrappers/python/optimizer/adam.py

Muxas · 2025-03-31T08:20:00Z

wrappers/python/nntile/nntile_core.cc

@@ -516,6 +516,15 @@ void def_mod_tensor(py::module_ &m)
    m.def("relu_fp32", &relu<fp32_t>);
    m.def("relu_fp32_fast_tf32", &relu<fp32_fast_tf32_t>);

+    // Add activation functions for Tensor<T>
+    m.def("sgd_momentum_async_fp64", &sgd_momentum_async<fp64_t>);


This function call shall be propagated also into wrappers/python/nntile/functions.py (find adam_step)

Implemented SGD optimizer with momentum and added tests.

3f004ce

Muxas requested changes Mar 31, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented SGD optimizer with momentum and added tests. #217

Implemented SGD optimizer with momentum and added tests. #217

Implemented SGD optimizer with momentum and added tests. #217

Are you sure you want to change the base?

Implemented SGD optimizer with momentum and added tests. #217

Conversation

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment