We have implemented three algorithms in C.
- Large tensor first: a reuse algorithm that allocates tensor memory in descending order of tensor size.
- Short lifetime first: a reuse algorithm that allocates tensor memory in ascending order of lifetime length.
- Eager reuse: our algorithm, which generates the allocation order by exploiting the relative positions of tensor lifetimes.
For the large tensor first algorithm, we implemented two versions, v1 and v2. v2 consistently outperforms v1, so we use v2 for testing by default; v1 is still available in `test.c`.
The simulation is written in C, so a standard gcc environment is sufficient. Ubuntu 22.04 is recommended.
```shell
$ make
$ ./test.sh
```
A summary of the results is printed to the screen; more details can be viewed in `result/` and `visual_result/`.
Modify the content of `input/input0`; the format is as follows.
We use Figure 1 as an example for illustration.
Each line contains a node ID, its output tensor size, and the number of adjacent nodes. If that number is greater than zero, the subsequent numbers are the IDs of the adjacent nodes.
Node_ID | Tensor_size | Adjacent_nodes_num | Adjacent_nodes_id |
---|---|---|---|
1 | 5 | 3 | 2 3 4 |
2 | 3 | 1 | 5 |
3 | 3 | 1 | 5 |
4 | 3 | 1 | 5 |
5 | 1 | 1 | 6 |
6 | 0 | 0 | |
- Note 1: the number of Adjacent_nodes_id entries equals Adjacent_nodes_num.
- Note 2: our algorithm supports a breadth-first-search scheduling order by default; support for more scheduling algorithms is future work.
After changing the graph and rerunning the test, the result can be viewed in `result/result0`. The following is the text output for input0.
```
----------------output the graph in adjacency list----------------
[1]----->[2]----->[3]
[2]----->[4]
[3]----->[5]
[4]----->[6]
[5]----->[6]
[6]----->[7]
[7]
Eager Reuse :
Tensor_ID[ 1] Base_address[ 1] Base+Size[ 4] lifetime_begin[ 1] lifetime_end[ 3]
Tensor_ID[ 2] Base_address[ 4] Base+Size[ 6] lifetime_begin[ 2] lifetime_end[ 4]
Tensor_ID[ 3] Base_address[ 9] Base+Size[15] lifetime_begin[ 3] lifetime_end[ 5]
Tensor_ID[ 4] Base_address[ 0] Base+Size[ 4] lifetime_begin[ 4] lifetime_end[ 6]
Tensor_ID[ 5] Base_address[ 4] Base+Size[ 9] lifetime_begin[ 5] lifetime_end[ 6]
Tensor_ID[ 6] Base_address[ 9] Base+Size[10] lifetime_begin[ 6] lifetime_end[ 7]
Tensor_ID[ 7] Base_address[ 9] Base+Size[ 9] lifetime_begin[ 7] lifetime_end[ 7]
Eager Reuse Result: memory_size=15 overhead=25 us
Large tensor first :
Tensor_ID[ 1] Base_address[ 6] Base+Size[ 9] lifetime_begin[ 1] lifetime_end[ 3]
Tensor_ID[ 2] Base_address[ 9] Base+Size[11] lifetime_begin[ 2] lifetime_end[ 4]
Tensor_ID[ 3] Base_address[ 0] Base+Size[ 6] lifetime_begin[ 3] lifetime_end[ 5]
Tensor_ID[ 4] Base_address[11] Base+Size[15] lifetime_begin[ 4] lifetime_end[ 6]
Tensor_ID[ 5] Base_address[ 6] Base+Size[11] lifetime_begin[ 5] lifetime_end[ 6]
Tensor_ID[ 6] Base_address[ 0] Base+Size[ 1] lifetime_begin[ 6] lifetime_end[ 7]
Tensor_ID[ 7] Base_address[ 0] Base+Size[ 0] lifetime_begin[ 7] lifetime_end[ 7]
Large tensor first Result: memory_size=15 overhead=2 us
Short lifetime first :
Tensor_ID[ 1] Base_address[ 0] Base+Size[ 3] lifetime_begin[ 1] lifetime_end[ 3]
Tensor_ID[ 2] Base_address[ 3] Base+Size[ 5] lifetime_begin[ 2] lifetime_end[ 4]
Tensor_ID[ 3] Base_address[ 5] Base+Size[11] lifetime_begin[ 3] lifetime_end[ 5]
Tensor_ID[ 4] Base_address[11] Base+Size[15] lifetime_begin[ 4] lifetime_end[ 6]
Tensor_ID[ 5] Base_address[ 0] Base+Size[ 5] lifetime_begin[ 5] lifetime_end[ 6]
Tensor_ID[ 6] Base_address[ 5] Base+Size[ 6] lifetime_begin[ 6] lifetime_end[ 7]
Tensor_ID[ 7] Base_address[ 0] Base+Size[ 0] lifetime_begin[ 7] lifetime_end[ 7]
Short lifetime first Result: memory_size=15 overhead=0 us
Global Optimization :
Tensor_ID[ 1] Base_address[ 0] Base+Size[ 3] lifetime_begin[ 1] lifetime_end[ 3]
Tensor_ID[ 2] Base_address[ 3] Base+Size[ 5] lifetime_begin[ 2] lifetime_end[ 4]
Tensor_ID[ 3] Base_address[ 5] Base+Size[11] lifetime_begin[ 3] lifetime_end[ 5]
Tensor_ID[ 4] Base_address[11] Base+Size[15] lifetime_begin[ 4] lifetime_end[ 6]
Tensor_ID[ 5] Base_address[ 0] Base+Size[ 5] lifetime_begin[ 5] lifetime_end[ 6]
Tensor_ID[ 6] Base_address[ 5] Base+Size[ 6] lifetime_begin[ 6] lifetime_end[ 7]
Tensor_ID[ 7] Base_address[ 6] Base+Size[ 6] lifetime_begin[ 7] lifetime_end[ 7]
Global Optimization Result: memory_size=15 overhead=1911 us
```
We generate visualizations of the output results with gnuplot; they can be viewed in `visual_result/`.
 | | | |
---|---|---|---|
memory consumption | 12 | 13 | 12 |
memory consumption | 13 | 13 | 13 |
memory consumption | 14 | 17 | 14 |
memory consumption | 17 | 15 | 15 |
memory consumption | 21 | 23 | 21 |
memory consumption | 24 | 20 | 20 |
memory consumption | 26 | 26 | 26 |