Computer Science > Hardware Architecture

arXiv:2310.00574 (cs)

[Submitted on 1 Oct 2023 (v1), last revised 23 Nov 2023 (this version, v3)]

Title:YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs

Authors:Cyrus Zhou, Zack Hassman, Ruize Xu, Dhirpal Shah, Vaugnn Richard, Yanjing Li

View PDF

Abstract:We address the challenges associated with deploying neural networks on CPUs, with a particular focus on minimizing inference time while maintaining accuracy. Our novel approach is to use the dataflow (i.e., computation order) of a neural network to explore data reuse opportunities using heuristic-guided analysis and a code generation framework, which enables exploration of various Single Instruction, Multiple Data (SIMD) implementations to achieve optimized neural network execution. Our results demonstrate that the dataflow that keeps outputs in SIMD registers while also maximizing both input and weight reuse consistently yields the best performance for a wide variety of inference workloads, achieving up to 3x speedup for 8-bit neural networks, and up to 4.8x speedup for binary neural networks, respectively, over the optimized implementations of neural networks today.

Subjects:	Hardware Architecture (cs.AR); Machine Learning (cs.LG); Performance (cs.PF)
ACM classes:	B.8.2
Cite as:	arXiv:2310.00574 [cs.AR]
	(or arXiv:2310.00574v3 [cs.AR] for this version)
	https://doi.org/10.48550/arXiv.2310.00574

Submission history

From: Cyrus Zhou [view email]
[v1] Sun, 1 Oct 2023 05:11:54 UTC (5,610 KB)
[v2] Tue, 3 Oct 2023 18:29:44 UTC (5,610 KB)
[v3] Thu, 23 Nov 2023 16:24:39 UTC (12,077 KB)

Computer Science > Hardware Architecture

Title:YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Hardware Architecture

Title:YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators