[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3149457.3149470acmotherconferencesArticle/Chapter ViewAbstractPublication PageshpcasiaConference Proceedingsconference-collections
research-article

Performance improvement of the general-purpose CFD code FrontFlow/blue on the K computer

Published: 28 January 2018 Publication History

Abstract

The general-purpose fluid simulation software FrontFlow/blue (FFB) is based on the finite element method (FEM). It was designed to accept extremely large-scale simulations and is an important application in the manufacturing field in Japan. Moreover, since this application is significant in both the manufacturing field and the development of the post-K supercomputer, it is employed as an important application for the new post-K supercomputer that is under development. The K computer is still the important infrastructure in Japan. And there are some supercomputers having the same architecture to the K computer. Therefore we continue to improve the performance of the FFB on the K computer. On significant subroutines, several improvement techniques, store order based loop modification decreasing total load and store operations, unrolled loop rerolling to employ SIMD load instruction, adjusting number of arrays in loop, using sector cache function, and so on, were employed. As a result, an improvement of 160% was obtained on a single CPU performance. This paper shows and discusses the detail of these improvements.

References

[1]
Alexander Joel Chorin. 1968. Numerical Solution of the Navier-Stokes Equations. Math. Comp. 22, 104 (1968), 745--762.
[2]
RIKEN Advanced Institute for Computational Science. 2017. FLAGSHIP 2020 Project. (2017). Retrieved August 9, 2017 from http://www.aics.riken.jp/fs2020p/en/
[3]
Center for Computational Sciences (University of Tsukuba) / Information Technology Center (The University of Tokyo). 2017. JCAHPC: Joint Center for Advanced HPC. (2017). Retrieved August 9, 2017 from http://jcahpc.jp/eng/ofp_intro.html
[4]
Collaborative Center for Reserch on Innovative Simulation Software. 2017. Download --- Center for Reserch on Innovative Simulation Software. (2017). Retrieved July 7, 2017 from http://www.ciss.iis.u-tokyo.ac.jp/english/dl/index.php
[5]
Yukihiro Hasegawa, Junichi Iwata, Miwako Tsuji, Daisuke Takahashi, Atsushi Oshiyama, Kazuo Minami, Taisuke Boku, Hikaru Inoue, Yoshito Kitazawa, Ikuo Miyoshi, and Mitsuo Yokokawa. 2013. Performance evaluation of ultra-large-scale first-principles electronic structure calculation code on the K computer. International Journal of High Performance Computing Applications 28, 3 (2013), 335--355.
[6]
Michael Heroux Jack Dongarra and Piotr Luszczek. 2017. HPCG. (2017). Retrieved August 9, 2017 from http://www.hpcg-benchmark.org/custom/index.html
[7]
JAMSTEC. 2017. EARTH SIMULATOR. (May 2017). Retrieved August 10, 2017 from https://www.jamstec.go.jp/es/en/index.html
[8]
Kiyoshi Kumahata, Shunsuke Inoue, and Kazuo Minami. 2013. Kernel performance improvement for the FEM-based fluid analysis code on the K computer. Procedia Computer Science 18 (2013), 2496--2499.
[9]
Kiyoshi Kumahata, Kazuo Minami, and Naoya Maruyama. 2015. High-performance conjugate gradient performance improvement on the K computer. International Journal of High Performance Computing Applications 30, 1 (2015), 55--70.
[10]
The Graph 500 List. 2017. Graph 500. (2017). Retrieved August 9, 2017 from http://graph500.org/
[11]
Takumi Maruyama. 2009. SPARC64(TM) VIIIfx: Fujitsu's New Generation Octo Core Processor for PETA Scale computing. Hot Chips 21 (2009).
[12]
Takumi Maruyama, Toshio Yoshida, Ryuji Kan, Iwao Yamazaki, Shuji Yamamura, Noriyuki Takahashi, Mikio Houdou, and Hiroshi Okano. 2010. Sparc64 VIIIfx: A New-Generation Octocore Processor for Petascale Computing. IEEE Micro 30, 2 (2010), 30--40.
[13]
John D. McCalpin. 1995. Memory Bandwidth and Machine Balance in Current High Performance Computers. IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter (Dec. 1995), 19--25.
[14]
Kazuo Minami, Shunsuke Inoue, Syuichi Chiba, Kiyoshi Kumahata, and Mitsuo Yokokawa. 2016. Performance Estimation of Programs by an Extension of the Roofline Model Considering Cache Effects. IPSJ Transactions on Advanced Computing Systems 9, 2 (2016), 1--14.
[15]
Masaki Satoh, Hirofumi Tomita, Hisashi Yashiro, Hiroaki Miura, Chihiro Kodama, Tatsuya Seiki, Akira T. Noda, Yohei Yamada, Daisuke Goto, Masahiro Sawada, Takemasa Miyoshi, Yosuke Niwa, Masayuki Hara, Tomoki Ohno, Shin-ichi Iga, Takashi Arakawa, Takahiro Inoue, and Hiroyasu Kubokawa. 2014. The Non-hydrostatic Icosahedral Atmospheric Model: description and development. Progress in Earth and Planetary Science 1, 1 (14 Oct 2014), 18.
[16]
J. Smagorinsky. 1963. General Circulation Experiments with the Primitive Equations. Monthly Weather Review 91, 3 (1963), 99--164.
[17]
Atsuhiro Tamura, Kazuo Kikuchi, and Tadayasu Takahashi. 1997. Residual Cutting Method for Elliptic Boundary Value Problems: Application to Poisson's Equation. J. Comput. Phys. 137 (1997), 247--264.
[18]
Samuel Williams, Andrew Waterman, and David Patterson. 2009. Roofline: An Insightful Visual Performance Model for Multicore Architectures. Cmmuication ACM 52, 4 (2009), 65--76.

Index Terms

  1. Performance improvement of the general-purpose CFD code FrontFlow/blue on the K computer

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    HPCAsia '18: Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region
    January 2018
    322 pages
    ISBN:9781450353724
    DOI:10.1145/3149457
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 January 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Finite Element Method
    2. Fluid Analysis
    3. Performance Improvement
    4. The K computer

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    HPC Asia 2018

    Acceptance Rates

    HPCAsia '18 Paper Acceptance Rate 30 of 67 submissions, 45%;
    Overall Acceptance Rate 69 of 143 submissions, 48%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 62
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 20 Dec 2024

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media