トップ   編集 差分 バックアップ 添付 複製 名前変更 リロード   新規 一覧 単語検索 最終更新   ヘルプ   最終更新のRSS

Top / 計算機性能評価 / NAS Parallel Benchmark / NAS Parallel OMP HP ProLiant DL360 Gen10 Silver4112 RHEL7.3x64 gcc

#author("2018-01-22T13:50:14+09:00","default:kow","kow")
* NAS Parallel OMP HP ProLiant DL360 Gen10 Silver4112 RHEL7.3x64 gcc [#rd614ec7]
~
** Nodeスペック: [#o3d148cd]
CENTER:
|LEFT:100|LEFT:350|c
|Company   |HPE|
|Product   |ProLiant DL360 Gen10|
|CPU       |Silver 4112 CPU @ 2.60GHz|
|Core      |4Core/Node (4Core/1CPU)|
|Memory    |16GByte|
|OS        |Red Hat Enterprise Linux Server release 7.3 x64|
|compiler  |gcc バージョン 4.8.5 20150623 (Red Hat 4.8.5-11) (GCC)|
** 結果値: [#o09b8098]
|CENTER:実行時Core数|>|CENTER:bt|>|CENTER:cg|>|CENTER:lu|h
|~|CENTER:Mop/s total|CENTER:Mop/s/thread|CENTER:Mop/s total|CENTER:Mop/s/thread|CENTER:Mop/s total|CENTER:Mop/s/thread|h
|CENTER:100|CENTER:|CENTER:|CENTER:|CENTER:|CENTER:|CENTER:|c
|処理性能|9292.09|2323.02|2012.24|503.06|5316.84|1329.21|
** 実行結果ログ: [#kb4d91d1]
~
***bt.C.x 並列実行結果 [#vcf46861]
#highlight(php:nogutter:collapse){{
 NAS Parallel Benchmarks (NPB3.3-OMP) - BT Benchmark

 No input file inputbt.data. Using compiled defaults
 Size:  162x 162x 162
 Iterations:  200       dt:   0.0001000
 Number of available threads:     4

 Time step    1
 Time step   20
 Time step   40
 Time step   60
 Time step   80
 Time step  100
 Time step  120
 Time step  140
 Time step  160
 Time step  180
 Time step  200
 Verification being performed for class C
 accuracy setting for epsilon =  0.1000000000000E-07
 Comparison of RMS-norms of residual
           1 0.6239811655176E+04 0.6239811655176E+04 0.1457567554973E-14
           2 0.5079323919042E+03 0.5079323919042E+03 0.6714683264956E-15
           3 0.1542353009301E+04 0.1542353009301E+04 0.8402939808271E-14
           4 0.1330238792929E+04 0.1330238792929E+04 0.1247766821685E-13
           5 0.1160408742844E+05 0.1160408742844E+05 0.1410787770505E-14
 Comparison of RMS-norms of solution error
           1 0.1646200836909E+03 0.1646200836909E+03 0.8114564840929E-14
           2 0.1149710790382E+02 0.1149710790382E+02 0.8034242734913E-14
           3 0.4120744620746E+02 0.4120744620746E+02 0.2069167981484E-14
           4 0.3708765105969E+02 0.3708765105969E+02 0.1245300699957E-13
           5 0.3621105305184E+03 0.3621105305184E+03 0.1569780883738E-14
 Verification Successful


 BT Benchmark Completed.
 Class           =                        C
 Size            =            162x 162x 162
 Iterations      =                      200
 Time in seconds =                   308.46
 Total threads   =                        4
 Avail threads   =                        4
 Mop/s total     =                  9292.09
 Mop/s/thread    =                  2323.02
 Operation type  =           floating point
 Verification    =               SUCCESSFUL
 Version         =                    3.3.1
 Compile date    =              22 Jan 2018

 Compile options:
    F77          = gfortran
    FLINK        = $(F77)
    F_LIB        = (none)
    F_INC        = (none)
    FFLAGS       = -O3 -fopenmp -mcmodel=medium
    FLINKFLAGS   = -O3 -fopenmp -mcmodel=medium
    RAND         = (none)


 Please send all errors/feedbacks to:

 NPB Development Team
 npb@nas.nasa.gov
}}
~
***cg.C.x 並列実行結果 [#v257ea19]
#highlight(php:nogutter:collapse){{
 NAS Parallel Benchmarks (NPB3.3-OMP) - CG Benchmark

 Size:      150000
 Iterations:                     75
 Number of available threads:     4

 Initialization time =           4.556 seconds

   iteration           ||r||                 zeta
        1       0.35342969571864E-12   109.9994423237398
        2       0.80219133297630E-15    27.3920437146526
        3       0.85574447652758E-15    28.0339761840275
        4       0.87844870170470E-15    28.4191507551287
        5       0.89263848334069E-15    28.6471670038893
        6       0.89522822865143E-15    28.7812969418412
        7       0.89605126130110E-15    28.8600458537343
        8       0.90669293134117E-15    28.9063145572685
        9       0.90905888366470E-15    28.9335649214621
       10       0.91285873129235E-15    28.9496695449003
       11       0.91135377762715E-15    28.9592263353755
       12       0.91005071915392E-15    28.9649233484889
       13       0.90881369904544E-15    28.9683359248936
       14       0.91384433028569E-15    28.9703903960005
       15       0.90790081125761E-15    28.9716336228371
       16       0.91352094218022E-15    28.9723898600853
       17       0.90997449028923E-15    28.9728522692229
       18       0.90872021543804E-15    28.9731364797778
       19       0.90655993278499E-15    28.9733120569288
       20       0.91485184416931E-15    28.9734210663747
       21       0.91526153116107E-15    28.9734890760554
       22       0.91467892989921E-15    28.9735317064871
       23       0.90804202956715E-15    28.9735585497259
       24       0.91591023116125E-15    28.9735755256648
       25       0.90996813466886E-15    28.9735863059382
       26       0.90294660645813E-15    28.9735931787373
       27       0.91505411065189E-15    28.9735975767371
       28       0.91037046197223E-15    28.9736004009989
       29       0.90782357195105E-15    28.9736022206771
       30       0.91010422340857E-15    28.9736033967627
       31       0.91039915176318E-15    28.9736041591157
       32       0.90526154774440E-15    28.9736046546405
       33       0.91143699305928E-15    28.9736049775614
       34       0.91184967788946E-15    28.9736051885099
       35       0.91007517436503E-15    28.9736053266246
       36       0.90583328732782E-15    28.9736054172455
       37       0.90790990594248E-15    28.9736054768226
       38       0.90865567889760E-15    28.9736055160639
       39       0.91115118922591E-15    28.9736055419573
       40       0.90801783879124E-15    28.9736055590699
       41       0.90853292825276E-15    28.9736055703976
       42       0.90703438188556E-15    28.9736055779073
       43       0.90932825447369E-15    28.9736055828934
       44       0.90582786619525E-15    28.9736055862072
       45       0.90902796052357E-15    28.9736055884132
       46       0.90478092877073E-15    28.9736055898825
       47       0.90558418708826E-15    28.9736055908630
       48       0.90918105785008E-15    28.9736055915179
       49       0.90686679960520E-15    28.9736055919554
       50       0.91002595932896E-15    28.9736055922485
       51       0.90556204206949E-15    28.9736055924450
       52       0.90130864201723E-15    28.9736055925761
       53       0.90444624529116E-15    28.9736055926646
       54       0.90830940804740E-15    28.9736055927235
       55       0.90268330213574E-15    28.9736055927632
       56       0.90657830982943E-15    28.9736055927905
       57       0.90467991779118E-15    28.9736055928077
       58       0.90395196141934E-15    28.9736055928208
       59       0.90609125891058E-15    28.9736055928281
       60       0.90904345253292E-15    28.9736055928339
       61       0.90689467752087E-15    28.9736055928375
       62       0.90781147636321E-15    28.9736055928397
       63       0.90711500539411E-15    28.9736055928419
       64       0.90286033085507E-15    28.9736055928432
       65       0.90546511392042E-15    28.9736055928440
       66       0.90827756760431E-15    28.9736055928438
       67       0.90638818282857E-15    28.9736055928450
       68       0.90593177369828E-15    28.9736055928445
       69       0.90925683287830E-15    28.9736055928450
       70       0.90266416577580E-15    28.9736055928455
       71       0.90740357269818E-15    28.9736055928456
       72       0.90881436839946E-15    28.9736055928457
       73       0.90419525691135E-15    28.9736055928458
       74       0.90438949889961E-15    28.9736055928450
       75       0.90193406197394E-15    28.9736055928458
 Benchmark completed
 VERIFICATION SUCCESSFUL
 Zeta is     0.2897360559285E+02
 Error is    0.2758926827966E-13


 CG Benchmark Completed.
 Class           =                        C
 Size            =                   150000
 Iterations      =                       75
 Time in seconds =                    71.24
 Total threads   =                        4
 Avail threads   =                        4
 Mop/s total     =                  2012.24
 Mop/s/thread    =                   503.06
 Operation type  =           floating point
 Verification    =               SUCCESSFUL
 Version         =                    3.3.1
 Compile date    =              22 Jan 2018

 Compile options:
    F77          = gfortran
    FLINK        = $(F77)
    F_LIB        = (none)
    F_INC        = (none)
    FFLAGS       = -O3 -fopenmp -mcmodel=medium
    FLINKFLAGS   = -O3 -fopenmp -mcmodel=medium
    RAND         = randi8


 Please send all errors/feedbacks to:

 NPB Development Team
 npb@nas.nasa.gov
}}
~
***lu.C.x 並列実行結果 [#a50edfba]
#highlight(php:nogutter:collapse){{
 NAS Parallel Benchmarks (NPB3.3-OMP) - LU Benchmark

 Size:  162x 162x 162
 Iterations:                    250
 Number of available threads:     4

 Time step    1
 Time step   20
 Time step   40
 Time step   60
 Time step   80
 Time step  100
 Time step  120
 Time step  140
 Time step  160
 Time step  180
 Time step  200
 Time step  220
 Time step  240
 Time step  250

 Verification being performed for class C
 Accuracy setting for epsilon =  0.1000000000000E-07
 Comparison of RMS-norms of residual
           1   0.1037669803235E+05 0.1037669803235E+05 0.1893192371701E-13
           2   0.8922124588010E+03 0.8922124588010E+03 0.5861378064841E-14
           3   0.2562388145827E+04 0.2562388145827E+04 0.2662051912161E-14
           4   0.2191943438578E+04 0.2191943438578E+04 0.2489557036279E-14
           5   0.1780780572611E+05 0.1780780572611E+05 0.2042912452576E-15
 Comparison of RMS-norms of solution error
           1   0.2159863997170E+03 0.2159863997169E+03 0.7105874773843E-14
           2   0.1557895592399E+02 0.1557895592399E+02 0.9691941622131E-14
           3   0.5413188630772E+02 0.5413188630772E+02 0.0000000000000E+00
           4   0.4822626431540E+02 0.4822626431540E+02 0.5893408878723E-15
           5   0.4559029100433E+03 0.4559029100433E+03 0.3241762352955E-14
 Comparison of surface integral
               0.6664045535722E+02 0.6664045535722E+02 0.1919219965579E-14
 Verification Successful


 LU Benchmark Completed.
 Class           =                        C
 Size            =            162x 162x 162
 Iterations      =                      250
 Time in seconds =                   383.50
 Total threads   =                        4
 Avail threads   =                        4
 Mop/s total     =                  5316.84
 Mop/s/thread    =                  1329.21
 Operation type  =           floating point
 Verification    =               SUCCESSFUL
 Version         =                    3.3.1
 Compile date    =              22 Jan 2018

 Compile options:
    F77          = gfortran
    FLINK        = $(F77)
    F_LIB        = (none)
    F_INC        = (none)
    FFLAGS       = -O3 -fopenmp -mcmodel=medium
    FLINKFLAGS   = -O3 -fopenmp -mcmodel=medium
    RAND         = (none)


 Please send all errors/feedbacks to:

 NPB Development Team
 npb@nas.nasa.gov
}}
~
** 特記事項: [#g54ca308]
~
~
#highlight(end)
~
~
&tag(CPU,Hewlett-Packard,Benchmark,NAS,Silver4112,gcc);
~
~
[[Goto Top Page>/]]
~


Top / 計算機性能評価 / NAS Parallel Benchmark / NAS Parallel OMP HP ProLiant DL360 Gen10 Silver4112 RHEL7.3x64 gcc