

#author("2018-02-02T17:00:25+09:00","default:kow","kow")
* CUDA n-body HP Z8 G4 Workstation Quadro P400 CUDA 9.1-387.26 [#g263cb1a]
~
** Node specifications: [#dc8db053]
CENTER:
|LEFT:100|LEFT:350|c
|Company   |HP|
|Product   |Z8 G4 Workstation|
|CPU       |Intel Xeon Gold 6132 @ 2.60GHz|
|Cores     |28 cores/node (14 cores/CPU)|
|GPU       |Quadro P400|
|CUDA info |release 9.1, V9.1.85|
|CUDA driver|387.26|
|Memory    |64 GB|
|OS        |CentOS Linux release 7.4.1708 x64|
|Compiler  |gcc version 4.8.5 20150623|
** Results: [#m950f99a]
|BGCOLOR(#9BC2E6):COLOR(#000000):CENTER:CPU/GPU|BGCOLOR(#9BC2E6):COLOR(#000000):CENTER:single-precision GFLOP/s|BGCOLOR(#9BC2E6):COLOR(#000000):CENTER:double-precision GFLOP/s|
|COLOR(#000000):CENTER:Quadro P400|COLOR(#000000):CENTER:442.033|COLOR(#000000):CENTER:16.179|
** Execution logs: [#kb4d91d1]
~
***nvidia-smi info [#l16caea0]
#highlight(php:nogutter:collapse){{

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 387.26                 Driver Version: 387.26                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Quadro P400         Off  | 00000000:2D:00.0  On |                  N/A |
| 34%   45C    P0    N/A /  N/A |    119MiB /  1991MiB |    100%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0      3357      G   /usr/bin/X                                    37MiB |
|    0      3399      G   /usr/bin/gnome-shell                           7MiB |
|    0    109358      C   ./nbody                                       63MiB |
+-----------------------------------------------------------------------------+

==============NVSMI LOG==============

Timestamp                           : Fri Feb  2 16:38:10 2018
Driver Version                      : 387.26

Attached GPUs                       : 1
GPU 00000000:2D:00.0
    Product Name                    : Quadro P400
    Product Brand                   : Quadro
    Display Mode                    : Enabled
    Display Active                  : Enabled
    Persistence Mode                : Disabled
    Accounting Mode                 : Disabled
    Accounting Mode Buffer Size     : 1920
    Driver Model
        Current                     : N/A
        Pending                     : N/A
    Serial Number                   : ***************
    GPU UUID                        : ***************
    Minor Number                    : 0
    VBIOS Version                   : 86.07.3B.00.47
    MultiGPU Board                  : No
    Board ID                        : 0x2d00
    GPU Part Number                 : 900-5G212-0300-001
    Inforom Version
        Image Version               : G212.0500.00.01
        OEM Object                  : 1.1
        ECC Object                  : N/A
        Power Management Object     : N/A
    GPU Operation Mode
        Current                     : N/A
        Pending                     : N/A
    GPU Virtualization Mode
        Virtualization mode         : None
    PCI
        Bus                         : 0x2D
        Device                      : 0x00
        Domain                      : 0x0000
        Device Id                   : 0x1CB310DE
        Bus Id                      : 00000000:2D:00.0
        Sub System Id               : 0x11BE103C
        GPU Link Info
            PCIe Generation
                Max                 : 3
                Current             : 3
            Link Width
                Max                 : 16x
                Current             : 16x
        Bridge Chip
            Type                    : N/A
            Firmware                : N/A
        Replays since reset         : 0
        Tx Throughput               : 0 KB/s
        Rx Throughput               : 0 KB/s
    Fan Speed                       : 34 %
    Performance State               : P0
    Clocks Throttle Reasons
        Idle                        : Not Active
        Applications Clocks Setting : Not Active
        SW Power Cap                : Not Active
        HW Slowdown                 : Not Active
            HW Thermal Slowdown     : Not Active
            HW Power Brake Slowdown : Not Active
        Sync Boost                  : Not Active
        SW Thermal Slowdown         : Not Active
    FB Memory Usage
        Total                       : 1991 MiB
        Used                        : 121 MiB
        Free                        : 1870 MiB
    BAR1 Memory Usage
        Total                       : 256 MiB
        Used                        : 13 MiB
        Free                        : 243 MiB
    Compute Mode                    : Default
    Utilization
        Gpu                         : 100 %
        Memory                      : 1 %
        Encoder                     : 0 %
        Decoder                     : 0 %
    Encoder Stats
        Active Sessions             : 0
        Average FPS                 : 0
        Average Latency             : 0
    Ecc Mode
        Current                     : N/A
        Pending                     : N/A
    ECC Errors
        Volatile
            Single Bit
                Device Memory       : N/A
                Register File       : N/A
                L1 Cache            : N/A
                L2 Cache            : N/A
                Texture Memory      : N/A
                Texture Shared      : N/A
                CBU                 : N/A
                Total               : N/A
            Double Bit
                Device Memory       : N/A
                Register File       : N/A
                L1 Cache            : N/A
                L2 Cache            : N/A
                Texture Memory      : N/A
                Texture Shared      : N/A
                CBU                 : N/A
                Total               : N/A
        Aggregate
            Single Bit
                Device Memory       : N/A
                Register File       : N/A
                L1 Cache            : N/A
                L2 Cache            : N/A
                Texture Memory      : N/A
                Texture Shared      : N/A
                CBU                 : N/A
                Total               : N/A
            Double Bit
                Device Memory       : N/A
                Register File       : N/A
                L1 Cache            : N/A
                L2 Cache            : N/A
                Texture Memory      : N/A
                Texture Shared      : N/A
                CBU                 : N/A
                Total               : N/A
    Retired Pages
        Single Bit ECC              : N/A
        Double Bit ECC              : N/A
        Pending                     : N/A
    Temperature
        GPU Current Temp            : 45 C
        GPU Shutdown Temp           : 103 C
        GPU Slowdown Temp           : 100 C
        GPU Max Operating Temp      : N/A
        Memory Current Temp         : N/A
        Memory Max Operating Temp   : N/A
    Power Readings
        Power Management            : N/A
        Power Draw                  : N/A
        Power Limit                 : N/A
        Default Power Limit         : N/A
        Enforced Power Limit        : N/A
        Min Power Limit             : N/A
        Max Power Limit             : N/A
    Clocks
        Graphics                    : 1252 MHz
        SM                          : 1252 MHz
        Memory                      : 2004 MHz
        Video                       : 1126 MHz
    Applications Clocks
        Graphics                    : 1227 MHz
        Memory                      : 2005 MHz
    Default Applications Clocks
        Graphics                    : 1227 MHz
        Memory                      : 2005 MHz
    Max Clocks
        Graphics                    : 1252 MHz
        SM                          : 1252 MHz
        Memory                      : 2005 MHz
        Video                       : 1126 MHz
    Max Customer Boost Clocks
        Graphics                    : 1252 MHz
    Clock Policy
        Auto Boost                  : N/A
        Auto Boost Default          : N/A
    Processes
        Process ID                  : 3357
            Type                    : G
            Name                    : /usr/bin/X
            Used GPU Memory         : 37 MiB
        Process ID                  : 3399
            Type                    : G
            Name                    : /usr/bin/gnome-shell
            Used GPU Memory         : 8 MiB
        Process ID                  : 109358
            Type                    : C
            Name                    : ./nbody
            Used GPU Memory         : 63 MiB
}}
~
***single-precision [#f53c818d]
#highlight(php:nogutter:collapse){{
$ ./nbody -benchmark -numbodies=204800 -fp32
Run "nbody -benchmark [-numbodies=<numBodies>]" to measure performance.
        -fullscreen       (run n-body simulation in fullscreen mode)
        -fp64             (use double precision floating point values for simulation)
        -hostmem          (stores simulation data in host memory)
        -benchmark        (run benchmark to measure performance)
        -numbodies=<N>    (number of bodies (>= 1) to run in simulation)
        -device=<d>       (where d=0,1,2.... for the CUDA device to use)
        -numdevices=<i>   (where i=(number of CUDA devices > 0) to use for simulation)
        -compare          (compares simulation results running once on the default GPU and once on the CPU)
        -cpu              (run n-body simulation on the CPU)
        -tipsy=<file.bin> (load a tipsy model file for simulation)

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.

> Windowed mode
> Simulation data stored in video memory
> Single precision floating point simulation
> 1 Devices used for simulation
GPU Device 0: "Quadro P400" with compute capability 6.1

> Compute 6.1 CUDA device: [Quadro P400]
number of bodies = 204800
204800 bodies, total time for 10 iterations: 18977.334 ms
= 22.102 billion interactions per second
= 442.033 single-precision GFLOP/s at 20 flops per interaction
$
}}
~
***double-precision [#r7529792]
#highlight(php:nogutter:collapse){{
$ ./nbody -benchmark -numbodies=204800 -fp64
Run "nbody -benchmark [-numbodies=<numBodies>]" to measure performance.
        -fullscreen       (run n-body simulation in fullscreen mode)
        -fp64             (use double precision floating point values for simulation)
        -hostmem          (stores simulation data in host memory)
        -benchmark        (run benchmark to measure performance)
        -numbodies=<N>    (number of bodies (>= 1) to run in simulation)
        -device=<d>       (where d=0,1,2.... for the CUDA device to use)
        -numdevices=<i>   (where i=(number of CUDA devices > 0) to use for simulation)
        -compare          (compares simulation results running once on the default GPU and once on the CPU)
        -cpu              (run n-body simulation on the CPU)
        -tipsy=<file.bin> (load a tipsy model file for simulation)

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.

> Windowed mode
> Simulation data stored in video memory
> Double precision floating point simulation
> 1 Devices used for simulation
GPU Device 0: "Quadro P400" with compute capability 6.1

> Compute 6.1 CUDA device: [Quadro P400]
number of bodies = 204800
204800 bodies, total time for 10 iterations: 777731.125 ms
= 0.539 billion interactions per second
= 16.179 double-precision GFLOP/s at 30 flops per interaction
$
}}
~
~
** Notes: [#g54ca308]
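~
The GFLOP/s figures in the logs above can be cross-checked from the body count and the reported total time. The sketch below is a minimal recomputation, not part of the CUDA sample itself. It assumes that every iteration evaluates all N x N pairwise interactions, and it takes the flops-per-interaction constants (20 for single precision, 30 for double precision) from the benchmark output. The function name `nbody_gflops` is a hypothetical helper introduced here for illustration.

```python
# Recompute the nbody benchmark figures from the logged body count and timings.
# Assumption: each iteration evaluates all N*N pairwise interactions.

def nbody_gflops(num_bodies, iterations, total_ms, flops_per_interaction):
    """Return (billion interactions per second, GFLOP/s)."""
    interactions = num_bodies * num_bodies * iterations
    seconds = total_ms / 1000.0
    ginteractions_per_s = interactions / seconds / 1e9
    return ginteractions_per_s, ginteractions_per_s * flops_per_interaction

# Values copied from the single- and double-precision logs on this page.
fp32 = nbody_gflops(204800, 10, 18977.334, 20)
fp64 = nbody_gflops(204800, 10, 777731.125, 30)

print("fp32: %.3f G interactions/s, %.3f GFLOP/s" % fp32)
print("fp64: %.3f G interactions/s, %.3f GFLOP/s" % fp64)
print("fp32/fp64 GFLOP/s ratio: %.1f" % (fp32[1] / fp64[1]))
```

Under these assumptions the script should reproduce the logged 442.033 and 16.179 GFLOP/s to within rounding, which puts double-precision throughput on this GPU at roughly 1/27 of single-precision throughput.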
~
~
#highlight(end)
~
~
&tag(CPU,GPU,HP,CuDA,n-body,Benchmark,Quadro P400,gcc);
~
~
[[Goto Top Page>/]]
~

