InfiniteMac OSx86  


Reply
 
Thread tools Display modes
  #1  
Old 09-01-2009, 11:31 AM
mitchde mitchde is offline
Jaguar
 
Join Date: Aug 2009
Posts: 38
OpenCL GPU+CPU GALAXY Bench

10.6 only.





Using:
Start Galaxy
key s = switch compute Modes
>CPU>Single/Multi, CPU-Vector/SSE Single/Multi>GPU> GPU+CPU> (bold=start Mode)
key SPACE = Pause/go on
key 6 = Reset Szene
key Q = QUIT

DOWNLOAD 6 MB:
http://rapidshare.com/files/27355290...CL_Galaxis.zip

LLMV GCC 4.2 Compiler compiliert = high optimized Code

Some results (posted):

mitch (C2D 3GHZ, NV 9600 GT , 1440x1140)
24 Gigaflops / around 70 U/sec : CPU ( SIM: Vector Multi-Core CPU. Mode)
73 Gigaflops / around 220 U/sec : NV 9600 GT

1600x1200
21 Gigaflops / around 65 U/sec : CPU ( SIM: Vector Multi-Core CPU. Mode)
60 Gigaflops / around 190 U/sec : Nvidia 9600 GT

Users:
170 Gigaflops / around 505 U/sec : Nvidia 9800 GTX+

MacPro Early 2008
CPU : 95 Gigaflops
GPU 60 Gigaflops NV 8800GT
Reply With Quote
  #2  
Old 10-10-2009, 06:49 AM
iyohmamma iyohmamma is offline
Cheetah
 
Join Date: Jul 2009
Posts: 3
no work

links no work!!!!!!!! instructions??????
Reply With Quote
  #3  
Old 10-10-2009, 01:21 PM
mitchde mitchde is offline
Jaguar
 
Join Date: Aug 2009
Posts: 38
Hi, someone posted wrong (very OLD=Version1=removed) Links:

Here are my (i compiled that stuff) - all need 10.6.x.
ATI has problems with OpenCL, so only the Displacement works here. Galaxies cant run (need 10.6.2+...get fixed by Apple/ATI)

OpenCLBench_Displacement_Version 2
http://rapidshare.com/files/28747429...acement_V2.zip
Galaxies_8K_Version 2 - for slow OpenCL GPUs
http://rapidshare.com/files/28623515...xies_8K_V2.zip
Galaxies_32K_Version 2 - for fast gpus
http://rapidshare.com/files/28623429...ies_32K_V2.zip

Galaxies does an Starsimulation. If you have slow OpenCL GPU and/or slow CPU best use first the 8K Version.

Displacement does some rendering/shining.


Very soon an OpenCL Smoke Particles demo will be available:


Last edited by mitchde; 10-10-2009 at 01:31 PM.
Reply With Quote
  #4  
Old 10-10-2009, 04:48 PM
x986123's Avatar
x986123 x986123 is offline
Panther
 
Join Date: Aug 2009
Posts: 199
SMOKE! Whered you get that! I want try it =D


OS: Snow Leopard Mac OS X 10.6.2 (10C540) | Vanilla Kernel | 64 bit Kernel | EFI64

CPU: Intel Core i7 920 @ 3.6 GHz
GPU: 2x SLI Dual GTX 260 (OpenCL working)(QE+CI)
RAM: 6GB DDR3 1600MHz 7-7-7-20

AUDIO: Realtek (5.1 config)
HDD: RAID 0- 1 + 1TB HDD's | 320GB HDD
MOBO: MSI X58 Platinum SLI (Intel ICH10)

CASE: Haf 932 Full Tower Case + Red Cathodes
MONITOR: Asus VK266H 26" 1920x1200 LCD

All working
Reply With Quote
  #5  
Old 10-10-2009, 05:40 PM
thorazine74 thorazine74 is offline
 
Join Date: May 2009
Location: Spain
Posts: 411
Intel C2D 2,66@3,20 GHz + GeForce 8600GTS 512 Mb

Code:
1 OpenCL platform found!

[Platform 0]
    Name:         Apple
    Vendor:     Apple
    Version:     OpenCL 1.0 (Jul 15 2009 23:07:32)
    Profile:     FULL_PROFILE


[OpenCL-only Context]
2 OpenCL devices found!

[Device 0]
    Name:             GeForce 8600 GTS
    Vendor:         NVIDIA
    Type:             GPU 
    Device Version:     OpenCL 1.0
    Driver Version:     CLH 1.0
    Compute Units:         32
    Work Group Size:     512
    Clock:             1450 MHz
    Global Memory:         512 MB
    Local Memory:         16 KB
    Cache Size:         0 KB
    Cache Line Size:     0 Bytes
    Available:         Yes
    Double-Precision:     No
    Extensions: 
                cl_khr_byte_addressable_store
                cl_khr_global_int32_base_atomics
                cl_khr_global_int32_extended_atomics
                cl_APPLE_gl_sharing
                cl_APPLE_SetMemObjectDestructor
                cl_APPLE_ContextLoggingFunctions

[Device 1]
    Name:             Intel(R) Core(TM)2 Duo CPU     E8200  @ 2.66GHz
    Vendor:         Intel
    Type:             CPU 
    Device Version:     OpenCL 1.0
    Driver Version:     1.0
    Compute Units:         2
    Work Group Size:     1
    Clock:             3228 MHz
    Global Memory (Total):     2048 MB
    Global Memory (Host):     1536 MB
    Global Memory (PCIe):     512 MB
    Local Memory:         16 KB
    Cache Size:         6144 KB
    Cache Line Size:     64 Bytes
    Available:         Yes
    Double-Precision:     Yes
    Extensions: 
                cl_khr_fp64
                cl_khr_global_int32_base_atomics
                cl_khr_global_int32_extended_atomics
                cl_khr_local_int32_base_atomics
                cl_khr_local_int32_extended_atomics
                cl_khr_byte_addressable_store
                cl_APPLE_gl_sharing
                cl_APPLE_SetMemObjectDestructor
                cl_APPLE_ContextLoggingFunctions

[shared OpenCL+OpenGL Context]
2 OpenCL devices found!

[Device 0]
    Name:             GeForce 8600 GTS
    Vendor:         NVIDIA
    Type:             GPU 
    Device Version:     OpenCL 1.0
    Driver Version:     CLH 1.0
    Compute Units:         32
    Work Group Size:     512
    Clock:             1450 MHz
    Global Memory:         512 MB
    Local Memory:         16 KB
    Cache Size:         0 KB
    Cache Line Size:     0 Bytes
    Available:         Yes
    Double-Precision:     No
    Extensions: 
                cl_khr_byte_addressable_store
                cl_khr_global_int32_base_atomics
                cl_khr_global_int32_extended_atomics
                cl_APPLE_gl_sharing
                cl_APPLE_SetMemObjectDestructor
                cl_APPLE_ContextLoggingFunctions

[Device 1]
    Name:             Intel(R) Core(TM)2 Duo CPU     E8200  @ 2.66GHz
    Vendor:         Intel
    Type:             CPU 
    Device Version:     OpenCL 1.0
    Driver Version:     1.0
    Compute Units:         2
    Work Group Size:     1
    Clock:             3228 MHz
    Global Memory (Total):     2048 MB
    Global Memory (Host):     1536 MB
    Global Memory (PCIe):     512 MB
    Local Memory:         16 KB
    Cache Size:         6144 KB
    Cache Line Size:     64 Bytes
    Available:         Yes
    Double-Precision:     Yes
    Extensions: 
                cl_khr_fp64
                cl_khr_global_int32_base_atomics
                cl_khr_global_int32_extended_atomics
                cl_khr_local_int32_base_atomics
                cl_khr_local_int32_extended_atomics
                cl_khr_byte_addressable_store
                cl_APPLE_gl_sharing
                cl_APPLE_SetMemObjectDestructor
                cl_APPLE_ContextLoggingFunctions

logout
Display Res: 1680x1050

Displacement V2:

[GPU]: Shader 1/Shader 2
Compute: 24 ms. / 19 ms.
Display: 41 fps. / 52 fps.

Galaxy 8K V2:
Vector Single Core: 12
Vector MultiCore: 22
GeForce 8600GTS: 52
Hybrid MultiCore CPU+GPU: 29

I didnt try the 32K, I suppose is too much for this old card...

AsRock P45TS | C2D E8200 | GeForce 8600GTS
Mac OS X Snow Leopard 10.6.1 + Windows 7 Ultimate 6.1.7600 + Fedora 11
Reply With Quote
  #6  
Old 10-11-2009, 02:27 PM
mitchde mitchde is offline
Jaguar
 
Join Date: Aug 2009
Posts: 38
Smoke Partikels and all others are from Apple Dev Source. Just compiled.
For those only want to look, here is the Smoke Particles video:
http://www.youtube.com/watch?v=-7yTRxJhVps

On that Link you see also the DL Link (10 MB) and how to run instructions (easy )
Only CUDA driver must be installed, Links also at the video link above.

CUDA has the advantage to run also in OS X 10.5 , OpenCL is 10.6!
Disadvantage (like ATI STEAM) is thats vendoe specific - CUDA app runs only on NVIDIA , and ATI STEAM (no OS X) only on ATI GPUS. OpenCL should be universal. Until now its not, because ATI gpus have trouble with OpenCL.
OpenCL main difference to CUDA + ATI STEAM is, that OpenCL part of the App is compiled at runtime ! So The dev must NOT
compile that for an specific gpu. The openCL Source will compiled for the specific gpu the OpenCL framework finds at runtime.
But OpenCL has indeed also some things to code to optimize the source for the complete different (in features+speed) gpus.
So OpenCL has great feature but is more work (also brain work) to get an big bandwith of really universal gpu computing.

Last edited by mitchde; 10-11-2009 at 02:39 PM.
Reply With Quote
  #7  
Old 10-17-2009, 12:24 AM
hys17 hys17 is offline
Cheetah
 
Join Date: Oct 2009
Location: Toronto
Posts: 10
I passed all the test except the VolumeRender.Here's the content of the txt file:


/Users/xxx/Downloads/OpenCL_Bench_SET_V2/from Nvidia Sources/VolumeRender_OpenCL/oclVolumeRender Starting...

Press '=' and '-' to change density
']' and '[' to change brightness
';' and ''' to modify transfer function offset
'.' and ',' to modify transfer function scale

CL_DEVICE_VENDOR: NVIDIA
CL_DEVICE_NAME: GeForce GTX 280
CL_DRIVER_VERSION: CLH 1.0
CL_DEVICE_TYPE: CL_DEVICE_TYPE_GPU
CL_DEVICE_MAX_COMPUTE_UNITS: 240
CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
CL_DEVICE_MAX_WORK_ITEM_SIZES: 512 / 512 / 64
CL_DEVICE_MAX_WORK_GROUP_SIZE: 512
CL_DEVICE_MAX_CLOCK_FREQUENCY: 1350 MHz
CL_DEVICE_ADDRESS_BITS: 32
CL_DEVICE_IMAGE_SUPPORT: 1
CL_DEVICE_MAX_READ_IMAGE_ARGS: 128
CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 8
CL_DEVICE_IMAGE_MAX_WIDTH: 2d width 8192, 2d height 8192, 3d width 2048, 3d height 2048, 3d depth 2048
CL_DEVICE_MAX_MEM_ALLOC_SIZE: 256 MByte
CL_DEVICE_GLOBAL_MEM_SIZE: 1024 MByte
CL_DEVICE_ERROR_CORRECTION_SUPPORT: no
CL_DEVICE_LOCAL_MEM_TYPE: local
CL_DEVICE_LOCAL_MEM_SIZE: 16 KByte
CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64 KByte
CL_DEVICE_QUEUE_PROPERTIES: CL_QUEUE_PROFILING_ENABLE
CL_DEVICE_EXTENSIONS:
cl_khr_byte_addressable_store
cl_khr_global_int32_base_atomics
cl_khr_global_int32_extended_atomics
cl_APPLE_gl_sharing
cl_APPLE_SetMemObjectDestructor
cl_APPLE_ContextLoggingFunctions
cl_khr_local_int32_base_atomics
cl_khr_local_int32_extended_atomics
CL_DEVICE_PREFERRED_VECTOR_WIDTH: char 1, short 1, int 1, long 1, float 1, double 0


!!! Error # 0 at line 161 , in file oclVolumeRender.cpp !!!


Starting Cleanup...

TEST FAILED !!!...

oclVolumeRender.exe Exiting...
Press <Enter> to Quit


Is it because it's [from Nvidia Sources]?so the test doesn't work with unofficially supported GPU?



💡 Deploy cloud instances seamlessly on DigitalOcean. Free credits ($100) for InfMac readers.


i7 920
MSI X58 Platinum SLI
DDR3 1066 2GB x3
EVGA Geforce GTX 280 1024MB
1TB SATA2 x2 - RAID
1.5TB SATA2 (JMicron Port,GPT,installed Snow Leopard)
BenQ G2412HD 1920x1080 LCD

Snow Leopard [Vanilla] 10.6.2
Chameleon RC4
Reply With Quote
Reply