#1
|
|||
|
|||
OpenCL GPU+CPU GALAXY Bench
10.6 only.
Using: Start Galaxy key s = switch compute Modes >CPU>Single/Multi, CPU-Vector/SSE Single/Multi>GPU> GPU+CPU> (bold=start Mode) key SPACE = Pause/go on key 6 = Reset Szene key Q = QUIT DOWNLOAD 6 MB: http://rapidshare.com/files/27355290...CL_Galaxis.zip LLMV GCC 4.2 Compiler compiliert = high optimized Code Some results (posted): mitch (C2D 3GHZ, NV 9600 GT , 1440x1140) 24 Gigaflops / around 70 U/sec : CPU ( SIM: Vector Multi-Core CPU. Mode) 73 Gigaflops / around 220 U/sec : NV 9600 GT 1600x1200 21 Gigaflops / around 65 U/sec : CPU ( SIM: Vector Multi-Core CPU. Mode) 60 Gigaflops / around 190 U/sec : Nvidia 9600 GT Users: 170 Gigaflops / around 505 U/sec : Nvidia 9800 GTX+ MacPro Early 2008 CPU : 95 Gigaflops GPU 60 Gigaflops NV 8800GT |
#2
|
|||
|
|||
no work
links no work!!!!!!!! instructions??????
|
#3
|
|||
|
|||
Hi, someone posted wrong (very OLD=Version1=removed) Links:
Here are my (i compiled that stuff) - all need 10.6.x. ATI has problems with OpenCL, so only the Displacement works here. Galaxies cant run (need 10.6.2+...get fixed by Apple/ATI) OpenCLBench_Displacement_Version 2 http://rapidshare.com/files/28747429...acement_V2.zip Galaxies_8K_Version 2 - for slow OpenCL GPUs http://rapidshare.com/files/28623515...xies_8K_V2.zip Galaxies_32K_Version 2 - for fast gpus http://rapidshare.com/files/28623429...ies_32K_V2.zip Galaxies does an Starsimulation. If you have slow OpenCL GPU and/or slow CPU best use first the 8K Version. Displacement does some rendering/shining. Very soon an OpenCL Smoke Particles demo will be available: Last edited by mitchde; 10-10-2009 at 01:31 PM. |
#4
|
||||
|
||||
SMOKE! Whered you get that! I want try it =D
OS: Snow Leopard Mac OS X 10.6.2 (10C540) | Vanilla Kernel | 64 bit Kernel | EFI64 CPU: Intel Core i7 920 @ 3.6 GHz GPU: 2x SLI Dual GTX 260 (OpenCL working)(QE+CI) RAM: 6GB DDR3 1600MHz 7-7-7-20 AUDIO: Realtek (5.1 config) HDD: RAID 0- 1 + 1TB HDD's | 320GB HDD MOBO: MSI X58 Platinum SLI (Intel ICH10) CASE: Haf 932 Full Tower Case + Red Cathodes MONITOR: Asus VK266H 26" 1920x1200 LCD All working |
#5
|
|||
|
|||
Intel C2D 2,66@3,20 GHz + GeForce 8600GTS 512 Mb
Code:
1 OpenCL platform found! [Platform 0] Name: Apple Vendor: Apple Version: OpenCL 1.0 (Jul 15 2009 23:07:32) Profile: FULL_PROFILE [OpenCL-only Context] 2 OpenCL devices found! [Device 0] Name: GeForce 8600 GTS Vendor: NVIDIA Type: GPU Device Version: OpenCL 1.0 Driver Version: CLH 1.0 Compute Units: 32 Work Group Size: 512 Clock: 1450 MHz Global Memory: 512 MB Local Memory: 16 KB Cache Size: 0 KB Cache Line Size: 0 Bytes Available: Yes Double-Precision: No Extensions: cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_APPLE_gl_sharing cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions [Device 1] Name: Intel(R) Core(TM)2 Duo CPU E8200 @ 2.66GHz Vendor: Intel Type: CPU Device Version: OpenCL 1.0 Driver Version: 1.0 Compute Units: 2 Work Group Size: 1 Clock: 3228 MHz Global Memory (Total): 2048 MB Global Memory (Host): 1536 MB Global Memory (PCIe): 512 MB Local Memory: 16 KB Cache Size: 6144 KB Cache Line Size: 64 Bytes Available: Yes Double-Precision: Yes Extensions: cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_APPLE_gl_sharing cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions [shared OpenCL+OpenGL Context] 2 OpenCL devices found! [Device 0] Name: GeForce 8600 GTS Vendor: NVIDIA Type: GPU Device Version: OpenCL 1.0 Driver Version: CLH 1.0 Compute Units: 32 Work Group Size: 512 Clock: 1450 MHz Global Memory: 512 MB Local Memory: 16 KB Cache Size: 0 KB Cache Line Size: 0 Bytes Available: Yes Double-Precision: No Extensions: cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_APPLE_gl_sharing cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions [Device 1] Name: Intel(R) Core(TM)2 Duo CPU E8200 @ 2.66GHz Vendor: Intel Type: CPU Device Version: OpenCL 1.0 Driver Version: 1.0 Compute Units: 2 Work Group Size: 1 Clock: 3228 MHz Global Memory (Total): 2048 MB Global Memory (Host): 1536 MB Global Memory (PCIe): 512 MB Local Memory: 16 KB Cache Size: 6144 KB Cache Line Size: 64 Bytes Available: Yes Double-Precision: Yes Extensions: cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_APPLE_gl_sharing cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions logout Displacement V2: [GPU]: Shader 1/Shader 2 Compute: 24 ms. / 19 ms. Display: 41 fps. / 52 fps. Galaxy 8K V2: Vector Single Core: 12 Vector MultiCore: 22 GeForce 8600GTS: 52 Hybrid MultiCore CPU+GPU: 29 I didnt try the 32K, I suppose is too much for this old card... AsRock P45TS | C2D E8200 | GeForce 8600GTS Mac OS X Snow Leopard 10.6.1 + Windows 7 Ultimate 6.1.7600 + Fedora 11 |
#6
|
|||
|
|||
Smoke Partikels and all others are from Apple Dev Source. Just compiled.
For those only want to look, here is the Smoke Particles video: http://www.youtube.com/watch?v=-7yTRxJhVps On that Link you see also the DL Link (10 MB) and how to run instructions (easy ) Only CUDA driver must be installed, Links also at the video link above. CUDA has the advantage to run also in OS X 10.5 , OpenCL is 10.6! Disadvantage (like ATI STEAM) is thats vendoe specific - CUDA app runs only on NVIDIA , and ATI STEAM (no OS X) only on ATI GPUS. OpenCL should be universal. Until now its not, because ATI gpus have trouble with OpenCL. OpenCL main difference to CUDA + ATI STEAM is, that OpenCL part of the App is compiled at runtime ! So The dev must NOT compile that for an specific gpu. The openCL Source will compiled for the specific gpu the OpenCL framework finds at runtime. But OpenCL has indeed also some things to code to optimize the source for the complete different (in features+speed) gpus. So OpenCL has great feature but is more work (also brain work) to get an big bandwith of really universal gpu computing. Last edited by mitchde; 10-11-2009 at 02:39 PM. |
#7
|
|||
|
|||
I passed all the test except the VolumeRender.Here's the content of the txt file:
/Users/xxx/Downloads/OpenCL_Bench_SET_V2/from Nvidia Sources/VolumeRender_OpenCL/oclVolumeRender Starting... Press '=' and '-' to change density ']' and '[' to change brightness ';' and ''' to modify transfer function offset '.' and ',' to modify transfer function scale CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_NAME: GeForce GTX 280 CL_DRIVER_VERSION: CLH 1.0 CL_DEVICE_TYPE: CL_DEVICE_TYPE_GPU CL_DEVICE_MAX_COMPUTE_UNITS: 240 CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3 CL_DEVICE_MAX_WORK_ITEM_SIZES: 512 / 512 / 64 CL_DEVICE_MAX_WORK_GROUP_SIZE: 512 CL_DEVICE_MAX_CLOCK_FREQUENCY: 1350 MHz CL_DEVICE_ADDRESS_BITS: 32 CL_DEVICE_IMAGE_SUPPORT: 1 CL_DEVICE_MAX_READ_IMAGE_ARGS: 128 CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 8 CL_DEVICE_IMAGE_MAX_WIDTH: 2d width 8192, 2d height 8192, 3d width 2048, 3d height 2048, 3d depth 2048 CL_DEVICE_MAX_MEM_ALLOC_SIZE: 256 MByte CL_DEVICE_GLOBAL_MEM_SIZE: 1024 MByte CL_DEVICE_ERROR_CORRECTION_SUPPORT: no CL_DEVICE_LOCAL_MEM_TYPE: local CL_DEVICE_LOCAL_MEM_SIZE: 16 KByte CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64 KByte CL_DEVICE_QUEUE_PROPERTIES: CL_QUEUE_PROFILING_ENABLE CL_DEVICE_EXTENSIONS: cl_khr_byte_addressable_store cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_APPLE_gl_sharing cl_APPLE_SetMemObjectDestructor cl_APPLE_ContextLoggingFunctions cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics CL_DEVICE_PREFERRED_VECTOR_WIDTH: char 1, short 1, int 1, long 1, float 1, double 0 !!! Error # 0 at line 161 , in file oclVolumeRender.cpp !!! Starting Cleanup... TEST FAILED !!!... oclVolumeRender.exe Exiting... Press <Enter> to Quit Is it because it's [from Nvidia Sources]?so the test doesn't work with unofficially supported GPU? 💡 Deploy cloud instances seamlessly on DigitalOcean. Free credits ($100) for InfMac readers. i7 920 MSI X58 Platinum SLI DDR3 1066 2GB x3 EVGA Geforce GTX 280 1024MB 1TB SATA2 x2 - RAID 1.5TB SATA2 (JMicron Port,GPT,installed Snow Leopard) BenQ G2412HD 1920x1080 LCD Snow Leopard [Vanilla] 10.6.2 Chameleon RC4 |