Posted on 2009-8-11 10:39:36
Posted 4 Aug 2009 12:33:55 UTC
Last modified: 4 Aug 2009 12:40:24 UTC
Hi everyone,
I'd like to give you a bit of background information about this release. First of all we follow the "release early, release often" approach WRT CUDA. The reason for this is that it's one thing to get CUDA code running in a known environment, but a significantly different story when it comes to the heterogeneous outside world and BOINC. Therefore this beta release serves as a first test of the whole build and release chain as well as to identify potential BOINC-related problems. Lessons learned can then be used to simplify the deployment of upcoming, improved CUDA releases of the ABP1 radio pulsar search, but also - even more important - the upcoming CUDA version of the S5 hierarchical search.
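The initial version may have quite a few problems; the experience gained from testing will be used to improve the new CUDA application for ABP1 and, more importantly, the CUDA application for S5!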
So far we know that...
- The ABP1 CUDA release yields a performance improvement of roughly a factor of 2 (our own tests on Windows, Linux and Mac OS X), as we used the most trivial approach, which still runs 30%-50% on the CPU
- We claim a full CPU core for the application because BOINC won't reset the task to idle priority when less than one core is claimed
- On-board devices typically can't handle this application due to lack of memory
- Only CUDA 2.1 seems to work reliably (2.2 might cause segfaults, consistent with findings at GPUGrid)
- This release is targeted at 32-bit systems
- The BOINC client's internal task scheduler seems to have problems with CUDA device identification and task distribution
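The CUDA version of ABP1 is roughly twice as fast, while still needing 30-50% of the CPU.
On-board graphics generally can't run this application, mainly because they don't have enough memory.
For now only CUDA 2.1 seems to run reliably; 2.2 may cause errors.
For now there is only a 32-bit version.
The BOINC client still seems to have problems with CUDA device identification and task scheduling.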
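To put the "factor ~2" and "30%-50% on the CPU" figures side by side, here is a rough back-of-the-envelope sketch using Amdahl's law. It is only an illustration, not anything from the Einstein@Home code. It assumes that "30%-50%" means the share of the original CPU-only runtime that still runs on the CPU, which is just one possible reading of that bullet, and the function names are made up for this example.

```python
def amdahl_speedup(cpu_fraction, gpu_gain):
    """Overall speedup when the non-CPU share of the work runs gpu_gain times faster.

    cpu_fraction: share of the original runtime that stays on the CPU
                  (assumed reading of the "30%-50%" figure).
    gpu_gain:     how much faster the offloaded part runs on the GPU.
    """
    if not 0.0 < cpu_fraction <= 1.0:
        raise ValueError("cpu_fraction must be in (0, 1]")
    if gpu_gain <= 0.0:
        raise ValueError("gpu_gain must be positive")
    return 1.0 / (cpu_fraction + (1.0 - cpu_fraction) / gpu_gain)


def speedup_ceiling(cpu_fraction):
    """Best possible overall speedup if the offloaded part took no time at all."""
    if not 0.0 < cpu_fraction <= 1.0:
        raise ValueError("cpu_fraction must be in (0, 1]")
    return 1.0 / cpu_fraction


# Ceiling for the two ends of the quoted range.
for f in (0.3, 0.5):
    print(f"CPU share {f:.0%}: overall speedup is at most {speedup_ceiling(f):.2f}x")
```

Under that assumption the ceiling is about 3.33x for a 30% CPU share and 2.00x for a 50% share, so an overall gain of around 2 would be about what you'd expect as long as that much work stays on the CPU.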
Thanks for supporting these tests! Please stay tuned for more as we're working hard on getting a first beta release of the hierarchical search as well as improved CPU and GPU versions of the ABP search.
Thanks for your support, everyone!
Cheers,
Oliver