
[Attention] Clean Energy Project Phase 2 (CEP2) may have another kind of computation error

Posted on 2012-1-30 11:34:39
Last edited by 447968925 on 2012-1-30 11:37

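Clean Energy Project Phase 2 may have another kind of computation error,
and the affected results still pass validation as if they were correct.
Could someone who knows English report this to the project team?

Conditions: on my own computer it happens when many web pages are open (i.e. high CPU and memory usage).
Symptoms: the computation time drops sharply, but the work unit is not actually finished. Each work unit contains 16 sub-jobs; when the error occurs, only the first few are computed, the rest are not, and the result is submitted as-is.
Consequences: results that are not fully computed get submitted and pass validation, so results that should have been produced may be missing.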

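Original poster | Posted on 2012-1-30 12:11:57
I have no screenshot: this happened a few days ago, and the record for that result is no longer on the website.

Let me use a normal work unit to explain.

Result log

Result name: E205533_ 174_ C.32.C27H15NOS2Si.00740961.4.set1d06_ 0--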



<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[13:11:21] Number of jobs = 16
[13:11:21] Starting job 0,CPU time has been restored to 0.000000.
[15:51:23] Finished Job #0
[15:51:23] Starting job 1,CPU time has been restored to 183.312500.
[16:02:29] Finished Job #1
[16:02:29] Starting job 2,CPU time has been restored to 745.328125.
16:26:22 (259224): No heartbeat from core client for 30 sec - exiting
No heartbeat: Exiting
[16:26:53] Number of jobs = 16
[16:26:53] Starting job 2,CPU time has been restored to 745.328125.
Quit requested: Exiting
[20:53:15] Number of jobs = 16
[20:53:15] Starting job 2,CPU time has been restored to 745.328125.
[02:54:16] Finished Job #2
[02:54:16] Starting job 3,CPU time has been restored to 19457.375000.
[03:03:47] Finished Job #3
[03:03:47] Starting job 4,CPU time has been restored to 20022.812500.
[03:11:29] Finished Job #4
[03:11:29] Starting job 5,CPU time has been restored to 20481.859375.
[03:19:26] Finished Job #5
[03:19:26] Starting job 6,CPU time has been restored to 20955.484375.
[03:27:06] Finished Job #6
[03:27:06] Starting job 7,CPU time has been restored to 21414.828125.
[03:37:26] Finished Job #7
[03:37:26] Starting job 8,CPU time has been restored to 22031.515625.
[03:44:52] Finished Job #8
[03:44:52] Starting job 9,CPU time has been restored to 22474.656250.
[03:53:34] Finished Job #9
[03:53:34] Starting job 10,CPU time has been restored to 22993.218750.
[04:12:25] Finished Job #10
[04:12:25] Starting job 11,CPU time has been restored to 24122.093750.
[04:23:08] Finished Job #11
[04:23:08] Starting job 12,CPU time has been restored to 24759.656250.
Application exited with RC = 0x1
[06:51:10] Finished Job #12
[06:51:10] Starting job 13,CPU time has been restored to 33586.140625.
[06:51:10] Skipping Job #13
[06:51:10] Starting job 14,CPU time has been restored to 33586.140625.
[06:51:10] Skipping Job #14
[06:51:10] Starting job 15,CPU time has been restored to 33586.140625.
[06:51:10] Skipping Job #15
06:51:21 (1940): called boinc_finish

</stderr_txt>
]]>

Original poster | Posted on 2012-1-30 12:14:59
Last edited by 447968925 on 2012-1-30 12:20

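To be clear, the log above is from a normal work unit; I am only using it to illustrate the problem.

You can see that there are 16 sub-jobs in total, and each "Starting job" line ends with a CPU time.

The last three sub-jobs really should show the same time (those three are normal).

When the error occurs, that number stops increasing from some earlier sub-job onward.

In other words, the later sub-jobs were never computed.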
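
The check described above (look for the point where the CPU time stops increasing) can be sketched in Python. This is only a rough illustration, not a tool from BOINC or the project: the function names `summarize` and `jobs_without_cpu_time` are made up here, and the regular expressions assume the "Starting job" and "Skipping Job" line formats exactly as they appear in the log quoted earlier in this thread. The sample text is the tail of that same log.

```python
import re

# Line formats assumed from the stderr log quoted in this thread.
START_RE = re.compile(
    r"Starting job (\d+),\s*CPU time has been restored to (\d+(?:\.\d+)?)"
)
SKIP_RE = re.compile(r"Skipping Job #(\d+)")

# Tail of the "normal" log posted above.
SAMPLE = """\
[04:23:08] Starting job 12,CPU time has been restored to 24759.656250.
Application exited with RC = 0x1
[06:51:10] Finished Job #12
[06:51:10] Starting job 13,CPU time has been restored to 33586.140625.
[06:51:10] Skipping Job #13
[06:51:10] Starting job 14,CPU time has been restored to 33586.140625.
[06:51:10] Skipping Job #14
[06:51:10] Starting job 15,CPU time has been restored to 33586.140625.
[06:51:10] Skipping Job #15
"""


def summarize(stderr_text):
    """Return (cpu_at_start, skipped) parsed from a stderr log.

    cpu_at_start maps job number -> CPU seconds reported when the job started.
    skipped is the set of job numbers with an explicit "Skipping Job" line.
    """
    cpu_at_start = {}
    skipped = set()
    for line in stderr_text.splitlines():
        match = START_RE.search(line)
        if match:
            # A job resumed after a restart repeats its line; keep the last one.
            cpu_at_start[int(match.group(1))] = float(match.group(2))
            continue
        match = SKIP_RE.search(line)
        if match:
            skipped.add(int(match.group(1)))
    return cpu_at_start, skipped


def jobs_without_cpu_time(cpu_at_start):
    """List jobs whose successor started with the same CPU time (no work done)."""
    jobs = sorted(cpu_at_start)
    return [
        a
        for a, b in zip(jobs, jobs[1:])
        if b == a + 1 and cpu_at_start[b] <= cpu_at_start[a]
    ]


if __name__ == "__main__":
    cpu, skipped = summarize(SAMPLE)
    print("skipped jobs:", sorted(skipped))
    print("jobs that used no CPU time:", jobs_without_cpu_time(cpu))
```

The last job in a log can never be flagged by the CPU-time comparison, because there is no later "Starting job" line to compare against. In a faulty result as described above, the list of jobs that used no CPU time would start at a much earlier job number than in this normal log.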

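Posted on 2012-1-31 10:57:59
As I recall, on my computer once a work unit had been running for about 12 hours, the remaining sub-jobs were no longer computed and the result was uploaded directly.
The same thing also happens if the task is stopped before it reaches a checkpoint and this repeats several times.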

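Posted on 2012-2-12 14:46:48
I won't start a new thread; let me report a different problem here.
On HCMD2, one work unit finished after only 14 minutes of computation, and another reached 80% in 8 minutes.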

Posted on 2012-2-13 11:13:13
In theory HCMD2 ends in 7 days, so reporting a bug now probably wouldn't accomplish much, would it?