找回密码
 新注册用户
搜索
查看: 2595|回复: 10

[求助] FAH计算错误怎么回事?

[复制链接]
发表于 2022-11-7 03:05:13 | 显示全部楼层 |阅读模式
3070ti 7月刚买的最近更新了522 526驱动,计算一会儿没完成就出错提示cuda错误,然后就上传数据包没了,换回了517驱动错误不一样了算一会儿出错但是不会结束,而是退回一部分继续算,日志我贴上了帮我看看
18:55:33:WU00:FS01:0x22:An exception occurred at step 167918: Particle coordinate is nan
18:55:33:WU00:FS01:0x22:ERROR:98: Attempting to restart from last good checkpoint by restarting core.
18:55:33:WU00:FS01:0x22:Folding@home Core Shutdown: CORE_RESTART
18:55:34:WARNING:WU00:FS01:FahCore returned: CORE_RESTART (98 = 0x62)
18:55:34:WU00:FS01:Starting
18:55:34:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\ProgramData\FAHClient\cores/cores.foldingathome.org/win/64bit/22-0.0.20/Core_22.fah/FahCore_22.exe -dir 00 -suffix 01 -version 706 -lifeline 9096 -checkpoint 15 -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu-vendor nvidia -gpu 0 -gpu-usage 100
18:55:34:WU00:FS01:Started FahCore on PID 11356
18:55:34:WU00:FS01:Core PID:7108
18:55:34:WU00:FS01:FahCore 0x22 started
18:55:35:WU00:FS01:0x22:*********************** Log Started 2022-11-06T18:55:34Z ***********************
18:55:35:WU00:FS01:0x22:*************************** Core22 Folding@home Core ***************************
18:55:35:WU00:FS01:0x22:       Core: Core22
18:55:35:WU00:FS01:0x22:       Type: 0x22
18:55:35:WU00:FS01:0x22:    Version: 0.0.20
18:55:35:WU00:FS01:0x22:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
18:55:35:WU00:FS01:0x22:  Copyright: 2020 foldingathome.org
18:55:35:WU00:FS01:0x22:   Homepage: https://foldingathome.org/
18:55:35:WU00:FS01:0x22:       Date: Jan 20 2022
18:55:35:WU00:FS01:0x22:       Time: 01:15:36
18:55:35:WU00:FS01:0x22:   Revision: 3f211b8a4346514edbff34e3cb1c0e0ec951373c
18:55:35:WU00:FS01:0x22:     Branch: HEAD
18:55:35:WU00:FS01:0x22:   Compiler: Visual C++
18:55:35:WU00:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
18:55:35:WU00:FS01:0x22:             -DOPENMM_VERSION="\"7.7.0\""
18:55:35:WU00:FS01:0x22:   Platform: win32 10
18:55:35:WU00:FS01:0x22:       Bits: 64
18:55:35:WU00:FS01:0x22:       Mode: Release
18:55:35:WU00:FS01:0x22:Maintainers: John Chodera <john.chodera@choderalab.org> and Peter Eastman
18:55:35:WU00:FS01:0x22:             <peastman@stanford.edu>
18:55:35:WU00:FS01:0x22:       Args: -dir 00 -suffix 01 -version 706 -lifeline 11356 -checkpoint 15
18:55:35:WU00:FS01:0x22:             -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu-vendor
18:55:35:WU00:FS01:0x22:             nvidia -gpu 0 -gpu-usage 100
18:55:35:WU00:FS01:0x22:************************************ libFAH ************************************
18:55:35:WU00:FS01:0x22:       Date: Jan 20 2022
18:55:35:WU00:FS01:0x22:       Time: 01:14:17
18:55:35:WU00:FS01:0x22:   Revision: 9f4ad694e75c2350d4bb6b8b5b769ba27e483a2f
18:55:35:WU00:FS01:0x22:     Branch: HEAD
18:55:35:WU00:FS01:0x22:   Compiler: Visual C++
18:55:35:WU00:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
18:55:35:WU00:FS01:0x22:   Platform: win32 10
18:55:35:WU00:FS01:0x22:       Bits: 64
18:55:35:WU00:FS01:0x22:       Mode: Release
18:55:35:WU00:FS01:0x22:************************************ CBang *************************************
18:55:35:WU00:FS01:0x22:       Date: Jan 20 2022
18:55:35:WU00:FS01:0x22:       Time: 01:13:20
18:55:35:WU00:FS01:0x22:   Revision: ab023d155b446906d55b0f6c9a1eedeea04f7a1a
18:55:35:WU00:FS01:0x22:     Branch: HEAD
18:55:35:WU00:FS01:0x22:   Compiler: Visual C++
18:55:35:WU00:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
18:55:35:WU00:FS01:0x22:   Platform: win32 10
18:55:35:WU00:FS01:0x22:       Bits: 64
18:55:35:WU00:FS01:0x22:       Mode: Release
18:55:35:WU00:FS01:0x22:************************************ System ************************************
18:55:35:WU00:FS01:0x22:        CPU: AMD Ryzen 9 5950X 16-Core Processor
18:55:35:WU00:FS01:0x22:     CPU ID: AuthenticAMD Family 25 Model 33 Stepping 2
18:55:35:WU00:FS01:0x22:       CPUs: 32
18:55:35:WU00:FS01:0x22:     Memory: 31.92GiB
18:55:35:WU00:FS01:0x22:Free Memory: 22.31GiB
18:55:35:WU00:FS01:0x22:    Threads: WINDOWS_THREADS
18:55:35:WU00:FS01:0x22: OS Version: 6.2
18:55:35:WU00:FS01:0x22:Has Battery: false
18:55:35:WU00:FS01:0x22: On Battery: false
18:55:35:WU00:FS01:0x22: UTC Offset: 8
18:55:35:WU00:FS01:0x22:        PID: 7108
18:55:35:WU00:FS01:0x22:        CWD: C:\ProgramData\FAHClient\work
18:55:35:WU00:FS01:0x22:************************************ OpenMM ************************************
18:55:35:WU00:FS01:0x22:    Version: 7.7.0
18:55:35:WU00:FS01:0x22:********************************************************************************
18:55:35:WU00:FS01:0x22:Project: 18450 (Run 8, Clone 51, Gen 210)
18:55:35:WU00:FS01:0x22:Digital signatures verified
18:55:35:WU00:FS01:0x22:Folding@home GPU Core22 Folding@home Core
18:55:35:WU00:FS01:0x22:Version 0.0.20
18:55:35:WU00:FS01:0x22:  Checkpoint write interval: 50000 steps (2%) [50 total]
18:55:35:WU00:FS01:0x22:  JSON viewer frame write interval: 25000 steps (1%) [100 total]
18:55:35:WU00:FS01:0x22:  XTC frame write interval: 2500000 steps (1e+02%) [1 total]
18:55:35:WU00:FS01:0x22:  Global context and integrator variables write interval: disabled
18:55:35:WU00:FS01:0x22:There are 4 platforms available.
18:55:35:WU00:FS01:0x22:Platform 0: Reference
18:55:35:WU00:FS01:0x22:Platform 1: CPU
18:55:35:WU00:FS01:0x22:Platform 2: OpenCL
18:55:35:WU00:FS01:0x22:  opencl-device 0 specified
18:55:35:WU00:FS01:0x22:Platform 3: CUDA
18:55:35:WU00:FS01:0x22:  cuda-device 0 specified
18:55:48:WU00:FS01:0x22:Attempting to create CUDA context:
18:55:48:WU00:FS01:0x22:  Configuring platform CUDA
18:55:51:WU00:FS01:0x22:  Using CUDA and gpu 0
18:55:51:WU00:FS01:0x22:Completed 150000 out of 2500000 steps (6%)
18:57:35:WU00:FS01:0x22:Completed 175000 out of 2500000 steps (7%)
18:59:20:WU00:FS01:0x22:Completed 200000 out of 2500000 steps (8%)
18:59:21:WU00:FS01:0x22:Checkpoint completed at step 200000
19:01:05:WU00:FS01:0x22:Completed 225000 out of 2500000 steps (9%)
19:02:48:WU00:FS01:0x22:Completed 250000 out of 2500000 steps (10%)
19:02:49:WU00:FS01:0x22:Checkpoint completed at step 250000
19:03:48:WU00:FS01:0x22:An exception occurred at step 262043: Particle coordinate is nan
19:03:48:WU00:FS01:0x22:ERROR:98: Attempting to restart from last good checkpoint by restarting core.
19:03:48:WU00:FS01:0x22:Folding@home Core Shutdown: CORE_RESTART
19:03:49:WARNING:WU00:FS01:FahCore returned: CORE_RESTART (98 = 0x62)
19:03:49:WU00:FS01:Starting
19:03:49:WU00:FS01:Running FahCore: "C:\Program Files (x86)\FAHClient/FAHCoreWrapper.exe" C:\ProgramData\FAHClient\cores/cores.foldingathome.org/win/64bit/22-0.0.20/Core_22.fah/FahCore_22.exe -dir 00 -suffix 01 -version 706 -lifeline 9096 -checkpoint 15 -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu-vendor nvidia -gpu 0 -gpu-usage 100
19:03:49:WU00:FS01:Started FahCore on PID 10696
19:03:49:WU00:FS01:Core PID:5292
19:03:49:WU00:FS01:FahCore 0x22 started
19:03:49:WU00:FS01:0x22:*********************** Log Started 2022-11-06T19:03:49Z ***********************
19:03:49:WU00:FS01:0x22:*************************** Core22 Folding@home Core ***************************
19:03:49:WU00:FS01:0x22:       Core: Core22
19:03:49:WU00:FS01:0x22:       Type: 0x22
19:03:49:WU00:FS01:0x22:    Version: 0.0.20
19:03:49:WU00:FS01:0x22:     Author: Joseph Coffland <joseph@cauldrondevelopment.com>
19:03:49:WU00:FS01:0x22:  Copyright: 2020 foldingathome.org
19:03:49:WU00:FS01:0x22:   Homepage: https://foldingathome.org/
19:03:49:WU00:FS01:0x22:       Date: Jan 20 2022
19:03:49:WU00:FS01:0x22:       Time: 01:15:36
19:03:49:WU00:FS01:0x22:   Revision: 3f211b8a4346514edbff34e3cb1c0e0ec951373c
19:03:49:WU00:FS01:0x22:     Branch: HEAD
19:03:49:WU00:FS01:0x22:   Compiler: Visual C++
19:03:49:WU00:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:03:49:WU00:FS01:0x22:             -DOPENMM_VERSION="\"7.7.0\""
19:03:49:WU00:FS01:0x22:   Platform: win32 10
19:03:49:WU00:FS01:0x22:       Bits: 64
19:03:49:WU00:FS01:0x22:       Mode: Release
19:03:49:WU00:FS01:0x22:Maintainers: John Chodera <john.chodera@choderalab.org> and Peter Eastman
19:03:49:WU00:FS01:0x22:             <peastman@stanford.edu>
19:03:49:WU00:FS01:0x22:       Args: -dir 00 -suffix 01 -version 706 -lifeline 10696 -checkpoint 15
19:03:49:WU00:FS01:0x22:             -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu-vendor
19:03:49:WU00:FS01:0x22:             nvidia -gpu 0 -gpu-usage 100
19:03:49:WU00:FS01:0x22:************************************ libFAH ************************************
19:03:49:WU00:FS01:0x22:       Date: Jan 20 2022
19:03:49:WU00:FS01:0x22:       Time: 01:14:17
19:03:49:WU00:FS01:0x22:   Revision: 9f4ad694e75c2350d4bb6b8b5b769ba27e483a2f
19:03:49:WU00:FS01:0x22:     Branch: HEAD
19:03:49:WU00:FS01:0x22:   Compiler: Visual C++
19:03:49:WU00:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:03:49:WU00:FS01:0x22:   Platform: win32 10
19:03:49:WU00:FS01:0x22:       Bits: 64
19:03:49:WU00:FS01:0x22:       Mode: Release
19:03:49:WU00:FS01:0x22:************************************ CBang *************************************
19:03:49:WU00:FS01:0x22:       Date: Jan 20 2022
19:03:49:WU00:FS01:0x22:       Time: 01:13:20
19:03:49:WU00:FS01:0x22:   Revision: ab023d155b446906d55b0f6c9a1eedeea04f7a1a
19:03:49:WU00:FS01:0x22:     Branch: HEAD
19:03:49:WU00:FS01:0x22:   Compiler: Visual C++
19:03:49:WU00:FS01:0x22:    Options: /TP /nologo /EHa /wd4297 /wd4103 /O2 /Zc:throwingNew /MT
19:03:49:WU00:FS01:0x22:   Platform: win32 10
19:03:49:WU00:FS01:0x22:       Bits: 64
19:03:49:WU00:FS01:0x22:       Mode: Release
19:03:49:WU00:FS01:0x22:************************************ System ************************************
19:03:49:WU00:FS01:0x22:        CPU: AMD Ryzen 9 5950X 16-Core Processor
19:03:49:WU00:FS01:0x22:     CPU ID: AuthenticAMD Family 25 Model 33 Stepping 2
19:03:49:WU00:FS01:0x22:       CPUs: 32
19:03:49:WU00:FS01:0x22:     Memory: 31.92GiB
19:03:49:WU00:FS01:0x22:Free Memory: 21.78GiB
19:03:49:WU00:FS01:0x22:    Threads: WINDOWS_THREADS
19:03:49:WU00:FS01:0x22: OS Version: 6.2
19:03:49:WU00:FS01:0x22:Has Battery: false
19:03:49:WU00:FS01:0x22: On Battery: false
19:03:49:WU00:FS01:0x22: UTC Offset: 8
19:03:49:WU00:FS01:0x22:        PID: 5292
19:03:49:WU00:FS01:0x22:        CWD: C:\ProgramData\FAHClient\work
19:03:49:WU00:FS01:0x22:************************************ OpenMM ************************************
19:03:49:WU00:FS01:0x22:    Version: 7.7.0
19:03:49:WU00:FS01:0x22:********************************************************************************
19:03:49:WU00:FS01:0x22:Project: 18450 (Run 8, Clone 51, Gen 210)
19:03:49:WU00:FS01:0x22:Digital signatures verified
19:03:49:WU00:FS01:0x22:Folding@home GPU Core22 Folding@home Core
19:03:49:WU00:FS01:0x22:Version 0.0.20
19:03:49:WU00:FS01:0x22:  Checkpoint write interval: 50000 steps (2%) [50 total]
19:03:49:WU00:FS01:0x22:  JSON viewer frame write interval: 25000 steps (1%) [100 total]
19:03:49:WU00:FS01:0x22:  XTC frame write interval: 2500000 steps (1e+02%) [1 total]
19:03:49:WU00:FS01:0x22:  Global context and integrator variables write interval: disabled
19:03:49:WU00:FS01:0x22:There are 4 platforms available.
19:03:49:WU00:FS01:0x22:Platform 0: Reference
19:03:49:WU00:FS01:0x22:Platform 1: CPU
19:03:49:WU00:FS01:0x22:Platform 2: OpenCL
19:03:49:WU00:FS01:0x22:  opencl-device 0 specified
19:03:49:WU00:FS01:0x22:Platform 3: CUDA
19:03:49:WU00:FS01:0x22:  cuda-device 0 specified
19:04:03:WU00:FS01:0x22:Attempting to create CUDA context:
19:04:03:WU00:FS01:0x22:  Configuring platform CUDA
19:04:06:WU00:FS01:0x22:  Using CUDA and gpu 0
19:04:06:WU00:FS01:0x22:Completed 250000 out of 2500000 steps (10%)


回复

使用道具 举报

 楼主| 发表于 2022-11-7 05:01:07 | 显示全部楼层
显卡肯定没有问题,又换会刚买的时候用的驱动版本516.59了看看还有没有错误。看了官网近些年发布的所有版本驱动
屏幕截图 2022-11-05 202952.jpg
回复

使用道具 举报

 楼主| 发表于 2022-11-7 05:02:41 | 显示全部楼层
最近两版本522 526是支持4090的,看游戏有报错还有任务管理器显示不正常,我换回了517还是错误现在又换回516看看,这个版本7月卡买回来就用一直正常
回复

使用道具 举报

发表于 2022-11-7 13:24:33 | 显示全部楼层
FAH没记错的话,同一个存盘点允许最多1次错误,超过就会自动抛弃,看描述更像是体质差、不够稳定
我的3080ti在522驱动下没有任何问题,30系注意别超频太狠,核心和显存体质都挺烂的
回复

使用道具 举报

发表于 2022-11-7 13:30:09 | 显示全部楼层
FAH没记错的话,同一个存盘点允许最多1次错误,超过就会自动抛弃,看描述更像是体质差、不够稳定
我的3080ti在522驱动下没有任何问题,30系注意别超频太狠,核心和显存体质都挺烂的
回复

使用道具 举报

 楼主| 发表于 2022-11-9 00:10:01 | 显示全部楼层
牵牛星 发表于 2022-11-7 13:24
FAH没记错的话,同一个存盘点允许最多1次错误,超过就会自动抛弃,看描述更像是体质差、不够稳定
我的3080t ...

就默认的啊现在用的刚买回来的516驱动还是有错误然后继续算,522的直接cuda错误
回复

使用道具 举报

 楼主| 发表于 2022-11-9 01:20:47 | 显示全部楼层
牵牛星 发表于 2022-11-7 13:30
FAH没记错的话,同一个存盘点允许最多1次错误,超过就会自动抛弃,看描述更像是体质差、不够稳定
我的3080t ...

我再试试看522驱动吧,这个版本据说解锁了lhr,还不行只能停算了卡没问题本身
回复

使用道具 举报

 楼主| 发表于 2022-11-9 03:39:22 | 显示全部楼层
牵牛星 发表于 2022-11-7 13:30
FAH没记错的话,同一个存盘点允许最多1次错误,超过就会自动抛弃,看描述更像是体质差、不够稳定
我的3080t ...

522.25还是有问题,几次错误就不能继续计算了
19:36:16:WU00:FS01:0x22:An exception occurred at step 1649822: Particle coordinate is nan
19:36:16:WU00:FS01:0x22:Max number of attempts to resume from last checkpoint (2) reached. Aborting.
19:36:16:WU00:FS01:0x22:ERROR:114: Max number of attempts to resume from last checkpoint reached.
19:36:16:WU00:FS01:0x22:Saving result file ..\logfile_01.txt
19:36:16:WU00:FS01:0x22:Saving result file science.log
19:36:16:WU00:FS01:0x22:Saving result file state.xml
19:36:17:WU00:FS01:0x22:Saving result file xtcAtoms.csv.bz2
19:36:17:WU00:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
19:36:18:WARNING:WU00:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
19:36:18:WU00:FS01:Sending unit results: id:00 state:SEND error:FAULTY project:18450 run:11 clone:375 gen:84 core:0x22 unit:0x0000017700000054000048120000000b
19:36:18:WU00:FS01:Uploading 51.11MiB to 129.32.209.202
19:36:18:WU00:FS01:Connecting to 129.32.209.202:8080
19:36:24:WU00:FS01:Upload 71.41%
19:36:27:WU00:FS01:Upload complete
19:36:27:WU00:FS01:Server responded WORK_ACK (400)
19:36:27:WU00:FS01:Cleaning up

回复

使用道具 举报

 楼主| 发表于 2022-11-9 04:33:47 | 显示全部楼层
牵牛星 发表于 2022-11-7 13:30
FAH没记错的话,同一个存盘点允许最多1次错误,超过就会自动抛弃,看描述更像是体质差、不够稳定
我的3080t ...

算了个boinc的显卡任务正常的不知道是不是FAH的问题
回复

使用道具 举报

发表于 2022-11-9 12:26:01 | 显示全部楼层
看描述确实也有点像显卡体质问题,降压降功耗跑跑看
回复

使用道具 举报

 楼主| 发表于 2023-1-15 07:40:30 | 显示全部楼层
zflowers 发表于 2022-11-9 12:26
看描述确实也有点像显卡体质问题,降压降功耗跑跑看

内存频率降低了之后好了
回复

使用道具 举报

您需要登录后才可以回帖 登录 | 新注册用户

本版积分规则

论坛官方淘宝店开业啦~

Archiver|手机版|小黑屋|中国分布式计算总站 ( 沪ICP备05042587号 )

GMT+8, 2024-4-29 23:30

Powered by Discuz! X3.5

© 2001-2024 Discuz! Team.

快速回复 返回顶部 返回列表