|
发表于 2006-12-2 12:06:45
|
显示全部楼层
November 29, 2006 - 19:00 UTC
Last night we had a blip of an outage due to our data download server losing its mount of the file server holding the workunits. This was strictly a random failure, and it worked itself out on its own. We saw similar behavior back when we had a heavier load on the entire system. We released the enhanced client which greatly reduced the rate of workunit/result exchange and therefore reduced the occurrence of these load-related problems. Thanks to Moore's Law and an ever-increasing user base, we'll need to address this issue sooner than later.
28是的停机是由系统负担过重导致的,在发布了Enhanced程序后,系统负担已经减轻不少,但由于大家的机器越来越强,用户的数目也越来越多,这个问题也越发严重。
The other lingering, randomly-occurring problem has to do with "rough periods" accessing the database (see eariler tech news items for details). Basically, what's going on is this: every 24 hours or so a process dumps all useful user/host/team stats to XML files which other sites can upload and generate leader boards, graphs, etc. These tables have continually grown in size, and apparently when this process runs they can knock the result table out of memory. The feeder process, which keeps a healthy queue of available work to send out to users, needs the result table in memory or else a sub-second query to select more work becomes a multi-minute query to read the whole result table back into memory from disk. We're looking into making these queries more efficient.
项目每天要把所有用户主机和团队的统计信息导出到XML文件,以供各个统计网站使用,这个过程也越来越耗费系统资源。我们正在考虑改进相关的查询操作。
We're also looking at setting up a new BOINC database server (remember that the BOINC database is separate for the SETI@home-specific science database which already is on a new server and working well). Recently Intel donated several pieces of hardware to us, including a quad dual-core Xeon processor system (i.e. 8 3GHz processors total). We're currently working out some system quirks, but when we begin trusting it we'll make this our master BOINC database server, and the current one will be a replica. This will provide an immediate backup if needed, and remove the necessity for the weekly outages. More to come on that. Another recently Intel system has already been set up and is being used as a backend science CPU server (and to read new data from hard drives sent up from Arecibo).
感谢Intel最近又捐赠了一台牛机器(4芯8核)!
The last of the known never-touched classic data tapes has been read last week and is in the splitter queue. Next we will start reading tapes that have gone through the pipeline in some form or another, but for some reason never made it into our master database. Possible reasons include: bad data (but hopefully not), a tape drive failure that caused the tapes to remain unread (surprisingly more common than you'd think), poor initial analysis or database corruption leading to failure during redundancy checking. So don't be upset when tapes from the late 90's appear on the queue. Data from 1998 is worth the same as data taken in 2006. The ETs we are looking for come from light years away. A few years won't make any difference when looking for signals consistently repeating over time.
以前SETI Classic时代储存原始观测数据的磁带中最后一卷从未读过的也在上星期进入了任务队列,接下来我们将重新检查部分之前已经读过的磁带,这些磁带由于各种原因还没有进入过任务队列。如果你看到你计算的任务包是来自1998年,不要感到泄气,这些数据和2006的数据同样有意义。我们要寻找的ET至少也来自于数个光年之外,采集数据的时间相差几年并没有什么不同。 |
评分
-
查看全部评分
|