查看: 7560|回复: 38

[范例] 一个双节点RAC的故障案例

[复制链接]
论坛徽章:
5
授权会员
日期:2005-10-30 17:05:33会员2006贡献徽章
日期:2006-04-17 13:46:34ITPUB9周年纪念徽章
日期:2010-10-08 09:28:522011新春纪念徽章
日期:2011-02-18 11:43:35迷宫蛋
日期:2011-11-02 16:14:29
发表于 2010-9-16 10:09 | 显示全部楼层 |阅读模式
操作系统:Linux  2.6.9-42.ELlargesmp Red Hat Enterprise Linux AS release 4 (Nahant Update 4) X86-64
数据库版本: 10.2.0.4.0
服务器:HP580 16G内存 4X4核CPU
存储:ISCSI链接方式
网络设置:内网心跳采用直连方式
运行应用:某财务系统
运行特点:应用设计为只针对一个节点进行访问,当此节点故障失效后才切换到另一个节点
问题描述:

当系统负载高或不高时(大部分情况发生在负载高的时候),内网网卡会自动重启数次,导致节点呗驱逐出集群,数据库会重启操作系统,具体信息如下:

/VAR/LOG/MESSAGES

Aug 31 11:36:54 easrac1 kernel: bnx2: eth1 NIC Link is Up, 1000 Mbps full duplex, receive & transmit flow control ON

Aug 31 11:37:04 easrac1 kernel: bnx2: eth1 NIC Link is Down

Aug 31 11:37:06 easrac1 kernel: bnx2: eth1 NIC Link is Up, 1000 Mbps full duplex, receive & transmit flow control ON

Aug 31 11:40:07 easrac1 kernel: bnx2: eth1 NIC Link is Down

Aug 31 11:40:09 easrac1 kernel: bnx2: eth1 NIC Link is Up, 1000 Mbps full duplex, receive & transmit flow control ON

-------------------------------------------------------------------------------------------------------------------------------------------

alter.log

无任何有价值信息,只有重启信息

--------------------------------------------------------------------------------------------------------------------------------------------
OCSS.LOG

[    CSSD]2010-08-31 11:37:20.993 [1241577824] >TRACE:   clssnmPollingThread: diskTimeout set to (57000)ms impending reconfig
status(1)
[    CSSD]2010-08-31 11:37:21.994 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 50 3.118321e-317artbeat fat
al, eviction in 28.540 seconds
[    CSSD]2010-08-31 11:37:36.000 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 75 3.118345e-317artbeat fat
al, eviction in 14.530 seconds
[    CSSD]2010-08-31 11:37:36.992 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 75 3.118368e-317artbeat fat
al, eviction in 13.540 seconds
[    CSSD]2010-08-31 11:37:44.998 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 90 3.118392e-317artbeat fat
al, eviction in 5.540 seconds
[    CSSD]2010-08-31 11:37:46.000 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 90 3.118416e-317artbeat fat
al, eviction in 4.530 seconds
[    CSSD]2010-08-31 11:37:46.991 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 90 3.118440e-317artbeat fat
al, eviction in 3.540 seconds
[    CSSD]2010-08-31 11:37:47.993 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 90 3.118463e-317artbeat fat
al, eviction in 2.540 seconds
[    CSSD]2010-08-31 11:37:48.995 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 90 3.118487e-317artbeat fat
al, eviction in 1.540 seconds
[    CSSD]2010-08-31 11:37:49.997 [1241577824] >WARNING: clssnmPollingThread: node easrac2 (2) at 90 3.118511e-317artbeat fat
al, eviction in 0.540 seconds
[    CSSD]2010-08-31 11:37:50.539 [1241577824] >TRACE:   clssnmPollingThread: Eviction started for node easrac2 (2), flags 0x
040d, state 3, wt4c 0
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmDoSyncUpdate: Initiating sync 3
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmDoSyncUpdate: diskTimeout set to (57000)ms
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmSetupAckWait: Ack message type (11)
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmSetupAckWait: node(1) is ALIVE
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmSendSync: syncSeqNo(3)
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmWaitForAcks: Ack message type(11), ackCount(1)
[    CSSD]2010-08-31 11:37:50.539 [1189128544] >TRACE:   clssnmHandleSync: diskTimeout set to (57000)ms
[    CSSD]2010-08-31 11:37:50.539 [1189128544] >TRACE:   clssnmHandleSync: Acknowledging sync: src[1] srcName[easrac1] seq[1]
sync[3]
[    CSSD]2010-08-31 11:37:50.539 [2538585888] >USER:    NMEVENT_SUSPEND [00][00][00][06]
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmWaitForAcks: done, msg type(11)
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmDoSyncUpdate: Terminating node 2, easrac2, misstime(60000) sta
te(5)
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmSetupAckWait: Ack message type (13)
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmSetupAckWait: node(1) is ACTIVE
[    CSSD]2010-08-31 11:37:50.539 [1262557536] >TRACE:   clssnmWaitForAcks: Ack message type(13), ackCount(1)
[    CSSD]2010-08-31 11:37:50.539 [1189128544] >TRACE:   clssnmSendVoteInfo: node(1) syncSeqNo(3)
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmWaitForAcks: done, msg type(13)
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmCheckDskInfo: Checking disk info...
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmEvict: Start
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmEvict: Evicting node 2, easrac2, birth 1, death 3, impendingrc
fg 1, stateflags 0x40d
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmSendShutdown: req to node 2, kill time 66182584
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmDiscHelper: easrac2, node(2) connection failed, con (0x73dfb0)
, probe((nil))
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmWaitOnEvictions: Start
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmWaitOnEvictions: node 2, easrac2, undead 0
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmSetupAckWait: Ack message type (15)
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmSetupAckWait: node(1) is ACTIVE
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmSendUpdate: syncSeqNo(3)
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmWaitForAcks: Ack message type(15), ackCount(1)
[    CSSD]2010-08-31 11:37:50.540 [1189128544] >TRACE:   clssnmUpdateNodeState: node 0, state (0/0) unique (0/0) prevConuni(0
) birth (0/0) (old/new)
[    CSSD]2010-08-31 11:37:50.540 [1189128544] >TRACE:   clssnmUpdateNodeState: node 1, state (3/3) unique (1283158869/128315
8869) prevConuni(0) birth (2/2) (old/new)
[    CSSD]2010-08-31 11:37:50.540 [1189128544] >TRACE:   clssnmUpdateNodeState: node 2, state (5/0) unique (1283158707/128315
8707) prevConuni(1283158707) birth (1/1) (old/new)
[    CSSD]2010-08-31 11:37:50.540 [1189128544] >TRACE:   clssnmDeactivateNode: node 2 (easrac2) left cluster

[    CSSD]2010-08-31 11:37:50.540 [1189128544] >USER:    clssnmHandleUpdate: SYNC(3) from node(1) completed
[    CSSD]2010-08-31 11:37:50.540 [1189128544] >USER:    clssnmHandleUpdate: NODE 1 (easrac1) IS ACTIVE MEMBER OF CLUSTER
[    CSSD]2010-08-31 11:37:50.540 [1189128544] >TRACE:   clssnmHandleUpdate: diskTimeout set to (200000)ms
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmWaitForAcks: done, msg type(15)
[    CSSD]2010-08-31 11:37:50.540 [1262557536] >TRACE:   clssnmDoSyncUpdate: Sync 3 complete!
[    CSSD]2010-08-31 11:37:50.540 [1273047392] >TRACE:   clssgmReconfigThread:  started for reconfig (3)
[    CSSD]2010-08-31 11:37:50.540 [1273047392] >USER:    NMEVENT_RECONFIG [00][00][00][02]
[    CSSD]2010-08-31 11:37:50.540 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock crs_version type 2
[    CSSD]2010-08-31 11:37:50.540 [1273047392] >TRACE:   clssgmCleanupOrphanMembers: cleaning up remote mbr(0) grock(crs_vers
ion) birth(1/1)
[    CSSD]2010-08-31 11:37:50.540 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock DB+ASM type 2
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupOrphanMembers: cleaning up remote mbr(1) grock(DB+ASM)
birth(1/1)
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock DG+ASM type 2
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupOrphanMembers: cleaning up remote mbr(1) grock(DG+ASM)
birth(1/1)
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock DG_DATA type 2
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupOrphanMembers: cleaning up remote mbr(0) grock(DG_DATA)
birth(1/1)
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupOrphanMembers: cleaning up remote mbr(2) grock(DG_DATA)
birth(1/1)
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock ORA_CLSRD_1_easrac type 2
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock ORA_CLSRD_1_easrac type 3
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock ORA_CLSRD_2_easrac type 2
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupOrphanMembers: cleaning up remote mbr(0) grock(ORA_CLSR
D_2_easrac) birth(1/1)
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock ORA_CLSRD_2_easrac type 3
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupOrphanMembers: cleaning up remote mbr(0) grock(ORA_CLSR
D_2_easrac) birth(1/1)
[    CSSD]2010-08-31 11:37:50.541 [1273047392] >TRACE:   clssgmCleanupGrocks: cleaning up grock OSM_ALL type 2


其他日志无有价值信息

根据时间分析大致是这样的一个流程:

1)由于某些原因,内网网卡自动重启2-3次后,心跳misscount累计到达设定的60次

2)数据库被踢出集群,并重启服务器

3)重启后加入集群


这个现象每个月都会出现数次,且并无特别明显的规律而言,那位朋友有遇到过类似的指点一二,不胜感激。
论坛徽章:
3
2010新春纪念徽章
日期:2010-03-01 11:19:50ITPUB9周年纪念徽章
日期:2010-10-08 09:32:25ITPUB十周年纪念徽章
日期:2011-11-01 16:20:28
发表于 2010-9-16 12:43 | 显示全部楼层
心跳还是建议用交换机

另外也有可能是网卡驱动的问题,我就碰到过,压力测试一大,心跳网卡直接DOWN掉,
更新驱动就再也没发生这个问题了

使用道具 举报

回复
论坛徽章:
5
授权会员
日期:2005-10-30 17:05:33会员2006贡献徽章
日期:2006-04-17 13:46:34ITPUB9周年纪念徽章
日期:2010-10-08 09:28:522011新春纪念徽章
日期:2011-02-18 11:43:35迷宫蛋
日期:2011-11-02 16:14:29
 楼主| 发表于 2010-9-16 15:54 | 显示全部楼层
谢谢楼上的朋友,目前已经改为交换机连接,持续观察一下,但更新驱动因没有回退方案,一直没采纳。

使用道具 举报

回复
认证徽章
论坛徽章:
76
双子座
日期:2015-07-28 14:26:072012新春纪念徽章
日期:2012-02-13 15:09:52ITPUB十周年纪念徽章
日期:2011-11-01 16:21:15鲜花蛋
日期:2011-08-26 02:02:24管理团队成员
日期:2011-05-07 01:45:082010广州亚运会纪念徽章:皮划艇
日期:2011-04-18 11:24:412011新春纪念徽章
日期:2011-02-18 11:43:342011新春纪念徽章
日期:2011-01-25 15:42:562011新春纪念徽章
日期:2011-01-25 15:42:332011新春纪念徽章
日期:2011-01-25 15:42:15
发表于 2010-9-17 07:24 | 显示全部楼层
oracle 不建议心跳线采用直连的方式
心跳之间的网络带宽也建议GB以上

使用道具 举报

回复
论坛徽章:
5
授权会员
日期:2005-10-30 17:05:33会员2006贡献徽章
日期:2006-04-17 13:46:34ITPUB9周年纪念徽章
日期:2010-10-08 09:28:522011新春纪念徽章
日期:2011-02-18 11:43:35迷宫蛋
日期:2011-11-02 16:14:29
 楼主| 发表于 2010-9-17 09:18 | 显示全部楼层
我们的心跳是千兆带宽,用NLOAD观察过内网流量,有时能达到100GBIT,感觉不太可能,这个问题挺苦恼,大家有没有继续甄别的方法介绍一二?

使用道具 举报

回复
论坛徽章:
5
授权会员
日期:2005-10-30 17:05:33会员2006贡献徽章
日期:2006-04-17 13:46:34ITPUB9周年纪念徽章
日期:2010-10-08 09:28:522011新春纪念徽章
日期:2011-02-18 11:43:35迷宫蛋
日期:2011-11-02 16:14:29
 楼主| 发表于 2010-9-17 09:23 | 显示全部楼层
前天实施了将直连改为插交换机上,昨天未发生网卡重启,但一个节点依然报心跳故障,导致另一个节点重启

OCSSD.LOG:

[    CSSD]2010-09-16 16:31:38.729 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 50 3.118297e-317artbeat fatal, eviction in 59.470 seconds
[    CSSD]2010-09-16 16:31:38.729 [1241577824] >TRACE:   clssnmPollingThread: node easrac1 (1) is impending reconfig, flag 1037, misstime 60530
[    CSSD]2010-09-16 16:31:38.729 [1241577824] >TRACE:   clssnmPollingThread: diskTimeout set to (117000)ms impending reconfig status(1)
[    CSSD]2010-09-16 16:32:08.725 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 75 3.118321e-317artbeat fatal, eviction in 29.470 seconds
[    CSSD]2010-09-16 16:32:26.728 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118345e-317artbeat fatal, eviction in 11.470 seconds
[    CSSD]2010-09-16 16:32:27.730 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118368e-317artbeat fatal, eviction in 10.460 seconds
[    CSSD]2010-09-16 16:32:28.722 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118392e-317artbeat fatal, eviction in 9.470 seconds
[    CSSD]2010-09-16 16:32:29.723 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118416e-317artbeat fatal, eviction in 8.470 seconds
[    CSSD]2010-09-16 16:32:30.726 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118440e-317artbeat fatal, eviction in 7.470 seconds
[    CSSD]2010-09-16 16:32:31.729 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118463e-317artbeat fatal, eviction in 6.470 seconds
[    CSSD]2010-09-16 16:32:32.722 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118487e-317artbeat fatal, eviction in 5.470 seconds
[    CSSD]2010-09-16 16:32:33.724 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118511e-317artbeat fatal, eviction in 4.470 seconds
[    CSSD]2010-09-16 16:32:34.726 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118534e-317artbeat fatal, eviction in 3.470 seconds
[    CSSD]2010-09-16 16:32:35.727 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118558e-317artbeat fatal, eviction in 2.470 seconds
[    CSSD]2010-09-16 16:32:36.729 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118582e-317artbeat fatal, eviction in 1.470 seconds
[    CSSD]2010-09-16 16:32:37.731 [1241577824] >WARNING: clssnmPollingThread: node easrac1 (1) at 90 3.118606e-317artbeat fatal, eviction in 0.460 seconds
[    CSSD]2010-09-16 16:32:38.193 [1241577824] >TRACE:   clssnmPollingThread: Eviction started for node easrac1 (1), flags 0x040d, state 3, wt4c 0
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmDoSyncUpdate: Initiating sync 2
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmDoSyncUpdate: diskTimeout set to (117000)ms
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmSetupAckWait: Ack message type (11)
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmSetupAckWait: node(2) is ALIVE
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmSendSync: syncSeqNo(2)
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmWaitForAcks: Ack message type(11), ackCount(1)
[    CSSD]2010-09-16 16:32:38.193 [1189128544] >TRACE:   clssnmHandleSync: diskTimeout set to (117000)ms
[    CSSD]2010-09-16 16:32:38.193 [1189128544] >TRACE:   clssnmHandleSync: Acknowledging sync: src[2] srcName[easrac2] seq[5] sync[2]
[    CSSD]2010-09-16 16:32:38.193 [2538585888] >USER:    NMEVENT_SUSPEND [00][00][00][06]
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmWaitForAcks: done, msg type(11)
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmDoSyncUpdate: Terminating node 1, easrac1, misstime(120000) state(5)
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmSetupAckWait: Ack message type (13)
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmSetupAckWait: node(2) is ACTIVE
[    CSSD]2010-09-16 16:32:38.193 [1262557536] >TRACE:   clssnmWaitForAcks: Ack message type(13), ackCount(1)
[    CSSD]2010-09-16 16:32:38.193 [1189128544] >TRACE:   clssnmSendVoteInfo: node(2) syncSeqNo(2)
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmWaitForAcks: done, msg type(13)
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmCheckDskInfo: Checking disk info...
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmEvict: Start
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmEvict: Evicting node 1, easrac1, birth 1, death 2, impendingrcfg 1, stateflags 0x40d
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmSendShutdown: req to node 1, kill time 68217204
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmDiscHelper: easrac1, node(1) connection failed, con (0x782f50), probe((nil))
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmWaitOnEvictions: Start
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmWaitOnEvictions: node 1, easrac1, undead 0
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmSetupAckWait: Ack message type (15)
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmSetupAckWait: node(2) is ACTIVE
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmSendUpdate: syncSeqNo(2)
[    CSSD]2010-09-16 16:32:38.194 [1262557536] >TRACE:   clssnmWaitForAcks: Ack message type(15), ackCount(1)
[    CSSD]2010-09-16 16:32:38.194 [1189128544] >TRACE:   clssnmUpdateNodeState: node 0, state (0/0) unique (0/0) prevConuni(0) birth (0/0) (old/new)
[    CSSD]2010-09-16 16:32:38.194 [1189128544] >TRACE:   clssnmUpdateNodeState: node 1, state (5/0) unique (1284556913/1284556913) prevConuni(1284556913) birth (1/1) (old/new)
[    CSSD]2010-09-16 16:32:38.194 [1189128544] >TRACE:   clssnmDeactivateNode: node 1 (easrac1) left cluster

[    CSSD]2010-09-16 16:32:38.194 [1189128544] >TRACE:   clssnmUpdateNodeState: node 2, state (3/3) unique (1284556908/1284556908) prevConuni(0) birth (1/1) (old/new)
[    CSSD]2010-09-16 16:32:38.194 [1189128544] >USER:    clssnmHandleUpdate: SYNC(2) from node(2) completed
[    CSSD]2010-09-16 16:32:38.194 [1189128544] >USER:    clssnmHandleUpdate: NODE 2 (easrac2) IS ACTIVE MEMBER OF CLUSTER
[    CSSD]2010-09-16 16:32:38.194 [1189128544] >TRACE:   clssnmHandleUpdate: diskTimeout set to (200000)ms
                                                                                                           72701,1

使用道具 举报

回复
论坛徽章:
3
2010新春纪念徽章
日期:2010-03-01 11:19:50ITPUB9周年纪念徽章
日期:2010-10-08 09:32:25ITPUB十周年纪念徽章
日期:2011-11-01 16:20:28
发表于 2010-9-17 10:54 | 显示全部楼层
检查下ALERT 呢,我当时是linux5.3 4台机器,我的情况是死了都死了,还不得重启

使用道具 举报

回复
论坛徽章:
5
授权会员
日期:2005-10-30 17:05:33会员2006贡献徽章
日期:2006-04-17 13:46:34ITPUB9周年纪念徽章
日期:2010-10-08 09:28:522011新春纪念徽章
日期:2011-02-18 11:43:35迷宫蛋
日期:2011-11-02 16:14:29
 楼主| 发表于 2010-9-17 13:07 | 显示全部楼层
告警日志中午任何有价值信息,所以比较棘手

使用道具 举报

回复
论坛徽章:
27
参与WIN7挑战赛纪念
日期:2009-11-06 16:05:25ITPUB元老
日期:2011-04-23 17:54:35ITPUB十周年纪念徽章
日期:2011-11-01 16:24:04奥迪
日期:2013-08-05 09:30:132015年新春福章
日期:2015-03-02 10:21:23弗兰奇
日期:2016-12-28 13:54:54
发表于 2010-9-17 23:40 | 显示全部楼层
这种问题比较郁闷。。。
我这边的rac也经常出现这种问题,,,,log基本差不多。。。
原来以后是时间不同步,。。后台配置了ntp。
但是问题还是存在!!

使用道具 举报

回复
论坛徽章:
9
ITPUB9周年纪念徽章
日期:2010-10-08 09:28:522010广州亚运会纪念徽章:击剑
日期:2010-11-03 11:00:36ITPUB十周年纪念徽章
日期:2011-11-01 16:25:512012新春纪念徽章
日期:2012-01-04 11:56:19奥运会纪念徽章:摔跤
日期:2012-08-21 10:04:04优秀写手
日期:2014-02-15 06:00:132014年新春福章
日期:2014-02-18 16:44:08马上有对象
日期:2014-02-18 16:44:08马上加薪
日期:2014-05-19 11:17:08
发表于 2010-9-19 11:42 | 显示全部楼层
原帖由 paulyibinyi 于 2010-9-17 07:24 发表
oracle 不建议心跳线采用直连的方式
心跳之间的网络带宽也建议GB以上


我想问问,为什末不建议心跳线采用直连的方式???

使用道具 举报

回复

您需要登录后才可以回帖 登录 | 注册

本版积分规则 发表回复

TOP技术积分榜 社区积分榜 徽章 团队 统计 知识索引树 积分竞拍 文本模式 帮助
  ITPUB首页 | ITPUB论坛 | 数据库技术 | 企业信息化 | 开发技术 | 微软技术 | 软件工程与项目管理 | IBM技术园地 | 行业纵向讨论 | IT招聘 | IT文档
  ChinaUnix | ChinaUnix博客 | ChinaUnix论坛
CopyRight 1999-2011 itpub.net All Right Reserved. 北京盛拓优讯信息技术有限公司版权所有 联系我们 
京ICP备09055130号-4  北京市公安局海淀分局网监中心备案编号:11010802021510 广播电视节目制作经营许可证:编号(京)字第1149号
  
快速回复 返回顶部 返回列表