|
今天又重启了两次,同样是节点2 。 按照Yong Huang 提供的文档检查是否GFS 文件系统导致的,的确是有几个项目有不满足的。 但是
不知道是不是和这个有关 。 硬件问题也同步请HP的人查看 。
Isolation of Red Hat Global File System (GFS) Issues:
If an issue is suspected by Oracle Support to be GFS software related, the issue would be transferred to
Red Hat Support after advising the customer to collect the following information required by Red Hat Support.
The collection of this information is the customers responsibility.
Please verify all of the items below to determine that a case is due to GFS software
The output of hostname and uname -n should be identical.
All systems should be able to ping each other by hostname.
Verify that the kernel is not tainted by executing lsmod.
The command rpm -qa | grep GFS should state that GFSUserToolsRPM and GFSKernelModsRPM are installed.
The command rpm -q perl-Net-Telnet should state that the perl-Net-Telnet package is installed.
Verify that the system times on all nodes/servers are within 5 minutes of each other.
If network storage is being used, all systems should be able to see attached LUNS.
The output of iptables -L should not show any traffic being prevented between any systems in the GFS environment.
Customers should be advised that the Red Hat Support requires a sysreport from all systems experiencing problems. Sysreport can be installed by running up2date sysreport and then executed by entering sysreport from a shell prompt
---------------------------------------------------
Check Items (Some items don't meet requirment) :
---------------------------------------------------
[root@hou249bbodb3112 ~]# su - oracle
hou249bbodb3112<*wmb2bprd2*/home/oracle>$
hou249bbodb3112<*wmb2bprd2*/home/oracle>$sqlplus / as sysdba
SQL> show parameter filesystemio_options
NAME TYPE VALUE
------------------------------------ ----------- ------------------------------
filesystemio_options string directIO
SQL>
1. The output of hostname and uname -n should be identical.
hou249bbodb3112<*wmb2bprd2*/home/oracle>$hostname
hou249bbodb3112
hou249bbodb3112<*wmb2bprd2*/home/oracle>$uname -a
Linux hou249bbodb3112 2.6.18-128.1.16.el5xen #1 SMP Fri Jun 26 11:10:46 EDT 2009 x86_64 x86_64 x86_64 GNU/Linux
hou249bbodb3112<*wmb2bprd2*/home/oracle>$
2. All systems should be able to ping each other by hostname.
[root@hou249bbodb3111 log]# ping hou249bbodb3112
PING hou249bbodb3112 (10.18.223.112) 56(84) bytes of data.
64 bytes from hou249bbodb3112 (10.18.223.112): icmp_seq=1 ttl=64 time=0.195 ms
64 bytes from hou249bbodb3112 (10.18.223.112): icmp_seq=2 ttl=64 time=0.162 ms
64 bytes from hou249bbodb3112 (10.18.223.112): icmp_seq=3 ttl=64 time=0.175 ms
64 bytes from hou249bbodb3112 (10.18.223.112): icmp_seq=4 ttl=64 time=0.171 ms
hou249bbodb3112<*wmb2bprd2*/home/oracle>$ping hou249bbodb3111
PING hou249bbodb3111 (10.18.223.111) 56(84) bytes of data.
64 bytes from hou249bbodb3111 (10.18.223.111): icmp_seq=1 ttl=64 time=0.187 ms
64 bytes from hou249bbodb3111 (10.18.223.111): icmp_seq=2 ttl=64 time=0.152 ms
64 bytes from hou249bbodb3111 (10.18.223.111): icmp_seq=3 ttl=64 time=0.162 ms
64 bytes from hou249bbodb3111 (10.18.223.111): icmp_seq=4 ttl=64 time=0.171 ms
64 bytes from hou249bbodb3111 (10.18.223.111): icmp_seq=5 ttl=64 time=0.159 ms
3. Verify that the kernel is not tainted by executing lsmod.
[root@hou249bbodb3112 ~]# lsmod
Module Size Used by
blktap 151653 2 [permanent]
blkbk 54777 0 [permanent]
ipt_MASQUERADE 36800 1
iptable_nat 40773 1
ip_nat 52973 2 ipt_MASQUERADE,iptable_nat
xt_state 35265 1
ip_conntrack 91237 4 ipt_MASQUERADE,iptable_nat,ip_nat,xt_state
nfnetlink 40457 2 ip_nat,ip_conntrack
ipt_REJECT 38849 2
xt_tcpudp 36289 4
iptable_filter 36161 1
ip_tables 55201 2 iptable_nat,iptable_filter
x_tables 50377 6 ipt_MASQUERADE,iptable_nat,xt_state,ipt_REJECT,xt_tcpudp,ip_tables
mptctl 63817 1
mptbase 113381 1 mptctl
ipmi_si 75680 3
ipmi_devintf 44432 6
ipmi_msghandler 72052 2 ipmi_si,ipmi_devintf
autofs4 57033 2
hidp 83521 2
gfs 324124 6
rfcomm 104809 0
l2cap 89281 10 hidp,rfcomm
bluetooth 118597 5 hidp,rfcomm,l2cap
lock_dlm 51425 0
gfs2 523820 1 lock_dlm
dlm 159425 30 gfs,lock_dlm
configfs 62301 2 dlm
bridge 91761 0
netloop 40129 0
netbk 130305 0 [permanent]
sunrpc 197513 1
bonding 120737 0
dm_multipath 55385 0
scsi_dh 41665 1 dm_multipath
video 53069 0
hwmon 36553 0
backlight 39873 1 video
sbs 49921 0
i2c_ec 38593 1 sbs
i2c_core 56129 1 i2c_ec
button 40545 0
battery 43849 0
asus_acpi 50917 0
ac 38729 0
ipv6 424737 116
xfrm_nalgo 43333 1 ipv6
crypto_api 42945 1 xfrm_nalgo
parport_pc 62313 0
lp 47121 0
parport 73293 2 parport_pc,lp
joydev 43969 0
pcspkr 36289 0
sg 69865 0
serial_core 56257 0
bnx2 207496 0
shpchp 70509 0
hpilo 43217 0
serio_raw 40517 0
ide_cd 73441 0
cdrom 68713 1 ide_cd
dm_raid45 98897 0
dm_message 36289 1 dm_raid45
dm_region_hash 46273 1 dm_raid45
dm_mem_cache 39489 1 dm_raid45
dm_snapshot 51593 0
dm_zero 35265 0
dm_mirror 54217 0
dm_log 44865 3 dm_raid45,dm_region_hash,dm_mirror
dm_mod 100497 18 dm_multipath,dm_raid45,dm_snapshot,dm_zero,dm_mirror,dm_log
ata_piix 56901 0
libata 208721 1 ata_piix
cciss 98633 3
ext3 167633 2
jbd 94001 1 ext3
uhci_hcd 57561 0
ohci_hcd 56053 0
ehci_hcd 65741 0
qla2xxx 1015212 31
sd_mod 56385 8
scsi_mod 196697 7 mptctl,scsi_dh,sg,libata,cciss,qla2xxx,sd_mod
qla2xxx_conf 334856 1
intermodule 37508 2 qla2xxx,qla2xxx_conf
[root@hou249bbodb3112 ~]#
4. The command rpm -qa | grep GFS should state that GFSUserToolsRPM and GFSKernelModsRPM are installed.
Result : Fail .
GFSUserToolsRPM and GFSKernelModsRPM are NOT installed .
[root@hou249bbodb3112 ~]# rpm -qa | grep GFS
[root@hou249bbodb3112 ~]#
[root@hou249bbodb3112 ~]#
5. The command rpm -q perl-Net-Telnet should state that the perl-Net-Telnet package is installed.
Result : OK
[root@hou249bbodb3112 ~]# rpm -q perl-Net-Telnet
perl-Net-Telnet-3.03-5
[root@hou249bbodb3112 ~]#
6. Verify that the system times on all nodes/servers are within 5 minutes of each other.
Result : OK
7. If network storage is being used, all systems should be able to see attached LUNS.
Result : OK
8. The output of iptables -L should not show any traffic being prevented between any systems in the GFS environment.
Result : Fail
[root@hou249bbodb3112 ~]# iptables -L
Chain INPUT (policy ACCEPT)
target prot opt source destination
ACCEPT udp -- anywhere anywhere udp dpt:domain
ACCEPT tcp -- anywhere anywhere tcp dpt:domain
ACCEPT udp -- anywhere anywhere udp dpt:bootps
ACCEPT tcp -- anywhere anywhere tcp dpt:bootps
Chain FORWARD (policy ACCEPT)
target prot opt source destination
ACCEPT all -- anywhere 192.168.122.0/24 state RELATED,ESTABLISHED
ACCEPT all -- 192.168.122.0/24 anywhere
ACCEPT all -- anywhere anywhere
REJECT all -- anywhere anywhere reject-with icmp-port-unreachable
REJECT all -- anywhere anywhere reject-with icmp-port-unreachable
Chain OUTPUT (policy ACCEPT)
target prot opt source destination
[root@hou249bbodb3112 ~]#
[root@hou249bbodb3111 log]# iptables -L
Chain INPUT (policy ACCEPT)
target prot opt source destination
ACCEPT udp -- anywhere anywhere udp dpt:domain
ACCEPT tcp -- anywhere anywhere tcp dpt:domain
ACCEPT udp -- anywhere anywhere udp dpt:bootps
ACCEPT tcp -- anywhere anywhere tcp dpt:bootps
Chain FORWARD (policy ACCEPT)
target prot opt source destination
ACCEPT all -- anywhere 192.168.122.0/24 state RELATED,ESTABLISHED
ACCEPT all -- 192.168.122.0/24 anywhere
ACCEPT all -- anywhere anywhere
REJECT all -- anywhere anywhere reject-with icmp-port-unreachable
REJECT all -- anywhere anywhere reject-with icmp-port-unreachable
Chain OUTPUT (policy ACCEPT)
target prot opt source destination
[root@hou249bbodb3111 log]# |
|