ASM diskgroup dismount with "Waited 15 secs for write IO to PST" (文档 ID 1581684.1) 转到底部

In this Document

Symptoms
  Cause
  Solution
  References

APPLIES TO:

Oracle Database - Enterprise Edition - Version 11.2.0.3 to 12.1.0.1 [Release 11.2 to 12.1]
Information in this document applies to any platform.

SYMPTOMS

Normal or high redundancy diskgroup is dismounted with these WARNING messages.

//ASM alert.log

Mon Jul 01 09:10:47 2013
WARNING: Waited 15 secs for write IO to PST disk 1 in group 6.
WARNING: Waited 15 secs for write IO to PST disk 4 in group 6.
WARNING: Waited 15 secs for write IO to PST disk 1 in group 6.
WARNING: Waited 15 secs for write IO to PST disk 4 in group 6.
....
GMON dismounting group 6 at 72 for pid 44, osid 8782162

CAUSE

Generally this kind messages comes in ASM alertlog file on below situations,

Delayed ASM PST heart beats on ASM disks in normal or high redundancy diskgroup,
thus the ASM instance dismount the diskgroup.
By default, it is 15 seconds.

By the way the heart beat delays are sort of ignored for external redundancy diskgroup.
ASM instance stop issuing more PST heart beat until it succeeds PST revalidation,
but the heart beat delays do not dismount external redundancy diskgroup directly.

The ASM disk could go into unresponsiveness, normally in the following scenarios:

+    Some of the paths of the physical paths of the multipath device are offline or lost
+    During path 'failover' in a multipath set up
+    Server load, or any sort of storage/multipath/OS maintenance

The Doc ID 10109915.8  briefs about Bug 10109915(this fix introduce this underscore parameter). And the issue is with no OS/Storage tunable timeout mechanism in a case of a Hung NFS Server/Filer. And then  _asm_hbeatiowait  helps in setting the time out.

SOLUTION

1]    Check with OS and Storage admin that there is disk unresponsiveness.

2]    Possibly keep the disk responsiveness to below 15 seconds.

This will depend on various factors like
+    Operating System
+    Presence of Multipath ( and Multipath Type )
+    Any kernel parameter

So you need to find out, what is the 'maximum' possible disk unresponsiveness for your set up.

For example, on AIX  rw_timeout  setting affects this and defaults to 30 seconds.

Another example is Linux with native multipathing. In such set up, number of physical paths and  polling_interval value in multipath.conf file, will dictate this maximum disk unresponsiveness.

So for your set up ( combination of OS / multipath / storage ), you need to find out this.

3]    If you can not keep the disk unresponsiveness to below 15 seconds, then the below parameter can be set in the ASM instance ( on all the Nodes of RAC ):

_asm_hbeatiowait
    
As per internal bug 17274537 , based on internal testing the value should be increased to 120 secs, the same will be fixed in 12.2

Run below in asm instance to set desired value for _asm_hbeatiowait

alter system set "_asm_hbeatiowait"=<value> scope=spfile sid='*';

And then restart asm instance / crs, to take new parameter value in effect.

REFERENCES

BUG:17043894 - DISKGROUP DISMOUNTS IF 2 OUT OF 8 PATHS LOST
BUG:10109915 - ASM HANGS IN HIGH REDUNDANCY CONFIG IF 1 OF 5 DISKS GOES OFFLINE
NOTE:1910315.1 - How to Create a Normal Redundancy Diskgroup Best Practices

ASM diskgroup dismount with Waited 15 secs for write IO to PST (文档 ID 1581684.1)相关推荐

  1. 【ORACLE】RAC 磁盘超时,导致数据库重启 WARNING: Waited 15 secs for write IO to PST disk 0 in group 1.

    项目场景: 采用云资源上部署的oracle RAC 11.2.0.4数据库两节点不定期重启 问题描述 现场反馈,数据库两节点不断重启,检查crs,无重大报错.检查asm日志,发现如下报错. Fri S ...

  2. WARNING: Waited 15 secs for write IO to PST disk 4 in group 3 in alert_asm.log

    昨天同事遇到一个 2节点,基于AIX 7.1的的ASM ocr访问超时的问题,Node2无法正常访问,检查Node2的alert_asm.log日志如下: Reference :ASM diskgro ...

  3. Troubleshooting: WAITED TOO LONG FOR A ROW CACHE ENQUEUE LOCK! (文档 ID 278316.1)

    目录(?)[-] APPLIES TO PURPOSE TROUBLESHOOTING STEPS What is a Row Cache Enqueue Lock What is the meani ...

  4. 【python办公自动化(15)】python编程实现Word文档内容样式批量调整(会议议程模板的创建)

    对文字字体样式进行修改 run.font.样式 = xxx import os os.chdir('D:\\python_major\\auto_office15') from docx import ...

  5. linux清除asm磁盘分区,ASM Diskgroup添加与删除

    ENV:10G RAC 系统架设有2年多了,当初规划的8TB硬盘所剩无几,随即申请停机时间,添加ASM DiskGroup --搭建一套与实体机一样的RAC,做了测试,这是测试记录-- --添加方法: ...

  6. 针对11.2 RAC丢失OCR和Votedisk所在ASM Diskgroup的恢复手段

    本文转载自:http://www.askmaclean.com/archives/11-2-lost-ocr-votedisk-group-recovery.html 之前有同学在我的Oracle A ...

  7. oracle rebalance参数,【案例】Oracle ASM扩展新LAN加入asm diskgroup asm rebalance 原理

    天萃荷净 Oracle研究中心案例分析:运维DBA反映Oracle数据库的ASM空间不足,需要扩展.通过划新的LAN加入asm diskgroup并分析asm rebalance 原理. 本站文章除注 ...

  8. oracle 函数 disk,Oracle能够识别asm diskgroup,但却不能够访问disk

    初步怀疑是asm的问题, asm diskgroup虽然mount上了, 但是diskgroup下的disk没有打开?! 以下是asm的后台日志: SQL> alter diskgroup RA ...

  9. oracle 12c dbca 无法发现 asm diskgroup

    DBCA 日志 路径 [root@oracle73 ~]# cd /u01/app/oracle/cfgtoollogs/dbca/ [root@oracle73 dbca]# ll total 17 ...

最新文章

  1. [转载]AIX 上 Lotus Domino 的内存使用
  2. 信息安全之仿射密码加密和解密
  3. UVA - 540:Team Queue
  4. 使用awk取passwd10-20行然后重定向
  5. linux同步数据库,Linux两台服务器mysql数据库同步
  6. python csv合并单元格_python中合并表格的两种方法
  7. 妈妈别哭,我去了天堂[组图]【值得你去看,值得你去深思】
  8. Oracle之:查询锁表,删除锁表
  9. SpriteBuilder改变布局后App运行出错代码排查
  10. 使用vs2005经验与教训(关于masterpage,menu,gridview及对exce的l操作)
  11. 脑电时频分析II:时频分析
  12. vue侧边栏菜单一二级模板
  13. 学习 stm32(TTL)串口通信控制16路舵机控制板(维特智能)
  14. 8个超好用的免费工具/软件/网站
  15. redis设置过期时间与直接detele key有什么区别
  16. 弗洛伊德算法(求最短路径)
  17. P2p流媒体直播与视频会议有什么不同?
  18. 39.SSH远程终端连接工具
  19. 可爱的EGO所有中文游戏下载
  20. Windows 2000系统编程 (1)

热门文章

  1. 更换oracle nls,oracle_更改数据库字符集 (NLS_CHARACTERSET)
  2. 软件工程大作业:自动售货机系统
  3. 【千纸诗书】—— PHP/MySQL二手书网站后台开发之项目设计
  4. 使用WebGL 自定义 3D 摄像头监控模型
  5. timesat数据如何读取_判二手车调表车另类方法。如何利用OBD读取可靠数据(技术类)...
  6. 主板显卡服务器维修,【转】电脑主板待机供电和显卡供电电路的维修
  7. 《数据结构》邓俊辉 网课习题详细解析(第五章:二叉树)
  8. 写好的java项目如何部署在公司服务器上,并能让局域网内其他同事访问到,以及遇到的坑!
  9. MacBook雷电3接口失灵不可用
  10. 基于51单片机数码管显示