生产数据库实例在晚上的时候突然重启(真是悲剧的事情)~

1.环境:
DB version:mariadb 10.0.28 x64

OS version:centos6.6 x64

kernel:2.6.32-504.el6.x86_64

系统sem:kernel.sem = 1000 40960001000 4096

2.error log
InnoDB: ###### Diagnostic info printed to the standard error stream
InnoDB: Warning: a long semaphore wait:
--Thread 139562561287936 has waited at row0ins.cc line 2730 for 923.00 seconds the semaphore:
X-lock (wait_ex) on RW-latch at 0x7ef7b756b4a0 '&block->lock'
a writer (thread id 139562561287936) has reserved it in mode  wait exclusive
number of readers 1, waiters flag 1, lock_word: ffffffffffffffff
Last time read locked in file row0sel.cc line 4152
Last time write locked in file /home/buildbot/buildbot/build/mariadb-10.0.28/storage/xtradb/row/row0ins.cc line 2730
InnoDB: Warning: a long semaphore wait:
--Thread 139562523481856 has waited at row0ins.cc line 2730 for 921.00 seconds the semaphore:
X-lock (wait_ex) on RW-latch at 0x7ef9c7eb5500 '&block->lock'
a writer (thread id 139562523481856) has reserved it in mode  wait exclusive
number of readers 1, waiters flag 1, lock_word: ffffffffffffffff
Last time read locked in file buf0flu.cc line 1069
Last time write locked in file /home/buildbot/buildbot/build/mariadb-10.0.28/storage/xtradb/btr/btr0sea.cc line 979
InnoDB: Warning: a long semaphore wait:
--Thread 139562609211136 has waited at row0ins.cc line 2730 for 911.00 seconds the semaphore:
X-lock (wait_ex) on RW-latch at 0x7ef7a6ef37e0 '&block->lock'
a writer (thread id 139562609211136) has reserved it in mode  wait exclusive
number of readers 1, waiters flag 0, lock_word: ffffffffffffffff
Last time read locked in file buf0flu.cc line 1069
Last time write locked in file /home/buildbot/buildbot/build/mariadb-10.0.28/storage/xtradb/row/row0ins.cc line 2730
InnoDB: Warning: a long semaphore wait:
--Thread 139562551437056 has waited at trx0undo.ic line 171 for 907.00 seconds the semaphore:
X-lock (wait_ex) on RW-latch at 0x7ef9c5d1e4e0 '&block->lock'
a writer (thread id 139562551437056) has reserved it in mode  wait exclusive
number of readers 1, waiters flag 0, lock_word: ffffffffffffffff
Last time read locked in file buf0flu.cc line 1069
Last time write locked in file /home/buildbot/buildbot/build/mariadb-10.0.28/storage/xtradb/include/trx0undo.ic line 171
InnoDB: Warning: a long semaphore wait:
--Thread 139560391223040 has waited at row0ins.cc line 2730 for 906.00 seconds the semaphore:
X-lock on RW-latch at 0x7ef9c7eb5500 '&block->lock'
a writer (thread id 139562523481856) has reserved it in mode  wait exclusive
number of readers 1, waiters flag 1, lock_word: ffffffffffffffff
Last time read locked in file buf0flu.cc line 1069
Last time write locked in file /home/buildbot/buildbot/build/mariadb-10.0.28/storage/xtradb/btr/btr0sea.cc line 979
InnoDB: ###### Diagnostic info printed to the standard error stream
InnoDB: Error: semaphore wait has lasted > 600 seconds
InnoDB: We intentionally crash the server, because it appears to be hung.
2017-03-30 21:11:18 7eee7d9fe700  InnoDB: Assertion failure in thread 139562774947584 in file srv0srv.cc line 2222
InnoDB: We intentionally generate a memory trap.
InnoDB: Submit a detailed bug report to http://bugs.mysql.com.
InnoDB: If you get repeated assertion failures or crashes, even
InnoDB: immediately after the mysqld startup, there may be
InnoDB: corruption in the InnoDB tablespace. Please refer to
InnoDB: http://dev.mysql.com/doc/refman/5.6/en/forcing-innodb-recovery.html
InnoDB: about forcing recovery.
3.临时方案
查看手册的时候发现自适应哈希索引可能会导致btr0sea.c文件的rw latch获取争用,从而导致SEMAPHORES问题。

详细链接:https://dev.mysql.com/doc/refman/5.7/en/innodb-adaptive-hash.html。

临时解决方案:set global innodb_adaptive_hash_index=0;

4.后续处理
给mariadb提了一个bug,发现centos6.6有坑。 Haswell-based Servers在centos6.6内核下可能会导致hang死(跟我的环境一模一样~)。详细链接
https://www.infoq.com/news/2015/05/redhat-futex
https://groups.google.com/forum/?hl=zh-Cn#!starred/codership-team/Ne6WsTWixH8
当然在后续如果出现问题希望能够及时gdb dump出来文件提供给官方进行研究吧。

Error: semaphore wait has lasted 大于 600 seconds导致数据库实例重启相关推荐

  1. Error fetching server time: Detected 31.361999988555908 seconds time difference between your browser

    prometheus页面出现如下警告 Error fetching server time: Detected 31.361999988555908 seconds time difference b ...

  2. mysql 1326_Mysql应用SQL Server 出现Error: 1326错误(管理器无法连接远程数据库)问题解决方案...

    <Mysql应用SQL Server 出现Error: 1326错误(管理器无法连接远程数据库)问题解决方案>要点: 本文介绍了Mysql应用SQL Server 出现Error: 132 ...

  3. mysql 1326_SQL Server 出现Error: 1326错误(管理器无法连接远程数据库)问题解决方案...

    SQL Server 出现Error: 1326错误 我们在在使用SQL Server时都会遇到使用SQL Server Management Studio无法连接远程数据库实例的问题,错误描述信息摘 ...

  4. sql服务器显示error,SQLServer出现Error:1326错误(管理器无法连接远程数据库)问题解决方案...

    SQL Server 出现Error: 1326错误 我们在在使用SQL Server时都会遇到使用SQL Server Management Studio无法连接远程数据库实例的问题,错误描述信息摘 ...

  5. sql服务器显示error,SQL Server 出现Error: 1326错误(管理器无法连接远程数据库)问...

    SQL Server 出现Error: 1326错误 我们在在使用SQL Server时都会遇到使用SQL Server Management Studio无法连接远程数据库实例的问题,错误描述信息摘 ...

  6. oracle goldengate 数据库实例,案例:Oracle goldengate报错ggsci: error while loading shared libraries:...

    天萃荷净 Oracle goldengate配置时报错ggsci: error while loading shared libraries 在部署goldengate过程中发现如下错误: [orac ...

  7. select每门可说大于90分_MYSQL数据库查询

    作为测试人员最常用的数据库查询语句,本篇博文做了一个具体的总结,以下只是为了概述查询方法而举的例子,并不一定是最好的查询方式. 红色标记的地方是该查询语句中重点想要表达的查询方法. 一. 新建一个数据 ...

  8. ERROR: found xmin ...... from before relfrozenxid .... ( vacuum bug导致 age 无法降低 )

    os: centos 7.6 db: postgresql 10.3 数据库的日志里记录了一个 ERROR 日志 关键信息如下 "ERROR",XX001, "found ...

  9. [ERROR]在删除BizTalk主机之前必须删除其所有实例?

    在删除时报错如下: ------------------------------------ 无法从 MessageBox 数据库服务器"BLADE14"/数据库名称"B ...

最新文章

  1. Linux音频设备驱动
  2. R语言:计算各种距离
  3. 思科路由器DHCP基础配置
  4. Linux下首次操作MySQL失败
  5. MCMC笔记:蒙特卡罗方法
  6. MySQL 5.6 Threadpool(优先队列)介绍及性能测试【转】
  7. 16x16x16 4096个RGB LED的光立方是一种怎样的神奇效果?视频展示制作全过程
  8. 最优化学习笔记(三)——梯度下降法
  9. win8系统装c语言编程软件哪个好,win8系统安装c语言方法
  10. 只有10万元预算买畅销SUV自动车型您选谁?
  11. [转载] 请不要做浮躁的人
  12. IDC:2017年全球公有云服务开支将达1225亿美元
  13. pytorch梯度的自动求取
  14. poj2594 Treasure Exploration 二分匹配之最小路径覆盖+传递闭包
  15. 龙卷风路径_关于龙卷风,看这篇文章就够了
  16. c语言ascii码16进制,C语言附录ASCII码表.doc
  17. 根据投资回报率(ROI)进行决策
  18. 大数据时代——你是否希望“被遗忘”
  19. [电脑桌面右击新建没有excel、ppt、word]
  20. Wireshark抓包分析DHCP

热门文章

  1. gtkpod + ipod-convenience 后同步 iTunes 出现 -54 未知错误的解决方法
  2. mac androidstudio 无法连接魅蓝手机进行调试
  3. 服务器网络机柜的保养维护,保养维护|服务器机柜保养维护有哪些注意细节?...
  4. 解决C# webBrowser 您还没有安装flash播放器,请点击这里安装的问题,以及webkit.net 入门
  5. SpringMVC项目添加日志
  6. AFG与AWG的比较
  7. JavaSE基础-概述
  8. Spring5——AOP
  9. PostgreSQL 源码解读(153)- 后台进程#5(walsender#1)
  10. 史上最全小程序笔记整理