NMI stands for Non-Maskable Interrupt, the same term met in DSP and similar coursework: an interrupt that cannot be masked.

The reboot options in Dell iDRAC include an NMI (non-maskable interrupt) entry. I did not know what it was for, hence these notes.

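In plain terms: to make use of the NMI (non-maskable interrupt) option, a crash dump directory usually has to be configured in advance. When a hardware fault occurs, restarting the machine this way saves a crash dump file that the hardware vendor can use to analyze the cause of the fault.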

  • Crash dump: the contents of memory written out after a system crash or fault and kept for later analysis.

I. What is an NMI (non-maskable interrupt)?

Source: https://wiki.osdev.org/Non_Maskable_Interrupt
NMI occur for RAM errors and unrecoverable hardware problems. For newer computers these things may be handled using machine check exceptions and/or SMI. For the newest chipsets (at least for Intel) there’s also a pile of TCO stuff (“total cost of ownership”) that is tied into it all (with a special “TCO IRQ” and connections to SMI/SMM, etc). Somehow all of the TCO stuff is/can be connected to an onboard ethernet controller, and (at least part of it) is intended for remote monitoring of the system. Unfortunately the chipset documentation I’ve been reading can’t tell me how BIOSs normally configure the chipset, and the chipsets themselves support several different options in each case. For example, for a RAM error it could be handled by the chipset itself, it could generate an SMI (where the BIOS/SMM handler does “RAM scrubbing” in software), it could generate a “TCO interrupt”, etc. If you add it all up it’s a huge complex mess (TCO + SMI + SMBus + northbridge + PCI bus/controller/s + PCI-to-LPC-bridge + god-knows-what) that can be completely different between motherboards (even motherboards with the same chipset).

The short version of this story is that there’s only really 2 reasons for an NMI. The first reason is a hardware failure. The second reason is a “watchdog timer”, which can be used to detect when the kernel itself locks up (and is sometimes also used for more accurate profiling as it allows EIP to be sampled even when IRQs are disabled).

If a hardware failure caused an NMI then there’s no way to figure out which piece of hardware caused the NMI. In this case I’d try to do the least possible in an attempt to tell the user that a hardware failure occurred, but at the end of the day you can’t expect any OS to work sanely on faulty hardware and there’s nothing software can do to work around the hardware failure anyway.

For the watchdog timer, it must be setup by the OS first. This can actually be done even when the chipset itself doesn’t have a special watchdog timer for it (e.g. setting the PIT, RTC/CMOS IRQ or a HPET IRQ to “NMI, send to all CPUs” in the I/O APIC). In this case you want the watchdog timer to be fast (i.e. no slow hardware task switching and cache flushing) and you’d also want all CPUs to share the same timer, which means all CPUs would receive the same IRQ at the same time (which brings me back to the busy flag in your TSS).

As an alternative, you could also use the local APIC’s timer or the performance monitoring counter overflow for a “per CPU” watchdog timer. Unfortunately these things are usually used for other purposes.

  • In short: NMIs are raised by RAM errors and unrecoverable hardware problems…

II. NMI in Linux
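When the system hangs and stops responding, an NMI can be triggered manually to reset it. If kdump has been configured beforehand, a crash dump is saved for analysis. Some servers provide an NMI button. Blade servers usually do not, but the NMI can be triggered with an iLO command.

Linux also provides a mechanism called the "NMI watchdog", which detects whether the system has stopped responding (also called a lockup) and can be configured to trigger a panic automatically when a lockup occurs. It works by generating NMIs periodically: the NMI handler checks how many hrtimer interrupts have occurred, and if that count has not advanced within a certain period, the system is considered unresponsive and the panic routine is called.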
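The detection logic described above can be modeled in a few lines. This is a toy Python simulation, not kernel code: the names `LockupDetector`, `hrtimer_tick`, and `nmi_check` are invented for illustration, and a real kernel keeps such counters per CPU.

```python
class LockupDetector:
    """Toy model of the NMI watchdog's lockup check (hypothetical names)."""

    def __init__(self):
        self.hrtimer_interrupts = 0  # bumped each time the hrtimer interrupt runs
        self.saved_interrupts = 0    # count observed by the previous NMI

    def hrtimer_tick(self):
        """Simulate one hrtimer interrupt; it only runs while the CPU still services interrupts."""
        self.hrtimer_interrupts += 1

    def nmi_check(self):
        """Simulate the periodic NMI handler; return True when a lockup is detected."""
        if self.hrtimer_interrupts == self.saved_interrupts:
            # The counter stalled between two NMIs: the system is stuck.
            return True
        self.saved_interrupts = self.hrtimer_interrupts
        return False


detector = LockupDetector()
detector.hrtimer_tick()
healthy = detector.nmi_check()  # the counter advanced since the last NMI
stuck = detector.nmi_check()    # no tick happened in between
print(healthy, stuck)
```

In this model the first check reports no lockup and the second one does, which is the point at which a real watchdog would call the panic routine.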
The NMI watchdog is switched on or off through the kernel parameter kernel.nmi_watchdog, or by adding "nmi_watchdog=1" to the boot parameters, for example by editing /boot/grub/menu.lst on RHEL.
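To check which setting is actually in effect, both places can be inspected from user space. The sketch below assumes a Linux system where the sysctl is exposed as `/proc/sys/kernel/nmi_watchdog` and the boot parameters as `/proc/cmdline`. The helper names are made up for this note.

```python
from pathlib import Path
from typing import Optional


def nmi_watchdog_from_cmdline(cmdline: str) -> Optional[str]:
    """Return the value of the last nmi_watchdog= boot parameter, or None if absent."""
    value = None
    for token in cmdline.split():
        if token.startswith("nmi_watchdog="):
            value = token.split("=", 1)[1]
    return value


def read_text_or_none(path: str) -> Optional[str]:
    """Read a small text file, returning None if it is missing or unreadable."""
    try:
        return Path(path).read_text().strip()
    except OSError:
        return None


if __name__ == "__main__":
    cmdline = read_text_or_none("/proc/cmdline") or ""
    print("boot parameter:", nmi_watchdog_from_cmdline(cmdline))
    print("sysctl value:", read_text_or_none("/proc/sys/kernel/nmi_watchdog"))
```

On a machine without these files (or a non-Linux system) both lookups simply report `None`.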
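  • The Linux kernel broadly sorts NMIs into three categories: memory parity errors, handled by mem_parity_error(); bus data corruption, handled by io_check_error(); and everything else, which falls under unknown_nmi_error().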
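The three-way split can be pictured as a simple dispatch on a reason byte. The Python sketch below is an illustration only: it assumes the conventional x86 layout in which bit 7 of the NMI status byte signals a parity/system error and bit 6 an I/O check error, a detail the note above does not spell out. `classify_nmi` is a hypothetical helper, and only the three handler names come from the text.

```python
from typing import List

# Assumed bit layout of the x86 NMI status byte (not stated in this note).
NMI_REASON_SERR = 0x80   # bit 7: memory parity / system error
NMI_REASON_IOCHK = 0x40  # bit 6: I/O channel check (bus data corruption)


def classify_nmi(reason: int) -> List[str]:
    """Return the names of the handlers that would be invoked for this reason byte."""
    handlers = []
    if reason & NMI_REASON_SERR:
        handlers.append("mem_parity_error")
    if reason & NMI_REASON_IOCHK:
        handlers.append("io_check_error")
    if not handlers:
        # Neither known cause is flagged, so the NMI is treated as unknown.
        handlers.append("unknown_nmi_error")
    return handlers


print(classify_nmi(0x80))
print(classify_nmi(0x00))
```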
