Dai, Jifeng, Kaiming He, and Jian Sun. “Instance-aware semantic segmentation via multitask network cascades.” arXiv preprint arXiv:1512.04412 (2015). (Citations: 40).

1 Motivation

All previous works require externel segmentation proposals, which are slow at test time.

2 Architecture

See Fig. We divide the task into three sub-tasks.
1. RPN. The instances can be represented by bounding boxes that are class-agnostic.
2. Estimating masks. A pixel-level mask is predicted by logistic regression for each bounding box instance. They are still class-agnostic.
3. Categorizing objects. The category-wise label is predicted for each mask-level instance.
We expect that each sub-task is simpler than the original instance segmentation task, and is more easily addressed by CNNs.

3 Cascades with More Stages
In Fast R-CNN, the classification head is trained jointly with class-wise bounding box regression head. Inspired by this practice, on stage 3, we add a regression head, , which is
a sibling layer with the classifier head. For the testing step, we first run the entire 3-stage network and obtain the regressed boxes on stage 3. These boxes are then considered as new proposals 1. Stages 2 and 3 are performed for the second time on these proposals. This is in fact 5-stage inference.

4 Training Details
Each stage involves a loss term, but a later stage’s loss relies on the output of an earlier stage. We train the entire network cascade end-to-end with a unified loss function.

5 References
[1]. https://www.youtube.com/watch?v=bUjyXASy_Jo.

[深度学习论文笔记][Instance Segmentation] Instance-aware Semantic Segmentation via Multi-task Network Cascad相关推荐

  1. [深度学习论文笔记]Pairwise Learning for Medical Image Segmentation

    [深度学习论文笔记]Pairwise Learning for Medical Image Segmentation 医学图像分割的成对学习 Published: October 2020 Publi ...

  2. 深度学习论文: Rethink Dilated Convolution for Real-time Semantic Segmentation及其PyTorch实现

    深度学习论文: Rethink Dilated Convolution for Real-time Semantic Segmentation及其PyTorch实现 Rethink Dilated C ...

  3. 深度学习论文: Lightweight and Progressively-Scalable Networks for Semantic Segmentation及其PyTorch实现

    深度学习论文: Lightweight and Progressively-Scalable Networks for Semantic Segmentation及其PyTorch实现 Lightwe ...

  4. [深度学习论文笔记]A Tri-attention Fusion Guided Multi-modal Segmentation Network

    A Tri-attention Fusion Guided Multi-modal Segmentation Network 一种三注意力融合引导的多模态分割网络 Published: 2 Nov 2 ...

  5. [深度学习论文笔记]Multi-phase Liver Tumor Segmentation with Spatial Aggregation

    Multi-phase Liver Tumor Segmentation with Spatial Aggregation and Uncertain Region Inpainting [深度学习论 ...

  6. [深度学习论文笔记]医学图像分割U型网络大合集

    [深度学习论文笔记]医学图像分割U型网络大合集 2015 U-Net: Convolutional Networks for Biomedical Image Segmentation (MICCAI ...

  7. [深度学习论文笔记]Knowledge distillation from multi-modal to mono-modal segmentation networks从多模态到单模态分割的知识提取

    Knowledge distillation from multi-modal to mono-modal segmentation networks 从多模态分割网络到单模态分割网络的知识提取 MI ...

  8. [深度学习论文笔记]UNETR: Transformers for 3D Medical Image Segmentation

    UNETR: Transformers for 3D Medical Image Segmentation UNETR:用于三维医学图像分割的Transformer Published: Oct 20 ...

  9. [深度学习论文笔记]Modality-aware Mutual Learning for Multi-modal Medical Image Segmentation

    Modality-aware Mutual Learning for Multi-modal Medical Image Segmentation 多模态医学图像分割中的模态感知互学习 Publish ...

最新文章

  1. androidid什么时候会变_今天是“三九”为什么老话说:三九冰上走一走,过年也能露一手!...
  2. mfc140dll 丢失 微软常用运行库_微软常用运行库合集 2020.9月(32amp;64位)
  3. HTML 表格中的行合并与列合并
  4. Puppeteer入门初探
  5. [html] H5的Web Storage带来什么好处?
  6. qt5.13.2输出中文乱码
  7. 教你如何窃取网络信息
  8. python的一些解释
  9. 4.算法优化 -高性能JavaScript
  10. [转]Windows环境中使用版本管理工具 Git
  11. 有参组装新转录本cufflinks_RNA-Seq流程(cutadapt-tophat2-cufflinks)
  12. java webserver demo_Java 实现 web服务器的简单实例
  13. ubuntu更改主机名
  14. [Windows]msvcr110.dll运行库缺失问题解决方案
  15. CVPR 2021 | 自监督学习新思路!S2-BNN:基于蒸馏的自监督学习算法
  16. 基于TensorRT的神经网络推理与加速
  17. 前端js使浏览器窗口全屏与退出----浏览器全屏时 监测通过esc按键退出全屏 (退出全屏时有页面上的相关处理)
  18. xpath匹配html,使用xpath获取部分字符串匹配的html标记
  19. idea打不开,双击没反应的解决方案
  20. 发现新大陆>think-addons可以在自己的应用中做模块化开发

热门文章

  1. 处理DataFrame数据——利用Groupby函数分类汇总数据并转为矩阵
  2. yate sip 服务端和客服端 搭建
  3. 招教上机计算机技能测试题,技能测试-计算机(教师招考真题)
  4. 计算机文件用英语怎么说,电脑里的文件是什么意思
  5. 美国公司连续加班36个小时,只为给华为“多发货”
  6. PIC 开发 入门 教程 MPLAB X IDE v4.05 xc8-v1.41 仿真proteus
  7. 网页中插入flv播放器
  8. 无论你做什么爸爸妈妈都爱你java_金宝晚安故事丨《无论你做什么,宝贝我一直爱你》爱孩子所有的样子...
  9. svchost.exe
  10. iOS开发:iPhone分辨率指南