图像分类 Image Classification

The task of object classification requires binary labels indicating whether objects are present in an image.[1] 图像分类,该任务需要我们对出现在某幅图像中的物体做标注。比如一共有1000个物体类,对一幅图中所有物体来说,某个物体要么有,要么没有。可实现:输入一幅测试图片,输出该图片中物体类别的候选集。

物体检测 Object detection

Detecting an object entails both stating that an object belonging to a specified class is present, and localizing it in the image. The location of an object is typically represented by a bounding box. 物体检测,包含两个问题,一是判断属于某个特定类的物体是否出现在图中;二是对该物体定位,定位常用表征就是物体的边界框。可实现:输入测试图片,输出检测到的物体类别和位置。

语义分割 Semantic scene labeling

The task of labeling semantic objects in a scene requires that each pixel of an image be labeled as belonging to a category, such as sky, chair, floor, street, etc. In contrast to the detection task, individual instances of objects do not need to be segmented. 语义标注/分割:该任务需要将图中每一点像素标注为某个物体类别。同一物体的不同实例不需要单独分割出来。对下图,标注为人,羊,狗,草地。而不需要羊1,羊2,羊3,羊4,羊5.

实例分割 Instance segment

实例分割是物体检测+语义分割的综合体。相对物体检测的边界框,实例分割可精确到物体的边缘;相对语义分割,实例分割可以标注出图上同一物体的不同个体(羊1,羊2,羊3…)

4种任务的数据集标注示例如图示。可以看到,标注越来越复杂,但是处理效果越来越有用。

Object Segmentation

one of the reasons that this has fallen out of favor in the research community is because it is problematically vague. Object segmentation used to simply mean finding a single or small number of objects in an image and draw a boundary around them, and for most purposes you can still assume it means this. However, it also began to be used to mean segmentation of blobs that might be objects, segmentation of objects from the background (more commonly now called background subtraction or background segmentation or foreground detection), and even in some cases used interchangeably with object recognition using bounding boxes (this quickly stopped with the advent of deep neural network approaches to object recognition, but beforehand object recognition could also mean simply labeling an entire image with the object in it).

What makes “segmentation” “semantic”?

Simpy, each segment, or in the case of deep methods each pixel, is given a class label based on a category. Segmentation in general is just the division of the image by some rule. Meanshift segmentation, for example, from a very high level divide the data according to the changes in the energy of the image. Graph cut based segmentation is similarly not learned but directly derived from the properties of each image separate from the rest. More recent (neural network based) methods use pixels that are labeled to learn to identify the local features which are associated with specific classes, and then classify each pixel based on which class has the highest confidence for that pixel. In this way, “pixel-labeling” is actually more honest name for the task, and the “segmentation” component is emergent.

Instance Segmentation

Arguably the most difficult, relevant, and original meaning of Object Segmentation, “instance segmentation” means the segmentation of the individual objects within a scene, regardless of if they are the same type. However, one of the reason this is so difficult is because from a vision perspective (and in some ways a philosophical one) what makes an “object” instance is not entirely clear. Are body parts objects? Should such “part-objects” be segmented at all by an instance segmentation algorithm? Should they be only segmented if they are seen separate from the whole? What about compound objects should two things clearly adjoined but separable be one object or two (is a rock glued to the top of a stick an ax, a hammer, or just a stick and a rock unless properly made?). Also, it isn’t clear how to distinguish instances. Is a will a separate instance from the other walls it is attached to? What order should instances be counted in? As they appear? Proximity to the viewpoint? In spite of these difficulties, segmentation of objects is still a big deal because as humans we interact with objects all the time regardless of their “class label” (using random objects around you as paper weights, sitting on things that are not chairs), and so some dataset do attempt to get at this problem, but the main reason there isn’t much attention given to the problem yet is because it isn’t well enough defined.

Scene Parsing/Scene labeling

Scene Parsing is the strictly segmentation approach to scene labeling, which also has some vagueness problems of its own. Historically, scene labeling meant to divide the entire “scene” (image) up into segments and give them all a class label. However, it was also used to mean giving class labels to areas of the image without explicitly segmenting them. With respect to segmentation, “semantic segmentation” does not imply dividing the entire scene. For semantic segmentation, the algorithm is intended to segment only the objects it knows, and will be penalized by its loss function for labeling pixels that don’t have any label. For example the MS-COCO dataset is a dataset for semantic segmentation where only some objects are segmented.

图像分类,物体检测,语义分割,实例分割等概念相关推荐

  1. CV之IS:计算机视觉之图像分割(Image Segmentation)算法的挑战任务、算法演化、目标检测和图像分割(语义分割/实例分割/全景分割)的对比

    CV之IS:计算机视觉之图像分割(Image Segmentation)算法的挑战任务.算法演化.目标检测和图像分割(语义分割/实例分割/全景分割)的对比 相关文章 CV之IS:计算机视觉之图像分割( ...

  2. 基于三维点云场景的语义及实例分割:RandLA-Net和3D-BoNet

    本文整理自牛津大学的杨波和胡庆拥在深蓝学院的线上公开课论文演讲.两位作者都是来自牛津大学的博士生,RandLA-Net和3D-BoNet分别发表在CVPR20和NeurIPS19上,两份工作都具有非常 ...

  3. 视频语义显著实例分割数据集SESIV

    视频区域中,显著区域中有语义意义的组建,即语义显著实例.为解决视频语义显著实例分割,来自日本SOKENDAI和National Institute of Informatics的学者构建了一个新的数据 ...

  4. matlab 图像分割 提取人像_几种典型的图像处理技术(分类 目标提取 目标跟踪 语义分割 实例分割)...

    目前,在处理海上无人艇的感知系统的开发.其中,比较核心的一个部分就是基于光电的目标识别(Object Detection). 任务目标 如图所示,从包含舰艇的图片中,标识舰艇的位置 本文的学习笔记,主 ...

  5. 2021-08-27 分割之后mask在原图中提取出来(语义分割—实例分割)

    1.生成分割后的mask 原图和mask如下: 2.mask二值化 from PIL import Image import matplotlib.pyplot as plt import numpy ...

  6. 计算机视觉:图像分类、物体检测、图像语义分割、实例分割和全景分割

    计算机视觉 图像分类 Image Classification 物体检测 Object Detection/ Object Localization 语义分割 Semantic Segmentatio ...

  7. 计算机视觉六大技术:图像分类、目标检测、目标跟踪、语义分割、实例分割、影像重建..

    计算机视觉是使用计算机及相关设备对生物视觉的一种模拟.它的主要任务就是通过对采集的图片或视频进行处理以获得相应场景的三维信息,就像人类和许多其他类生物每天所做的那样. 计算机视觉既是工程领域,也是科学 ...

  8. 计算机视觉三大基本任务:分类、检测(定位)、分割(语义和实例)

    前言 刚刚接触计算机视觉时可能会对不同的任务的区分以及网络架构的选择产生迷惑,因此,在此总结了相关的基础知识.在本文中,我们试图回答两个问题: 不同任务要做的事情是什么,研究范畴是什么? 不同的任务需 ...

  9. CVPR2020:三维实例分割与目标检测

    CVPR2020:三维实例分割与目标检测 Joint 3D Instance Segmentation and Object Detection for Autonomous Driving 论文地址 ...

最新文章

  1. 你也许不知道的Vuejs - 使用ES6快乐的玩耍
  2. hadoop jetty的应用
  3. 【ML】 李宏毅机器学习一:Regression、Gradient Descent(python实现)
  4. [渝粤教育] 天津城建大学 混凝土结构设计原理A 参考 资料
  5. 安装perl5.10.0
  6. 静态库-动态库混合编译
  7. C语言函数二维数组传递方法
  8. c语言冒泡排序法程序填空_五种数组的排序方法(C语言)Part1
  9. PHP 获取微信小程序接口B二维码接口
  10. ubuntu CA安装 证书申请
  11. Laya 【U3D场景的 .ls场景】 【Laya.URL.basePath远程加载】 【加载分包】,【进度条】
  12. Design Pattern: Visitor and Acyclic Visitor and their differences
  13. 小明的调查作业java_小明的调查作业
  14. 微信小程序与普通网页开发的区别
  15. Office365 - 如何查询email是否发送成功
  16. th themlefy 计算_航程计算的数学模型
  17. Android彩信数据库分析
  18. 在冷风中我凌乱了半小时,只因健康码刷不出来
  19. 练习3-4 统计字符 (15分)
  20. 以太网交换机 VLAN 生成树协议

热门文章

  1. Windows环境FTP服务用户隔离设置(支持多用户登录)
  2. 机器理解大数据秘密:聚类算法深度剖析
  3. 如何配置自己的台式机
  4. 基于Pytorch实现Seq2Seq模型进行机器翻译(实现了Attention机制、编码器解码器的多层双向结构)
  5. Veeam Backup Replication v11比“十全十美”多了哪一点?
  6. Nacos 服务端健康检查及客户端服务订阅机制源码分析(三)
  7. LruCache之LruCache分析
  8. 微博开放平台接口整理系列--短链转长链
  9. LeetCode-Python-1544. 整理字符串(栈)
  10. 移动硬盘无法识别,参数错误的问题## 标题