arXiv-2018


文章目录

  • 1 Background and Motivation
  • 2 Related Work
  • 3 Advantages / Contributions
  • 4 Method
  • 5 Experiments
    • 5.1 Datasets
    • 5.2 Experiments and Results
  • 6 Conclusion(own) / Future work

1 Background and Motivation

CV 社区大部分聚焦于设计更好的网络结构,Less attention has been paid to finding better data augmentation methods that incorporate more invariances(teach a model about invariances in the data domain

作者旨在 automate(Reinforcement Learning) the process of finding an effective data augmentation policy for a target dataset

2 Related Work

3 Advantages / Contributions

automate 数据增广,在多个分类数据集上取得了 SOTA,且 can generalize well across different models and datasets(数据分布还是要有一定的相关性的,此前提下增广策略能屏蔽模型,跨越数据集)


Despite the observed transferability, we find that policies learned on data distributions closest to the target yield the best performance

4 Method

强化学习(Proximal Policy Optimization algorithm)搜数据增广策略,

搜索空间为

https://pillow.readthedocs.io/en/stable/reference/ImageOps.html(PIL实现)

ShearX/Y, TranslateX/Y, Rotate, AutoContrast, Invert, Equalize, Solarize, Posterize, Contrast, Color, Brightness, Sharpness, Cutout, Sample Pairing

  • invert 按给定的概率值将部分或全部通道的像素值从 v 设置为 255-v

  • Equalize 直方图均衡化

  • Solarize 在指定的阈值范围内,反转所有的像素点(高于阈值,则255-v)。

    from PIL import Image, ImageOps  # creating a image1 object
    im1 = Image.open("1.jpg")
    im2 = ImageOps.solarize(im1, threshold = 130)
    im2.show()
    
  • Posterize:保留 Image 各通道像素点数值的高 bits 位

    from PIL import Image, ImageOps
    im1 = Image.open("1.jpg")
    im2 = ImageOps.posterize(im1, bits=2)
    im2.show()
    

    128-64-32-16-8-4-2-1

    bits=1,图像各通道最大值为 128
    bits=2,图像各通道最大值为 128+64 = 192

    依次类推 bits 1~8 对应的图像最大值 128-192-224-240-248-252-254-255

16 种 data augmentation 方法(不同概率 probability——11个 values uniform spacing,不同参数配置 magnitude——10个等级 uniform spacing),部分增广方法没有 magnitude,eg:invert

每个 sub-policies 两种数据增广方法(16 选 2)串行组合——each sub-policy consisting of two image operations to be applied in sequence

一共会搜索出 5 种 sub-policies,搜索空间大致为 ( ( 16 × 11 × 10 ) 2 ) 5 = ( 16 × 11 × 10 ) 10 = 2.9 × 1 0 32 ((16 \times 11 \times 10)^2)^5 = (16 \times 11 \times 10)^{10} = 2.9 \times 10^{32} ((16×11×10)2)5=(16×11×10)10=2.9×1032

增广形式

训练时,a sub-policie is randomly chosen(5选1) for each image in each mini-batch

奖励机制,child model(a neural network trained as part of the search process) 的 acc

5 Experiments

5.1 Datasets

  • CIFAR-10,5W,reduced CIFAR-10(which consists of 4,000 randomly chosen examples, to save time for training child models during the augmentation search process)
  • CIFAR-100
  • SVHN,reduced SVHN dataset of 1,000 examples sampled randomly from the core training set.
  • ImageNet,reduced ImageNet,with 120 classes (randomly chosen) and 6,000 samples
  • Stanford Cars
  • FGVC Aircraft

5.2 Experiments and Results

1)CIFAR-10 and CIFAR-100 Results

选出来比较多的增广方式为 Equalize, AutoContrast, Color, and Brightness,而 ShearX/Y 较少

下面看看结果



再看看和半监督方法对比

作者仅用了 4000 张 labeled samples,半监督方法 use an additional 46,000 unlabeled samples in their training

效果上作者的更好

2)SVHN Results

Invert, Equalize, ShearX/Y, and Rotate 被选出来的比较多

the specific color of numbers is not as important as the relative color of the number and its background.


AutoAugment leads to more significant improvements on the reduced dataset than the full dataset(哈哈哈,数据本身才是王道,数据增广也是去增加数据的多样性)

3)ImageNet Results

focusing on color-based transformations + rotation

4)Fine Grained Visual Classification Datasets

用 table 4 的增广策略

增益效果明显,Stanford Cars 上还是 SOTA

5)Importance of Diversity in AutoAugment Policies


20 基本最好啦

6 Conclusion(own) / Future work

  • 数据增强:弹性变形(Elastic Distortion)

    CUDA加速弹性形变等图像增广(医疗图像)

  • 《Data Augmentation by Pairing Samples for Images Classification》(arXiv-2018)

  • We find that for a fixed amount of training time, it is more useful to allow child models to train for more epochs rather than train for fewer epochs with more training data.

  • 代码 [CVPR2019]AutoAugment:一种基于NAS方法的数据增强策略

【AutoAugment】《AutoAugment:Learning Augmentation Policies from Data》相关推荐

  1. 【GhostNet】《GhostNet:More Features from Cheap Operations》

    CVPR-2020 文章目录 1 Background and Motivation 2 Related Work 3 Advantages / Contributions 4 Method 4.1 ...

  2. 【RepVGG】《RepVGG:Making VGG-style ConvNets Great Again》

    CVPR-2021 文章目录 1 Background and Motivation 2 Related Work 3 Advantages / Contributions 4 Building Re ...

  3. 【MobileNet V2】《MobileNetV2:Inverted Residuals and Linear Bottlenecks》

    CVPR-2018 caffe 版本的代码:https://github.com/shicai/MobileNet-Caffe/blob/master/mobilenet_v2_deploy.prot ...

  4. 【FeatherNets】《FeatherNets:Convolutional Neural Networks as Light as Feather for Face Anti-spoofing》

    CVPR-2019 workshop code:https://github.com/SoftwareGift/FeatherNets_Face-Anti-spoofing-Attack-Detect ...

  5. 【用深度学习搜索相似服饰】《Using Deep Learning to Find Similar Dresses》by Luis Mey

    [用深度学习搜索相似服饰]<Using Deep Learning to Find Similar Dresses>by Luis Mey

  6. 【BiSeNet】《BiSeNet:Bilateral Segmentation Network for Real-time Semantic Segmentation》

    ECCV-2018 文章目录 1 Background and Motivation 2 Related Work 3 Advantages / Contributions 4 Method 5 Ex ...

  7. 【Jmeter】安装配置:Jmeter 安装插件 Redis Data Set

    目录 一.插件安装 二.插件作用 一.插件安装 使用插件管理器进行安装插件:Redis Data Set [Jmeter]安装配置:Jmeter 安装插件管理器 Plugins Manager

  8. AutoAugment: Learning Augmentation Policies from Data(一种自动数据增强技术)

    谷歌大脑提出自动数据增强方法AutoAugment:可迁移至不同数据集 近日,来自谷歌大脑的研究者在 arXiv 上发表论文,提出一种自动搜索合适数据增强策略的方法 AutoAugment,该方法创建 ...

  9. 【D2Det】《 D2Det:Towards High Quality Object Detection and Instance Segmentation》

    CVPR-2020 Pytorch Code: https://github.com/JialeCao001/D2Det. 文章目录 1 Background and Motivation 2 Rel ...

最新文章

  1. 如何查看别人(自己)电脑最近的浏览记录
  2. 每天学习python 30分钟 -了解python - 看懂#!/usr/bin/python
  3. user.config.ts文件里定义的配置是如何合并到SAP Spartacus的标准配置里去的
  4. Oracle dataguard 正常切换和应急切换
  5. 零基础学习ruby_学习Ruby:从零到英雄
  6. Sql字符串函数(1)
  7. 康乐php5.2_搭建PHP服务器调试环境套件下载-Easy2PHP5正式版下载[环境套件]-华军软件园...
  8. 【ElasticSearch】Es 源码之 NamedWriteableRegistry 源码解读
  9. java监听器的原理与实现
  10. 爱数助力国资委实现混合IT环境下的业务保护
  11. php中的print r与echo,php中echo(),print(),print_r()的区别
  12. hdu 1698 Just a Hook 基本线段树
  13. 关于c++产生string subscript out of range问题的自己的经历与解决方案
  14. vue项目使用mand mobile check选择项组点击选中,选中的列表延迟一位问题
  15. STM32/APM32 用DMA采集ADC1多通道--标准库
  16. 词频统计以及分析(词云制作)
  17. 【Java笔记+踩坑】SpringBoot基础2——运维实用
  18. 一款php开发的非常好的OA办公管理系统源码
  19. love2d之路(一)环境配置
  20. ATmega8仿真——LED 数码管的学习

热门文章

  1. ssr服务器 .json文件,vuessr 服务器端渲染
  2. 超棒的18张美女纹身肖像摄影
  3. 进入fastboot模式后,电脑连不上手机怎么办
  4. 记一次Mac上launchctl引发的问题
  5. linux桌面效果,Pekwm:一个轻量级的 Linux 桌面
  6. 女娲造人共始祖 -- Obejct类解读
  7. LosslessCut for Mac(无损视频剪切工具)
  8. 2021年化工自动化控制仪表复审考试及化工自动化控制仪表模拟试题
  9. postgreSQL数据库常用语法
  10. 可编程控制器有几种编程语言以及它们的特点