原以为:categorical data是离散型数据,numerical data是连接型数据;其实错的离谱;

本质上来讲,是能不能用来计算;categorical是不能用来计算的,比如性别;而数值型数据是可以用来计算的,数值型数据包括离散型与连续型的;比如说统计抛硬币的次数,那只能是1,2,3,4等整数表示;

Most data fall into one of two groups: numerical or categorical.

  • Numerical data. These data have meaning as a measurement, such as a person’s height, weight, IQ, or blood pressure; or they’re a count, such as the number of stock shares a person owns, how many teeth a dog has, or how many pages you can read of your favorite book before you fall asleep. (Statisticians also call numerical data quantitative data.)

    Numerical data can be further broken into two types: discrete and continuous.

    • Discrete data represent items that can be counted; they take on possible values that can be listed out. The list of possible values may be fixed (also called finite); or it may go from 0, 1, 2, on to infinity (making it countably infinite). For example, the number of heads in 100 coin flips takes on values from 0 through 100 (finite case), but the number of flips needed to get 100 heads takes on values from 100 (the fastest scenario) on up to infinity (if you never get to that 100th heads). Its possible values are listed as 100, 101, 102, 103, . . . (representing the countably infinite case).

    • Continuous data represent measurements; their possible values cannot be counted and can only be described using intervals on the real number line. For example, the exact amount of gas purchased at the pump for cars with 20-gallon tanks would be continuous data from 0 gallons to 20 gallons, represented by the interval [0, 20], inclusive. You might pump 8.40 gallons, or 8.41, or 8.414863 gallons, or any possible number from 0 to 20. In this way, continuous data can be thought of as being uncountably infinite. For ease of recordkeeping, statisticians usually pick some point in the number to round off. Another example would be that the lifetime of a C battery can be anywhere from 0 hours to an infinite number of hours (if it lasts forever), technically, with all possible values in between. Granted, you don’t expect a battery to last more than a few hundred hours, but no one can put a cap on how long it can go (remember the Energizer Bunny?).

  • Categorical data: Categorical data represent characteristics such as a person’s gender, marital status, hometown, or the types of movies they like. Categorical data can take on numerical values (such as “1” indicating male and “2” indicating female), but those numbers don’t have mathematical meaning. You couldn’t add them together, for example. (Other names for categorical data are qualitative data, or Yes/No data.)
    定义来自于 statistic书籍上的

categorical data and numerical data相关推荐

  1. R语言数据描述性统计(Descriptive statistics)实战:数据全局描述信息、数值数据的描述性统计(Numerical data)、离散型数据的描述性统计(Categorical)

    R语言数据描述性统计(Descriptive statistics)实战:数据全局描述信息.数值数据的描述性统计(Numerical data).离散型数据的描述性统计(Categorical) 目录

  2. 编码 data:text/html;c,关于 Data URI Scheme -- data:image/jpg;base64

    转载一篇大神的文章 大家可能注意到了,网页上有些图片的src或css背景图片的url后面跟了一大串字符,比如: data:image/jpeg;base64,/9j/4QAYRXhpZgAASUkqA ...

  3. 编译器编译16bit单片机程序对数组data[0]=0x12,data[1]=0xBD,data[2]=0x00,data[3]=0x45转换成16进制整数12BD0045的报警

    16bit单片机程序: 定义32bit一个变量,uint32_t  decData = 0; 刚开始时在16bit和32bit单片机中都这样写: decData = ( data[0] << ...

  4. Nominal data 与Ordinal data区别

    Nominal data 与Ordinal data均指向类别数据:如果类别数据不存在排序问题就是norminal data:如果存在 排序就是ordinal data

  5. DATA URI schema(data:base64)协议常用数据格式

    data:,<文本数据> data:text/plain,<文本数据> data:text/html,<HTML代码> data:text/html;base64, ...

  6. 数据湖 data lake_在Data Lake中高效更新TB级数据的模式

    数据湖 data lake GOAL: This post discusses SQL "UPDATE" statement equivalent for a data lake ...

  7. tableau for循环_Tableau for Data Science and Data Visualization-速成课程

    tableau for循环 Tableau is software that can help you see and understand your data. It is used for dat ...

  8. OCM_第十九天课程:Section9 —》Data Guard _ DATA GUARD 原理/DATA GUARD 应用/DATA GUARD 搭建...

    注:本文为原著(其内容来自 腾科教育培训课堂).阅读本文注意事项如下: 1:所有文章的转载请标注本文出处. 2:本文非本人不得用于商业用途.违者将承当相应法律责任. 3:该系列文章目录列表: 一:&l ...

  9. vue中this.init用法_Vue中的this.$options.data()和this.$data用法说明

    问题 项目里遇到一个问题,用this.$options.data()重置组件data时,data()里用this获取的props或method都为undefined,代码简化如下: export de ...

最新文章

  1. Springboot@Configuration和@Bean详解
  2. 简单干净的C#方法设计案例:SFCUI.AjaxLoadPage()之二
  3. docker启动nginx,并挂载目录及docker容器中文乱码
  4. Windows 下 MySQL-python 的安装
  5. REST资源何时应获得其自己的地址?
  6. 写屏障是什么_面试官为什么问内存模型总离不开final关键字,该如何应对?
  7. Hive中的数据库、表、数据与HDFS的对应关系
  8. @bean注解和@component注解的区别_通过源码查看 @Component,@Service 等注解是如何被解析的...
  9. Forbidden You don't have permission to access / on this server.解决方法
  10. java for index_Java IBarDataSet.getEntryForIndex方法代码示例
  11. 鸿蒙系统 oppo,OPPO公关抵制鸿蒙系统,结果被骂到辞职了,千亿营销确定了?
  12. 使用JavaParser进行java源码解析
  13. pip install 时 WARNING: No metadata found in e:\anaconda\lib\site-packages 问题解决
  14. 模拟斗地主洗牌发牌,并对已发好的拍进行排序(红桃A,方块A, 黑桃2.......)
  15. [转]语言模型训练工具SRILM
  16. 智能优化算法:斑点鬣狗优化算法-附代码
  17. learning rate对深度模型的影响:论文阅读Cyclical Learning Rates for Training Neural Networks
  18. Linux基础 - DNS服务基础
  19. 服务器主板显示不了独立显卡,独立显卡故障——如何才知道显卡和主板不兼容...
  20. 注意力机制的分类 | Soft Attention和Hard Attention

热门文章

  1. 5道Python函数练习
  2. Python:为什么只有一个元素的tuple要加逗号?
  3. python 字典+列表集合+文件读取
  4. C语言多线程教程(pthread)(线程创建pthread_t,指定线程run方法pthread_create,加mutex锁,解锁,伪共享 false sharing【假共享】)
  5. YunYang1994/tensorflow-yolov3 IndexError: list index out of range 解决办法
  6. python Matplotlib.pyplot 如何绘制三维折线图, 散点图, 线框图, 表面图, 柱状图, 箭头图, 2D转3D图, 文本图, 3D拼图, 网状图, 直方图, 角面片图, 条状图?
  7. python 计算 IOU
  8. 初学__Python——Python中文支持、Python计算器
  9. linux——Firewalld与iptables的基本配置
  10. php5.4 win10 mysql_win10本地搭建apache+php+mysql运行环境