`loc`,`iloc`,`ix`三者间的区别和联系

`loc`

.loc is primarily label based, but may also be used with a boolean array.

就是说，loc方法主要是用label来选择数据的。^[1]

A single label, e.g. 5 or 'a', (note that 5 is interpreted as a label of the index. This use is not an integer position along the index)
A list or array of labels ['a', 'b', 'c']
A slice object with labels 'a':'f', (note that contrary to usual python slices, both the start and the stop are included!)
A boolean array

总的形式还是要保持的df[xx:xx,xx:xx]，只不过这里边可以不用切片，但是中间的,还是很关键的。可以不写,，那么，就表示取某一行。但是，不能表示取某一列。

import pandas as pd

import numpy as np

test=pd.DataFrame(np.random.randn(20).reshape(4,5),index=['A','B','C','D'],columns=['E','F','G','H','I'])

test

Out[4]:

          E         F         G         H         I

A -0.833316 -1.982666  1.055594  0.781759 -0.107631

B -1.514709 -1.422883  0.204399 -0.487639 -1.652785

C -0.424735  0.400529 -0.786582  0.855885  0.059894

D  2.016221 -1.314878 -1.745535 -0.907778  0.834966

test.loc['A']

Out[5]:

E   -0.833316

F   -1.982666

G    1.055594

H    0.781759

I   -0.107631

Name: A, dtype: float64

test.loc['E']

KeyError: 'the label [E] is not in the [index]'

#看见了吧，是“闭区间”

test.loc['A':'B','E':'F']

Out[8]:

          E         F

A -0.833316 -1.982666

B -1.514709 -1.422883

label切片选择时，貌似是“闭区间”，:后边的也是包含进去的。

`iloc`

.iloc is primarily integer position based (from 0 to length-1 of the axis), but may also be used with a boolean array.

iloc主要就是基于position的选择。注意了，这里的position选择是一种”左闭右开“区间，意思就是df[m:n]只选择m:n-1行的数据。

An integer e.g. 5
A list or array of integers [4, 3, 0]
A slice object with ints 1:7
A boolean array

import pandas as pd

import numpy as np

test=pd.DataFrame(np.random.randn(20).reshape(4,5),index=['A','B','C','D'],columns=['E','F','G','H','I'])

test

Out[4]:

          E         F         G         H         I

A -0.833316 -1.982666  1.055594  0.781759 -0.107631

B -1.514709 -1.422883  0.204399 -0.487639 -1.652785

C -0.424735  0.400529 -0.786582  0.855885  0.059894

D  2.016221 -1.314878 -1.745535 -0.907778  0.834966

#看见了吧，是“左闭右开”区间呀！

test.iloc[0:1,0:1]

Out[10]:

          E

A -0.833316

`ix`

.ix supports mixed integer and label based access. It is primarily label based, but will fall back to integer positional access unless the corresponding axis is of integer type.

ix就是一种集大成者的选择方法呀！既支持position选择，也支持label选择。主要是label选择。

import pandas as pd

import numpy as np

test=pd.DataFrame(np.random.randn(20).reshape(4,5),index=['A','B','C','D'],columns=['E','F','G','H','I'])

test

Out[4]:

          E         F         G         H         I

A -0.833316 -1.982666  1.055594  0.781759 -0.107631

B -1.514709 -1.422883  0.204399 -0.487639 -1.652785

C -0.424735  0.400529 -0.786582  0.855885  0.059894

D  2.016221 -1.314878 -1.745535 -0.907778  0.834966

#下面的`ix`是不是和`loc`作用差不多啊~

test.ix['A':'B','E':'F']

Out[12]:

          E         F

A -0.833316 -1.982666

B -1.514709 -1.422883

#下面的是和`iloc`差不多了

test.ix[0:1,0:1]

Out[11]:

          E

A -0.833316

但是需要注意的是，当index或者columns是整数时，ix索引其实是按label选择的，因此，是闭区间的。

参考

发现还是官方文档说的最详细啊！希望以后有机会多看看这里的内容~

官方文档-Indexing and Selecting Data ↩︎

Python 数据分析 - 索引和选择数据的更多相关文章

pandas 学习第14篇：索引和选择数据
数据框和序列结构中都有轴标签,轴标签的信息存储在Index对象中,轴标签的最重要的作用是: 唯一标识数据,用于定位数据用于数据对齐获取和设置数据集的子集. 本文重点关注如何对序列(Series)和 ...
Pandas索引和选择数据
在本章中,我们将讨论如何切割和丢弃日期,并获取Pandas中大对象的子集. Python和NumPy索引运算符"[]"和属性运算符".". 可以在广泛的用例中快 ...
Python数据分析：大众点评数据进行选址
前言本文的文字及图片来源于网络,仅供学习.交流使用,不具有任何商业用途,版权归原作者所有,如有问题请及时联系我们以作处理. 作者:砂糖侠如果你处于想学Python或者正在学习Python,Pyth ...
Pandas | 13 索引和选择数据
Pandas现在支持三种类型的多轴索引; 编号索引描述 1 .loc() 基于标签 2 .iloc() 基于整数 3 .ix() 基于标签和整数 .loc() Pandas提供了各种方法来完成基于 ...
Python数据分析之双色球高频数据统计
Step1:基础数据准备(通过爬虫获取到),以下是从第一期03年双色球开奖号到今天的所有数据整理,截止目前一共2549期,balls.txt 文件内容如下 : 备注:想要现成数据的可以给我发邮件哟~ ...
python数据分析之csv/txt数据的导入和保存
约定: import numpy as np import pandas as pd 1 2 3 一.CSV数据的导入和保存 csv数据一般格式为逗号分隔,可在excel中打开展示. 示例 data1 ...
Python 数据分析—第七章数据归整：清理、转换、合并、重塑
一.数据库风格的Dataframe合并 import pandas as pd import numpy as np df1 = pd.DataFrame({'1key':['b','b','a',' ...
python数据分析第二版：数据加载，存储和格式
一:读取数据的函数 1.读取csv文件 import numpy as np import pandas as pd data = pd.read_csv("C:\\Users\\Admin ...
Python数据分析之全球人口数据
这篇文章用pandas对全球的人口数据做个简单分析.我收集全球各国1960-2019年人口数据,包含男女和不同年龄段,共6个文件. pop_total.csv: 各国每年总人口 pop_female. ...

随机推荐

Oralce安装、使用过程中出现的问题
OracleDBControl启动失败Unable to determine local host from URL REPOSITORY_URL=http://your-url.co 解决方法打开 ...
uvm设计分析——field automation
uvm中的field_automation主要实现了class中的基础元素的copy,compare等函数, 实现方式分为两种:1)用户注册,field系列宏:uvm内部调用static status ...
<9>cc.Sprite组件
1.精灵精灵(Sprite)是Cocos系列的核心概念之一,是Cocos Creator最常用的显示图像的组件. 游戏中显示一个图片,我们就可以把这个叫做”精灵” sprite,这只是简单理解概念. ...
html5-盒子模型
/*div{background: green;width: 60%;padding-top: 10px;padding-right: 20px;padding-bottom: 30px;paddin ...
CSS选择符-----属性选择符
Element[att] 选择具有att属性的E元素 <!DOCTYPE html> <html> <head> <meta charset=" ...
pymysql 数据库操控
模块安装 pip install pymysql 执行sql语句 import pymysql #添加数据 conn = pymysql.connect(host='127.0.0.1', port= ...
Redis入门——安装与基本命令
1. Redis安装下载地址:https://github.com/MSOpenTech/redis/releases 下载zip文件后直接解压 2. 启动Redis服务端解压目录下执行redis ...
Java基础整理
一.Java中的遍历 1.在java开发中会碰到遍历List删除其中多个元素的情况,如果使用一般的for循环以及增强的for循环,代码会抛出异常ConcurrentModificationExcept ...
VI编辑器常用命令
Linux下的文本编辑器有很多种,vi 是最常用的,也是各版本Linux的标配.注意,vi 仅仅是一个文本编辑器,可以给字符着色,可以自动补全,但是不像 Windows 下的 word 有排版功能. ...
【linux应用】将一个大文件按行拆分成小文件
例如将一个BLM.txt文件分成前缀为 BLM_ 的1000个小文件,后缀为系数形式,且后缀为4位数字形式先利用 wc -l BLM.txt #读出BLM.txt有多少行. 再利用 split 命令 ...

Python 数据分析 - 索引和选择数据

loc,iloc,ix三者间的区别和联系

loc

iloc

ix