Pandas - 随笔分类 - 做梦当财神

pandas 中的read_csv、read_fwf、read_table方法读取数据

摘要：pandas读取文本文件数据的常用方法：方法描述返回数据 read_csv 读取csv文件 DataFrame或TextParser read_fwf 读取表格或固定宽度格式的文本行到数据框 DataFrame或TextParser read_table 读取通用分隔符分割的数据文件到数据框阅读全文

posted @ 2021-08-02 21:02 做梦当财神阅读(4215) 评论(0) 推荐(0)

pandas.DataFrame.ewm()

摘要：DataFrame.ewm(self,com=None,halflife=None, alpha=None, min_periods=0, adjust=True, ignore_na=False, axis=0) 提供指数加权平均。返回值 DataFrame 参数 com：float，可选根据质阅读全文

posted @ 2021-07-31 19:36 做梦当财神阅读(3508) 评论(0) 推荐(0)

pandas.MultiIndex

摘要：分层/多级索引能在较低纬度的数据结构（如Series和DataFrame）中存储和操作任意维度的数据， 1. 创建MultiIndex MultiIndex对象是标准索引Index对象的扩展，可以将MultiIndex看作一个元组数组，其中每个元组都是唯一的。可以从数组列表（MultiIndex.f 阅读全文

posted @ 2020-07-21 16:21 做梦当财神阅读(1603) 评论(0) 推荐(0)

字符串的合并（str.cat()）

摘要：字符串的合并，主要有4种方法： 1. 使用“+”组合字符串例如：输入x='a'+'b'得到x的值是‘ab’。 2. 使用%占位符组合字符串例如：输入x='I am %s'%'Tony'，得到x的值是‘I am Tony’。 3. 使用.join方法将多个可迭代对象合并例如：输入x=' '.jo 阅读全文

posted @ 2020-07-02 18:49 做梦当财神阅读(1633) 评论(0) 推荐(0)

pandas.cut

摘要：用途 pandas.cut用来把一组数据分割成离散的区间。比如一组年龄数据，pandas.cut将年龄分割成不同的年龄段并打上标签。原型 pandas.cut(x, bins, right=True, labels=None, retbins=False, precision=3,include_ 阅读全文

posted @ 2020-07-01 12:58 做梦当财神阅读(454) 评论(0) 推荐(0)

to_datetime 以及 dt.days、dt.months

摘要：Series类型的数据，经过 to_datetime 之后就可以用 pandas.Series.dt.days 和 pandas.Series.pd.month。除了 days 和 month 外，还包括 date、dayofweek、dayofyear、days_in_month、freq、ho 阅读全文

posted @ 2019-08-19 21:10 做梦当财神阅读(5436) 评论(0) 推荐(0)

pandas 中的 reset_index()

摘要：数据清洗时，会将带空值的行删除，此时DataFrame或Series类型的数据不再是连续的索引，可以使用reset_index()重置索引。 import pandas as pd import numpy as np df = pd.DataFrame(np.arange(20).reshape( 阅读全文

posted @ 2019-07-23 09:42 做梦当财神阅读(104149) 评论(1) 推荐(12)

pandas-数据类型转换

摘要：1. Pandas数据类型 pandas做数据处理，经常用到数据转换，得到正确类型的数据。 pandas与numpy之间的数据对应关系。重点介绍object，int64，float64，datetime64，bool等几种类型，category与timedelta两种类型这里不做介绍。 Custo 阅读全文

posted @ 2019-07-21 10:56 做梦当财神阅读(29865) 评论(0) 推荐(0)

pandas分组运算（groupby）

摘要：1. groupby() 2. 聚合方法size()和count() size跟count的区别： size计数时包含NaN值，而count不包含NaN值 count() size() 来自：https://blog.csdn.net/m0_37870649/article/details/8097 阅读全文

posted @ 2019-07-08 19:59 做梦当财神阅读(12578) 评论(0) 推荐(0)

pandas中根据列的值选取多行数据

摘要：来自：https://www.cnblogs.com/everfight/p/pandas_select_rows.html 阅读全文

posted @ 2019-06-13 14:23 做梦当财神阅读(3203) 评论(0) 推荐(0)

pandas-数据的合并与拼接

摘要：Pandas包的merge、join、concat方法可以完成数据的合并和拼接，merge方法主要基于两个dataframe的共同列进行合并，join方法主要基于两个dataframe的索引进行合并，concat方法是对series或dataframe进行行拼接或列拼接。 1. Merge方法 pa 阅读全文

posted @ 2019-04-29 17:23 做梦当财神阅读(77174) 评论(0) 推荐(2)

pandas中关于accessor的骚操作

摘要：来自：Python那些事 pandas中accessor功能很强大，可以将它理解为一种属性接口，通过它获得额外的方法。下面用代码和实例理解一下：对于Series数据结构使用_accessors方法，我们得到3个对象：cat, str, dt。 .cat:用于分类数据(Categorical da 阅读全文

posted @ 2018-09-28 09:38 做梦当财神阅读(1450) 评论(0) 推荐(1)

iterrows(), iteritems(), itertuples()对dataframe进行遍历

摘要：iterrows(): 将DataFrame迭代为(insex, Series)对。 itertuples(): 将DataFrame迭代为元祖。 iteritems(): 将DataFrame迭代为(列名, Series)对现有如下DataFrame数据： iterrows(): iterite 阅读全文

posted @ 2018-09-19 11:01 做梦当财神阅读(23453) 评论(0) 推荐(2)

pandas 计数 value_counts()

摘要：在pandas里面常用value_counts确认数据出现的频率。 1. Series 情况下： pandas 的 value_counts() 函数可以对Series里面的每个值进行计数并且排序。 import pandas as pd df = pd.DataFrame({'区域' : ['西安阅读全文

posted @ 2018-09-17 19:43 做梦当财神阅读(71927) 评论(2) 推荐(2)

gensim使用方法以及例子

摘要：gensim是一个Python的自然语言处理库，能够将文档根据TF-IDF，LDA，LSI等模型转换成向量模式，此外，gensim还实现了word2vec，能够将单词转换为词向量。 1. corpora和dictionary 1.1 基本概念和用法 corpora是gensim中的一个基本概念，是文阅读全文

posted @ 2018-06-16 12:29 做梦当财神阅读(10164) 评论(0) 推荐(0)

pandas display选项

摘要：来自：https://www.cnblogs.com/yesuuu/p/6100714.html 阅读全文

posted @ 2018-06-06 16:06 做梦当财神阅读(263) 评论(0) 推荐(0)

pandas.DataFrame.drop()

摘要：DataFrame.drop(labels=None, axis=0, index=None, columns=None, level=None, inplace=False, errors='raise') 参数： labels：要删除行、列的名字。 axis：默认为0，指删除行；axis=1指删阅读全文

posted @ 2018-05-01 21:52 做梦当财神阅读(610) 评论(0) 推荐(0)

pandas——ix 与 loc 与 iloc 与 icol 的区别

摘要：来自：https://blog.csdn.net/xw_classmate/article/details/51333646 来自：https://blog.csdn.net/chenKFKevin/article/details/62049060 来自：https://blog.csdn.net/ 阅读全文

posted @ 2018-05-01 20:37 做梦当财神阅读(1400) 评论(0) 推荐(0)

pandas.read_csv参数整理

摘要：读取CSV（逗号分隔）文件到DataFrame，也支持文件的部分导入和选择迭代更多帮助参见：http://pandas.pydata.org/pandas-docs/stable/io.html 参数： filepath_or_buffer：str，pathlib。str，pathlib.Path 阅读全文

posted @ 2017-11-30 16:14 做梦当财神阅读(2838) 评论(0) 推荐(0)

pandas (loc、iloc、ix)的区别

摘要：loc：通过行标签索引数据 iloc：通过行号索引行数据 ix：通过行标签或行号索引数据（基于loc和iloc的混合）使用loc、iloc、ix索引第一行数据： loc： iloc： ix：阅读全文

posted @ 2017-11-13 10:47 做梦当财神阅读(25026) 评论(0) 推荐(0)

做梦当财神

随笔分类 - Pandas

公告