随笔档案「2021年7月28日」：python DataFrame将某一列的格式转换为指定格式 ... - Shilo

2021年7月28日

python DataFrame将某一列的格式转换为指定格式

摘要： d[d.columns[0]]=d[d.columns[0]].astype('float64') #第1列换成浮点型阅读全文

posted @ 2021-07-28 10:32 Shilo 阅读(2682) 评论(0) 推荐(0)

python DataFrame 列的重命名

摘要： data.columns = [re_colname] 比如 data.columns = ['一个列名'] data.columns = [['两个列名1','两个列名2']] 阅读全文

posted @ 2021-07-28 10:31 Shilo 阅读(1765) 评论(0) 推荐(0)

python 删除list里面的空字符

摘要： dat_lst=list(filter(None, dat_lst)) # 如果是listoflist就要多嵌套一层循环阅读全文

posted @ 2021-07-28 10:29 Shilo 阅读(124) 评论(0) 推荐(0)

python 将DataFrame转换为list

摘要： dat_lst=dat.iloc[:,1:].values.tolist() 阅读全文

posted @ 2021-07-28 10:28 Shilo 阅读(381) 评论(0) 推荐(0)

python DataFrame 简单行拼接列拼接

摘要：分别对df的行或者列进行处理后，会遇到想要把拆开的数据重新拼起来的情况这些数据具有相同的结构，只是单纯的要拼到一起，不涉及连接的关联变量。（就是R的rbind 和 cbind）df= a.append([b,c,d,e,f,g,h,i,j,k,l,m], ignore_index=False) 阅读全文

posted @ 2021-07-28 10:27 Shilo 阅读(4136) 评论(0) 推荐(0)

python DataFrame 重置INDEX

摘要： DataFrame删除某些列后会出现INDEX不连续的问题，会影响循环的运行因此会常用到将INDEX重置为从0到n df.reset_index(drop=True, inplace=True) 阅读全文

posted @ 2021-07-28 10:19 Shilo 阅读(2377) 评论(0) 推荐(0)

python dataframe 读取excel

摘要： # 使用预设数据格式使读取更快,converters={"COLlv1":str,"COLlv2":str,"COLlv3:str"} # 可加入参数限制读取的行数，nrows =10000 d1 = pd.read_excel("D:/data/data.xlsx", encoding="gbk" 阅读全文

posted @ 2021-07-28 10:17 Shilo 阅读(999) 评论(0) 推荐(0)

python dataframe 删掉某几列

摘要： dat = dat.drop(['a','b','c','d','e','f'],axis=1) 阅读全文

posted @ 2021-07-28 10:16 Shilo 阅读(693) 评论(0) 推荐(0)

python DataFrame 去掉重复行

摘要： dat = DataFrame.drop_duplicates(dat,keep='first',inplace=False) 阅读全文

posted @ 2021-07-28 10:15 Shilo 阅读(275) 评论(0) 推荐(0)

python DataFrame 读取excel文件的前n行

摘要： def read_head_xls(file,nrow): ''' 读取nrow行excel数据,并计算耗时用于读取测试数据依赖于 from time import time from xlrd import open_workbook from pandas import DataFrame 阅读全文

posted @ 2021-07-28 10:13 Shilo 阅读(1607) 评论(0) 推荐(0)

python 计算程序运行时长

摘要：计算程序运行的时间，验证优化的效果。 ①依赖于time from time import time ②在程序开始前记录当前系统时间（后面接程序运行代码） t_start=time() ③在程序结束后记录当前系统时间（前面完成了程序的运行） t_end=time() ④计算时长打印时长删除相关阅读全文

posted @ 2021-07-28 10:07 Shilo 阅读(2164) 评论(0) 推荐(0)

python DataFrame数据情况检查函数（列名、类型、非空行数、缺失比例、取值个数）

摘要： def summary(dat): ''' 求一个df的列名、每列数据类型、每列非空行数、每列缺失比例、每列取值个数用于了解原始数据情况 *依赖于 singe_df() from pandas import concat ''' dat_head = singe_df(dat.columns,'c 阅读全文

posted @ 2021-07-28 09:50 Shilo 阅读(687) 评论(0) 推荐(0)

实用主义

能起作用的代码都是好代码