pd.melt详解--列转行

方法详解：

pandas.melt(frame, id_vars=None, value_vars=None, var_name=None, value_name='value', col_level=None

“Unpivots” a DataFrame from wide format to long format, optionally leaving identifier variables set.

如何理解Unpivots其实就是列转行。

This function is useful to massage a DataFrame into a format where one or more columns are identifier variables (id_vars)
, while all other columns, considered measured variables (value_vars)
, are “unpivoted” to the row axis, leaving just two non-identifier columns, ‘variable’ and ‘value’.

frame : DataFrame

id_vars : tuple, list, or ndarray, optional

　　Column(s) to use as identifier variables.其实就是作为主键

value_vars : tuple, list, or ndarray, optional

　　Column(s) to unpivot. If not specified, uses all columns that are not set as id_vars.对那些字段进行转列且返回数据只有该字段的数据。

var_name : scalar

　　Name to use for the ‘variable’ column. If None it uses frame.columns.name or ‘variable’.对行专列的后，多个列名组成的列用什么名字？默认用variable

value_name : scalar, default ‘value’.

　　Name to use for the ‘value’ column.对行专列的后，多个列名下的值组成的列用什么名字？默认用value

col_level : int or string, optional

　　If columns are a MultiIndex then use this level to melt.

其实这个和我之前数仓博客hive数据仓库表设计之（矮宽表+高窄表）异曲同工：

官网案例：

import pandas as pd
df = pd.DataFrame({'A': {0: 'a', 1: 'b', 2: 'c'},
　　　　　　　　　　  'B': {0: 1, 1: 3, 2: 5},
                   'C': {0: 2, 1: 4, 2: 6}})
df

pd.melt(df, id_vars=['A'], value_vars=['B'])

pd.melt(df, id_vars=['A'], value_vars=['B','C'])

posted @ 2020-04-26 20:31 wqbin 阅读(5418) 评论(0) 收藏举报

刷新页面返回顶部

少年阿斌

人类被赋予了一种工作，那就是精神的成长。

pd.melt详解--列转行

公告