0. DataFrame 的 index、columns、values
>> df = pd.DataFrame(np.arange(6).reshape(3, 2), index=['one', 'two', 'three'], columns=['a', 'b'])
>> df
a b
one 0 1
two 2 3
three 4 5
>> df.index
Index(['one', 'two', 'three'], dtype='object')
>> df.columns
Index(['a', 'b'], dtype='object')
>> df.values
array([[0, 1],
[2, 3],
[4, 5]])
1. 认识 pivot table
>> df = pd.DataFrame({'foo': ['one','one','one','two','two','two'],
'bar': ['A', 'B', 'C', 'A', 'B', 'C'],
'baz': [1, 2, 3, 4, 5, 6]})
# 通过字典的方式构造 DataFrame
>> df
bar baz foo
0 A 1 one
1 B 2 one
2 C 3 one
3 A 4 two
4 B 5 two
5 C 6 two
>> df.pivot(index='foo', columns='bar', values='baz')
bar A B C
foo
one 1 2 3
two 4 5 6
>> df.pivot(index='bar', columns='foo', values='baz')
foo one two
bar
A 1 4
B 2 5
C 3 6
可见 pivot 是对原始 dataframe 展示样式的重新组织和排列。