Accessing values in a DataFrame after GroupBy
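import pandas as pd

# `data` is assumed to be a DataFrame with 'movieTitle' and 'rating' columns.
# Count and average the ratings per movie; the result has MultiIndex columns.
movies_stats = data.groupby('movieTitle', as_index=False).agg({'rating': ['size', 'mean']})
# Boolean masks: at least 100 ratings, and mean rating above 4
atleast_size_100 = movies_stats['rating']['size'] >= 100
atleast_rating_4 = movies_stats['rating']['mean'] > 4
# Sort by a (level 0, level 1) column tuple
data_sort = movies_stats[atleast_size_100].sort_values([('rating', 'mean')], ascending=False)
data_sort2 = movies_stats[atleast_rating_4].sort_values([('rating', 'size')], ascending=False)
print(data_sort2.head(10))
print(type(data_sort2))
# The grouping column is still a regular column because as_index=False
print(data_sort2['movieTitle'].head(10))
print(data_sort2['rating']['size'].head(10))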
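You need to pass as_index=False to groupby. With the default, as_index=True, the grouping column becomes the index of the result, so it is no longer reachable as df['column']. With as_index=False it stays a regular column and can be accessed that way.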
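The difference can be seen on a small made-up table. This is only a sketch: the toy data and the variable names (by_index, as_column, and so on) are invented for illustration and are not part of the original dataset.

```python
import pandas as pd

# Hypothetical toy ratings; column names mirror the snippet above
data = pd.DataFrame({
    "movieTitle": ["A", "A", "B", "B", "B"],
    "rating": [5, 4, 3, 5, 4],
})

# Default (as_index=True): movieTitle becomes the index of the result
by_index = data.groupby("movieTitle").agg({"rating": ["size", "mean"]})
title_is_column_default = "movieTitle" in by_index.columns.get_level_values(0)
titles_from_index = list(by_index.index)

# as_index=False: movieTitle stays a regular column
as_column = data.groupby("movieTitle", as_index=False).agg({"rating": ["size", "mean"]})
title_is_column = "movieTitle" in as_column.columns.get_level_values(0)
titles_from_column = list(as_column["movieTitle"])
sizes = list(as_column["rating"]["size"])

print(title_is_column_default, title_is_column)
print(titles_from_index, titles_from_column, sizes)
```

With the default setting the titles are still available, but through by_index.index (or after calling reset_index()) rather than through a column lookup.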