微信公众号搜"智元新知"关注
微信扫一扫可直接关注哦!

使用 Pandas 获取堆叠瀑布图

如何解决使用 Pandas 获取堆叠瀑布图

我需要您的支持来创建瀑布图。已发布的脚本使用熊猫创建了一个瀑布(桥)图。为了在图表中显示更多细节,我将添加一个额外的字典 (data_stacked) 来创建一个堆叠的电子瀑布图。有人知道如何修改代码以获得堆叠瀑布图吗? 如前所述,我将为堆叠的字典添加一个额外的字典(data 和 data_stacked),请参阅代码第 13 行。 这是脚本:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from matplotlib.ticker import FuncFormatter
def money(x,pos):
        'The two args are the value and tick position'
        return "${:,.0f}".format(x)
    formatter = FuncFormatter(money)
    
    #Data to plot. Do not include a total,it will be calculated
    index = ['sales','returns','credit fees','rebates','late charges','shipping']
    data = {'amount': [350000,-30000,-7500,-25000,95000,-7000]}
    # data_stacked = {'amount2': [350000,-7000]}
    #Store data and create a blank series to use for the waterfall
    trans = pd.DataFrame(data=data,index=index)
    blank = trans.amount.cumsum().shift(1).fillna(0)
    
    #Get the net total number for the final element in the waterfall
    total = trans.sum().amount
    trans.loc["net"]= total
    blank.loc["net"] = total
    
    #The steps graphically show the levels as well as used for label placement
    step = blank.reset_index(drop=True).repeat(3).shift(-1)
    step[1::3] = np.nan
    
    #When plotting the last element,we want to show the full bar,#Set the blank to 0
    blank.loc["net"] = 0
    
    #Plot and label
    my_plot = trans.plot(kind='bar',stacked=True,bottom=blank,legend=None,figsize=(10,5),title="2014 Sales Waterfall")
    my_plot.plot(step.index,step.values,'k')
    my_plot.set_xlabel("Transaction Types")
    
    #Format the axis for dollars
    my_plot.yaxis.set_major_formatter(formatter)
    
    #Get the y-axis position for the labels
    y_height = trans.amount.cumsum().shift(1).fillna(0)
    
    #Get an offset so labels don't sit right on top of the bar
    max = trans.max()
    neg_offset = max / 25
    pos_offset = max / 50
    plot_offset = int(max / 15)
    
    #Start label loop
    loop = 0
    for index,row in trans.iterrows():
        # For the last item in the list,we don't want to double count
        if row['amount'] == total:
            y = y_height[loop]
        else:
            y = y_height[loop] + row['amount']
        # Determine if we want a neg or pos offset
        if row['amount'] > 0:
            y += pos_offset
        else:
            y -= neg_offset
        my_plot.annotate("{:,.0f}".format(row['amount']),(loop,y),ha="center")
        loop+=1
    
    #Scale up the y axis so there is room for the labels
    my_plot.set_ylim(0,blank.max()+int(plot_offset))
    #Rotate the labels
    my_plot.set_xticklabels(trans.index,rotation=0)

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。