如何解决Statsmodel MN Logit 回归完成退出代码 -1066598273
我有一个大数据框 [870MB,shape=(938050,31)],我正在尝试运行带有 statsmodels 的多项 Logit 回归。 当我运行回归行时,直到我得到
进程完成,退出代码 -1066598273 (0xC06D007F)
我成功地在 10k 行上运行,但我想检查整个 df。 我该怎么做才能有效地运行此流程?
感谢您的帮助!
代码行:
import statsmodels.api as st
import pandas as pd
import os
writer = pd.ExcelWriter(path=os.path.join(export_path,f'regression.xlsx'),engine='xlsxwriter')
vars_matrix_df = pd.read_csv(data_path,skipinitialspace=True)
corr_cols = ['sales_vs_service','agent_experience','minutes_passed_since_shift_started','stage_in_conv','current_cust_wait_time','prev_cust_line_words','total_cust_words_in_conv','agent_total_turns','sentiment_score','max_sentiment','min_sentiment','last_sentiment','agent_response_time','customer_response_rate','is_last_cust_answered','conversation_opening','queue_length','total_lines_from_rep','agent_number_of_conversations','concurrency','rep_shift_start_time','first_cust_line_num_of_words','queue_wait_time','day_of_week','time_of_day']
reg_equation = st.formula.mnlogit(f'visitor_was_answered ~C(day_of_week)+C(time_of_day)+{"+".join(corr_cols)} ',vars_matrix_df).fit()
reg_summary_as_html = vars_summary.tables[0].as_html()
reg_summary_df = (pd.read_html(reg_summary_as_html,header=0,index_col=0)[0]).reset_index()
reg_summary_df.to_excel(writer,sheet_name='reg_summary_results',index=False)
writer.save()
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。