
Python: "not in index" error with one-hot encoding

How to fix the "not in index" error with one-hot encoding in Python

I am working with the Titanic dataset and have applied one-hot encoding to all columns, so every value is 0 or 1.

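However, I get the following error during the training run. I have tried many things but cannot find a solution. I have described the error as clearly as I can, and I hope the problem is easy to follow. Thank you for your time.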

Traceback (most recent call last):
  File "titanic_classifier.py",line 277,in <module>
    fire.Fire()
  File "C:\Users\PC\Anaconda3\lib\site-packages\fire\core.py",line 138,in Fire
    component_trace = _Fire(component,args,parsed_flag_args,context,name)
  File "C:\Users\PC\Anaconda3\lib\site-packages\fire\core.py",line 468,in _Fire
    target=component.__name__)
  File "C:\Users\PC\Anaconda3\lib\site-packages\fire\core.py",line 672,in _CallAndUpdateTrace
    component = fn(*varargs,**kwargs)
  File "titanic_classifier.py",line 220,in train
    n_sample          =  nsample if nsample is not None else m['n_sample'])
  File "C:\Users\PC\Desktop\x\source\run_train.py",line 248,in run_train
    dfXy,dfXytest   = train(model_dict,dfXy,cols,post_process_fun)
  File "C:\Users\PC\Desktop\x\source\run_train.py",line 125,in train
    data_pars['train'] = {'Xtrain' : dfX[colsX].iloc[:itrain,:],
  File "C:\Users\PC\Anaconda3\lib\site-packages\pandas\core\frame.py",line 2908,in __getitem__
    indexer = self.loc._get_listlike_indexer(key,axis=1,raise_missing=True)[1]
  File "C:\Users\PC\Anaconda3\lib\site-packages\pandas\core\indexing.py",line 1254,in _get_listlike_indexer
    self._validate_read_indexer(keyarr,indexer,axis,raise_missing=raise_missing)
  File "C:\Users\PC\Anaconda3\lib\site-packages\pandas\core\indexing.py",line 1304,in _validate_read_indexer
    raise KeyError(f"{not_found} not in index")
KeyError: "['space_svd_0','interaction_svd_1','transit_svd_0','summary_svd_1','last_scraped_month','neighborhood_overview_svd_0','host_name_svd_0','house_rules_svd_1','house_rules_svd_0','first_review_month','transit_svd_1','last_review_year','interaction_svd_0','first_review_day','last_scraped_day','summary_svd_0','access_svd_1','first_review_year','notes_svd_0','name_svd_1','space_svd_1','neighborhood_overview_svd_1','host_since_year','amenities_svd_1','amenities_svd_0','host_about_svd_0','host_name_svd_1','notes_svd_1','name_svd_0','last_review_day','last_scraped_year','description_svd_0','host_since_day','last_review_month','access_svd_0','host_about_svd_1','host_since_month','description_svd_1'] not in index"
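For context, pandas raises this KeyError when a list of column labels is passed to a DataFrame with square brackets and at least one of those labels is not among the frame's columns. The sketch below reproduces the error on a tiny made-up frame and lists the absent labels. The frame contents and the find_missing_columns helper are hypothetical and are not taken from the original script.

```python
import pandas as pd


def find_missing_columns(df, cols):
    """Return the labels in cols that are not columns of df, in order."""
    present = set(df.columns)
    return [c for c in cols if c not in present]


# Toy stand-ins for dfX and colsX (hypothetical data, not the real dataset).
dfX = pd.DataFrame({"Sex_male": [1, 0, 1], "Pclass_3": [0, 1, 1]})
colsX = ["Sex_male", "Pclass_3", "name_svd_0"]

# List the requested labels that the frame does not contain.
missing = find_missing_columns(dfX, colsX)
print(missing)

# Selecting with a list that contains an absent label raises KeyError.
try:
    dfX[colsX]
except KeyError as err:
    print("KeyError:", err)
```

Running the same check on the real dfX and colsX just before the failing line would show whether the listed names were never created or were dropped somewhere earlier in the pipeline.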

The error comes from this part of run_train.py:

data_pars['train'] = {'Xtrain' : dfX[colsX].iloc[:itrain,:],
                      'ytrain' : dfX[coly].iloc[:itrain],
                      'Xtest'  : dfX[colsX].iloc[itrain:ival,:],
                      'ytest'  : dfX[coly].iloc[itrain:ival],
                      'Xval'   : dfX[colsX].iloc[ival:,:],
                      'yval'   : dfX[coly].iloc[ival:]
                      }
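One way to make this kind of failure easier to diagnose is to validate the feature list against the frame's columns before slicing. The following is a minimal sketch, assuming dfX, colsX, coly, itrain and ival mean what the snippet suggests (a feature frame, feature names, a target name, and row offsets for the train/test/validation boundaries). The split_frame helper and the toy data are hypothetical.

```python
import pandas as pd


def split_frame(dfX, colsX, coly, itrain, ival):
    """Split dfX by row position after checking that every feature exists."""
    missing = [c for c in colsX if c not in dfX.columns]
    if missing:
        # Fail early with a short message instead of a long pandas KeyError.
        raise ValueError(
            f"{len(missing)} feature column(s) missing from dfX: {missing}"
        )
    X, y = dfX[colsX], dfX[coly]
    return {
        "Xtrain": X.iloc[:itrain, :], "ytrain": y.iloc[:itrain],
        "Xtest": X.iloc[itrain:ival, :], "ytest": y.iloc[itrain:ival],
        "Xval": X.iloc[ival:, :], "yval": y.iloc[ival:],
    }


# Toy data standing in for the one-hot encoded frame (hypothetical).
df = pd.DataFrame({"a": [0, 1, 0, 1], "b": [1, 0, 0, 1], "y": [0, 1, 1, 0]})
parts = split_frame(df, ["a", "b"], "y", itrain=2, ival=3)
print({name: part.shape for name, part in parts.items()})
```

Raising early does not fix the underlying mismatch, but it points directly at the names in colsX that the preprocessing step did not produce.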
