How to fix MultiLabelBinarizer() fitting a column but not transforming its values
match_id innings batsmen bowlers
335982 1 [BB McCullum,DJ Hussey,Mohammad Hafeez,RT P... [AA Noffke,CL White,JH Kallis,P Kumar,SB J... 67.0 Kolkata Knight Riders Royal Challengers Bangalore M Chinnaswamy Stadium
2 [AA Noffke,B Akhil,MV B... [AB Agarkar,AB Dinda,I Sharma,LR Shukla,SC... 28.0 Royal Challengers Bangalore Kolkata Knight Riders M Chinnaswamy Stadium
335989 1 [JDP Oram,MEK Hussey,ML Hayden,MS Dhoni,PA... [A Nehra,AM Nayar,DJ Bravo,DS Kulkarni,Har... 60.0 Chennai Super Kings Mumbai Indians MA Chidambaram Stadium
2 [A Nehra,Harbhajan Singh,... [JDP Oram,Joginder Sharma,M Muralitharan,MS... 54.0 Mumbai Indians Chennai Super Kings MA Chidambaram Stadium
This is my DataFrame, "concat_data".
I applied MultiLabelBinarizer() like this:
from sklearn.preprocessing import MultiLabelBinarizer
lb = MultiLabelBinarizer()
lb.fit_transform(concat_data['batsmen'])
Its output is:
array([[0,...,0],[0,1],0]])
That is correct, but when I check concat_data.head(), it still shows the unencoded data:
match_id innings batsmen bowlers
335982 1 [BB McCullum,MS... 54.0 Mumbai Indians Chennai Super Kings MA Chidambaram Stadium
Can anyone help me understand why MultiLabelBinarizer() is not working on the "batsmen" column? I also checked:
lb.classes_
It gives the following output:
array(['A Ashish Reddy','A Chandila','A Chopra','A Choudhary','A Dananjaya','A Flintoff','A Kumble','A Mishra','A Mithun','A Mukund','A Nehra','A Nel','A Nortje','A Singh','A Symonds','A Uniyal','AA Chavan','AA Jhunjhunwala','AA Noffke','AB Agarkar','AB Barath','AB Dinda','AB McDonald','AB de Villiers','AC Blizzard','AC Gilchrist','AC Voges','AD Hales','AD Mascarenhas',....
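A likely explanation, offered as a sketch rather than a confirmed diagnosis: fit_transform() returns a new NumPy array and does not modify concat_data in place, so concat_data.head() keeps showing the original lists unless the result is assigned back. The example below uses a small hypothetical stand-in for concat_data (three made-up rows, an "innings" column, and a "batsmen_" column prefix chosen only for illustration) to show one way of attaching the encoded columns.

```python
import pandas as pd
from sklearn.preprocessing import MultiLabelBinarizer

# Hypothetical stand-in for concat_data; the real frame has more rows and columns.
concat_data = pd.DataFrame(
    {
        "innings": [1, 2, 1],
        "batsmen": [
            ["BB McCullum", "DJ Hussey"],
            ["AA Noffke", "B Akhil"],
            ["MS Dhoni", "AA Noffke"],
        ],
    }
)

lb = MultiLabelBinarizer()
# fit_transform returns a new 0/1 array; concat_data itself is left untouched.
encoded = lb.fit_transform(concat_data["batsmen"])

# Wrap the array in a DataFrame: one column per class, same index as the source.
# The "batsmen_" prefix is an illustrative choice to avoid name clashes
# if another list column (e.g. bowlers) is encoded the same way.
encoded_df = pd.DataFrame(
    encoded,
    columns=[f"batsmen_{name}" for name in lb.classes_],
    index=concat_data.index,
)

# Swap the list column for the encoded columns in a new frame.
result = concat_data.drop(columns="batsmen").join(encoded_df)
print(result.head())
```

Reusing the original index when building encoded_df keeps the rows aligned on join, which should also hold if the real frame is indexed by match_id and innings.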