如何解决如何使用所有模型的 f1 分数制作新数据框?
现在我想制作一个数据框,其中我只显示所有不同模型(随机森林分类器、逻辑回归等)的“负”的 f1 分数。我怎样才能做到这一点?谢谢:)
from sklearn.model_selection import train_test_split
Independent_var = reviews_english['tokenized']
Dependent_var = reviews_english['sentiment']
IV_train,IV_test,DV_train,DV_test = train_test_split(Independent_var,Dependent_var,test_size = 0.2,random_state = 500 )
print('IV_train :',len(IV_train))
print('IV_test :',len(IV_test))
print('DV_train :',len(DV_train))
print('DV_test :',len(DV_test))
# Train model with vectorizer and classifier
# Random Forest
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.svm import LinearSVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import classification_report
from sklearn.pipeline import Pipeline
from sklearn.metrics import confusion_matrix,classification_report
tvec = TfidfVectorizer()
RandomForestClassifier = RandomForestClassifier()
LogisticRegression = LogisticRegression(solver = "lbfgs")
LinearSVC = LinearSVC()
KNeighborsClassifier = KNeighborsClassifier(n_neighbors=5)
clf_list = [RandomForestClassifier,LogisticRegression,LinearSVC,KNeighborsClassifier]
for n,clf in enumerate(clf_list):
model = Pipeline([('vectorizer',tvec),('classifier',clf)])
# Model learning
model.fit(IV_train,DV_train)
# Model prediction on training and test data
pred_train = model.predict(IV_train)
pred_test = model.predict(IV_test)
# Performances
report_training = classification_report(DV_train,pred_train)
report_test = classification_report(DV_test,pred_test)
print("*************************" " Training report",clf,"*********************")
print(report_training)
print("*************************" " Test report","*********************")
print(report_test)
# Test confusion matrix
confusion_matrix(pred_test,DV_test) ```
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。