I'm just trying to do a simple nearest-neighbor classification, but I'm confused by this error:
neigh = KNeighborsClassifier(n_neighbors=3)
neigh.fit(X_train, y_train)
This produces:
ValueError: Found input variables with inconsistent numbers of samples: [489, 1890]
Can anyone explain what I'm missing?
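The error is telling you that X_train and y_train do not contain the same number of samples. Recheck your train/test split and make sure you are doing it correctly. For example, if you use sklearn.model_selection.train_test_split, you would do this: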
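from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
X_train, X_test, y_train, y_test = train_test_split(X, y)
neigh = KNeighborsClassifier(n_neighbors=3)
neigh.fit(X_train, y_train)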
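A quick way to confirm the mismatch is to compare the shapes right before calling fit. The following is only a sketch on synthetic data: the array sizes, the random seed, and random_state=0 are assumptions made for illustration and do not come from the question.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in data (hypothetical sizes): 100 samples, 4 features.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 4))
y = rng.integers(0, 2, size=100)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The first dimension is the number of samples; it must match between X and y.
print(X_train.shape, y_train.shape)
assert X_train.shape[0] == y_train.shape[0]

neigh = KNeighborsClassifier(n_neighbors=3)
neigh.fit(X_train, y_train)
```

If the two first dimensions differ in your own code, the problem is upstream of the classifier, most likely in how the split was made or unpacked.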
Please check the order of the variables you unpack from train_test_split. It should be exactly the order below:
X_train, X_test, y_train, y_test = train_test_split(X, y)
Unpacking in the wrong order produces the error "ValueError: Found input variables with inconsistent numbers of samples ..."
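To see why the order matters, here is a small sketch that unpacks the result in the wrong order on purpose. The synthetic data, its sizes, and the variable error_message are assumptions made for this illustration. Because train_test_split returns its outputs as X_train, X_test, y_train, y_test, the name y_train below ends up bound to the test features, which have a different number of rows than the training features.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in data (hypothetical sizes): 100 samples, 4 features.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 4))
y = rng.integers(0, 2, size=100)

# WRONG order on purpose: the second returned value is X_test,
# so "y_train" here actually holds the test features.
X_train, y_train, X_test, y_test = train_test_split(X, y, random_state=0)

try:
    KNeighborsClassifier(n_neighbors=3).fit(X_train, y_train)
    error_message = None
except ValueError as exc:
    # fit rejects inputs whose sample counts disagree.
    error_message = str(exc)

print(error_message)
```

Swapping the unpacking back to X_train, X_test, y_train, y_test should make the fit call succeed on the same data.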