我使用GridSearchCV
找到RandomForestClassifier
Sklearn:讓使用GridSearchCV
這裏最優參數的所有核心是部分代碼:
clf = RandomForestClassifier(n_jobs=-1)
param_grid = {"max_depth": [3, None],
"max_features": [1, 3, 10],
"min_samples_split": [2, 3, 10],
"min_samples_leaf": [1, 3, 10],
"bootstrap": [True, False],
"criterion": ["gini", "entropy"]}
# run grid search
grid_search = GridSearchCV(clf, param_grid=param_grid, n_jobs=-1)
start = time.time()
grid_search.fit(X_train, y_train)
print("GridSearchCV took %.2f seconds for %d candidate parameter settings."
% (time.time() - start, len(grid_search.cv_results_['params'])))
我跑32核心服務器上的代碼,但使用htop
我看到只有大約8個內核正在使用,所以我的問題是如何啓用所有內核?
將'n_jobs'明確設置爲32? –
@cᴏʟᴅsᴘᴇᴇᴅ與'n_jobs = -1'相同的效果 – mrgloom