Public scan: anyone with this URL can view this analysis.

scikit-learn/scikit-learn

https://github.com/scikit-learn/scikit-learn · scanned 2026-05-15 09:54 UTC (3 weeks ago) · 10 languages

548 findings (26 legacy + 522 scanner) 52nd percentile · Python · large (100-500K LoC)

Complete repo analysis

Last scanned 3 weeks ago · v1 · 20 findings from 1 source. Findings combine the legacy security pipeline, the multi-layer engine (atlas, wiring, flows, ranked), and verified AI agent contributions.

JSON
Score breakdown · 2026-05-14-v3

Component              Sub-score   Weight   Contribution
structure_score             60.0     0.15           9.00
security_score              90.0     0.25          22.50
testing_score               95.0     0.20          19.00
documentation_score         65.0     0.15           9.75
practices_score             65.0     0.15           9.75
code_quality                50.0     0.10           5.00
Overall                              1.00          75.0
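
The Contribution column appears to be each sub-score multiplied by its weight, with the overall score as the sum of those contributions. The following is a minimal sketch under that assumption; the engine's actual formula is not documented on this page, and the names `COMPONENTS` and `overall_score` are illustrative only.

```python
import math

# Sub-scores and weights copied from the score breakdown table.
# The weighted-sum formula itself is an assumption inferred from the
# Contribution column, not a documented Repobility API.
COMPONENTS = {
    "structure_score": (60.0, 0.15),
    "security_score": (90.0, 0.25),
    "testing_score": (95.0, 0.20),
    "documentation_score": (65.0, 0.15),
    "practices_score": (65.0, 0.15),
    "code_quality": (50.0, 0.10),
}


def contributions(components):
    """Return each component's weighted contribution (sub-score * weight)."""
    return {name: score * weight for name, (score, weight) in components.items()}


def overall_score(components):
    """Sum the weighted contributions after checking the weights total 1.0."""
    total_weight = sum(weight for _, weight in components.values())
    if not math.isclose(total_weight, 1.0, abs_tol=1e-9):
        raise ValueError(f"weights sum to {total_weight}, expected 1.0")
    return sum(contributions(components).values())


print(round(overall_score(COMPONENTS), 2))  # 75.0
```

Under this reading, the lowest-scoring component (code_quality at 50.0) costs relatively little because of its 0.10 weight, while security_score carries the largest weight at 0.25.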

All 12288 nodes from the latest scan, grouped by kind. Each node is a unit the engine identified (file, function, endpoint, table…). Most users won't need this view — it's primarily for debugging the engine's graph extraction or for AI agents that want to enumerate the project structure.

Label Layer Status Path
measure_one software healthy benchmarks/bench_pca_solvers.py:47
compute_bench software healthy benchmarks/bench_plot_fastkmeans.py:10
compute_bench_2 software healthy benchmarks/bench_plot_fastkmeans.py:56
compute_bench software healthy benchmarks/bench_plot_hierarchical.py:10
compute_bench software healthy benchmarks/bench_plot_svd.py:17
unpickle software healthy benchmarks/bench_plot_randomized_svd.py:135
handle_missing_dataset software healthy benchmarks/bench_plot_randomized_svd.py:140
get_data software healthy benchmarks/bench_plot_randomized_svd.py:146
plot_time_vs_s software healthy benchmarks/bench_plot_randomized_svd.py:200
scatter_time_vs_s software healthy benchmarks/bench_plot_randomized_svd.py:224
plot_power_iter_vs_s software healthy benchmarks/bench_plot_randomized_svd.py:263
svd_timing software healthy benchmarks/bench_plot_randomized_svd.py:273
norm_diff software healthy benchmarks/bench_plot_randomized_svd.py:305
scalable_frobenius_norm_discrepancy software healthy benchmarks/bench_plot_randomized_svd.py:327
bench_a software healthy benchmarks/bench_plot_randomized_svd.py:346
bench_b software healthy benchmarks/bench_plot_randomized_svd.py:401
bench_c software healthy benchmarks/bench_plot_randomized_svd.py:447
load_data software healthy benchmarks/bench_tsne_mnist.py:36
nn_accuracy software healthy benchmarks/bench_tsne_mnist.py:52
tsne_fit_transform software healthy benchmarks/bench_tsne_mnist.py:60
sanitize software healthy benchmarks/bench_tsne_mnist.py:65
bhtsne software dead benchmarks/bench_tsne_mnist.py:147
run_vectorizer software healthy benchmarks/bench_text_vectorizers.py:29
f software healthy benchmarks/bench_text_vectorizers.py:30
load_data software healthy benchmarks/bench_mnist.py:57
load_data software healthy benchmarks/bench_covertype.py:75
print_outlier_ratio software healthy benchmarks/bench_online_ocsvm.py:40
autolabel_auc software healthy benchmarks/bench_online_ocsvm.py:232
autolabel_time software healthy benchmarks/bench_online_ocsvm.py:245
plot software healthy benchmarks/bench_plot_parallel_pairwise.py:12
euclidean_distances software healthy benchmarks/bench_plot_parallel_pairwise.py:38
rbf_kernels software dead benchmarks/bench_plot_parallel_pairwise.py:42
get_loss software healthy benchmarks/bench_rcv1_logreg_convergence.py:24
bench_one software healthy benchmarks/bench_rcv1_logreg_convergence.py:37
bench software healthy benchmarks/bench_rcv1_logreg_convergence.py:66
plot_train_losses software healthy benchmarks/bench_rcv1_logreg_convergence.py:102
plot_train_scores software healthy benchmarks/bench_rcv1_logreg_convergence.py:111
plot_test_scores software healthy benchmarks/bench_rcv1_logreg_convergence.py:121
plot_dloss software healthy benchmarks/bench_rcv1_logreg_convergence.py:131
get_max_squared_sum software healthy benchmarks/bench_rcv1_logreg_convergence.py:149
fit software healthy benchmarks/bench_hist_gradient_boosting_categorical_only.py…
predict software healthy benchmarks/bench_hist_gradient_boosting_categorical_only.py…
plot_results software healthy benchmarks/bench_plot_incremental_pca.py:21
benchmark software healthy benchmarks/bench_plot_incremental_pca.py:25
plot_feature_times software healthy benchmarks/bench_plot_incremental_pca.py:37
plot_feature_errors software healthy benchmarks/bench_plot_incremental_pca.py:52
plot_batch_times software healthy benchmarks/bench_plot_incremental_pca.py:66
plot_batch_errors software healthy benchmarks/bench_plot_incremental_pca.py:79
fixed_batch_size_comparison software healthy benchmarks/bench_plot_incremental_pca.py:92
variable_batch_size_comparison software healthy benchmarks/bench_plot_incremental_pca.py:115

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.
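
For readers who want to enumerate nodes from the plain-text rows on this page, the sketch below splits a row into label, layer, status, and path, then tallies rows by status. It assumes the row layout seen in these tables (whitespace-separated, with status one of `healthy`, `warning`, or `dead`); the function names and the status set are illustrative assumptions, not part of any Repobility API, and the JSON payload may well use a different shape.

```python
from collections import Counter

# Status values observed in the tables on this page (assumed exhaustive).
STATUSES = {"healthy", "warning", "dead"}


def parse_node_row(row):
    """Split 'label layer status [path[:line]]' into a dict.

    The label may contain spaces (e.g. 'gpu (detected)'), so the status
    token is located from the right and the layer is the token before it.
    """
    tokens = row.split()
    for i in range(len(tokens) - 1, 1, -1):
        if tokens[i] in STATUSES:
            path = " ".join(tokens[i + 1:]) or None
            line_no = None
            if path is not None:
                head, sep, tail = path.rpartition(":")
                if sep and tail.isdigit():
                    path, line_no = head, int(tail)
            return {
                "label": " ".join(tokens[:i - 1]),
                "layer": tokens[i - 1],
                "status": tokens[i],
                "path": path,
                "line": line_no,
            }
    raise ValueError(f"no status token found in row: {row!r}")


def count_by_status(rows):
    """Count parsed rows per status value."""
    return Counter(parse_node_row(r)["status"] for r in rows)


# Three rows copied from the function table.
sample = [
    "measure_one software healthy benchmarks/bench_pca_solvers.py:47",
    "bhtsne software dead benchmarks/bench_tsne_mnist.py:147",
    "rbf_kernels software dead benchmarks/bench_plot_parallel_pairwise.py:42",
]
print(count_by_status(sample))
```

Rows without a path, such as the CI environment variables, parse with `path` set to `None`; truncated paths ending in `…` are kept as-is.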

Label Layer Status Path
CONTRIBUTING.md software healthy CONTRIBUTING.md
.pre-commit-config.yaml software healthy .pre-commit-config.yaml
pyproject.toml software healthy pyproject.toml
Makefile software healthy Makefile
CODE_OF_CONDUCT.md software healthy CODE_OF_CONDUCT.md
SECURITY.md software healthy SECURITY.md
AGENTS.md software healthy AGENTS.md
.codecov.yml software healthy .codecov.yml
devcontainer.json software healthy .devcontainer/devcontainer.json
setup.sh software healthy .devcontainer/setup.sh
config.yml software healthy .circleci/config.yml
bench_sgd_regression.py software warning benchmarks/bench_sgd_regression.py
bench_feature_expansions.py software warning benchmarks/bench_feature_expansions.py
bench_pca_solvers.py software healthy benchmarks/bench_pca_solvers.py
bench_plot_fastkmeans.py software healthy benchmarks/bench_plot_fastkmeans.py
bench_plot_hierarchical.py software healthy benchmarks/bench_plot_hierarchical.py
bench_plot_svd.py software healthy benchmarks/bench_plot_svd.py
bench_plot_randomized_svd.py software healthy benchmarks/bench_plot_randomized_svd.py
bench_tsne_mnist.py software healthy benchmarks/bench_tsne_mnist.py
plot_tsne_mnist.py software warning benchmarks/plot_tsne_mnist.py
bench_text_vectorizers.py software healthy benchmarks/bench_text_vectorizers.py
bench_glm.py software warning benchmarks/bench_glm.py
bench_mnist.py software healthy benchmarks/bench_mnist.py
bench_covertype.py software healthy benchmarks/bench_covertype.py
bench_online_ocsvm.py software healthy benchmarks/bench_online_ocsvm.py
bench_kernel_pca_solvers_time_vs_n_samples.py software warning benchmarks/bench_kernel_pca_solvers_time_vs_n_samples.py
bench_plot_parallel_pairwise.py software healthy benchmarks/bench_plot_parallel_pairwise.py
bench_rcv1_logreg_convergence.py software healthy benchmarks/bench_rcv1_logreg_convergence.py
bench_plot_ward.py software warning benchmarks/bench_plot_ward.py
bench_hist_gradient_boosting_categorical_only.py software healthy benchmarks/bench_hist_gradient_boosting_categorical_only.py
bench_plot_polynomial_kernel_approximation.py software warning benchmarks/bench_plot_polynomial_kernel_approximation.py
bench_plot_incremental_pca.py software healthy benchmarks/bench_plot_incremental_pca.py
bench_lof.py software warning benchmarks/bench_lof.py
bench_hist_gradient_boosting_higgsboson.py software healthy benchmarks/bench_hist_gradient_boosting_higgsboson.py
bench_saga.py software healthy benchmarks/bench_saga.py
bench_random_projections.py software healthy benchmarks/bench_random_projections.py
bench_plot_lasso_path.py software healthy benchmarks/bench_plot_lasso_path.py
bench_plot_nmf.py software healthy benchmarks/bench_plot_nmf.py
bench_hist_gradient_boosting.py software healthy benchmarks/bench_hist_gradient_boosting.py
bench_isotonic.py software healthy benchmarks/bench_isotonic.py
bench_tree.py software healthy benchmarks/bench_tree.py
bench_20newsgroups.py software warning benchmarks/bench_20newsgroups.py
bench_glmnet.py software healthy benchmarks/bench_glmnet.py
bench_sparsify.py software healthy benchmarks/bench_sparsify.py
bench_plot_omp_lars.py software healthy benchmarks/bench_plot_omp_lars.py
bench_hist_gradient_boosting_adult.py software healthy benchmarks/bench_hist_gradient_boosting_adult.py
bench_isolation_forest_predict.py software healthy benchmarks/bench_isolation_forest_predict.py
bench_plot_neighbors.py software healthy benchmarks/bench_plot_neighbors.py
bench_isolation_forest.py software healthy benchmarks/bench_isolation_forest.py
bench_lasso.py software healthy benchmarks/bench_lasso.py

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

Label Layer Status Path
_PGNMF software healthy benchmarks/bench_plot_nmf.py:191
LogisticRegressionBenchmark software healthy asv_benchmarks/benchmarks/linear_model.py:20
RidgeBenchmark software healthy asv_benchmarks/benchmarks/linear_model.py:66
LinearRegressionBenchmark software healthy asv_benchmarks/benchmarks/linear_model.py:110
SGDRegressorBenchmark software healthy asv_benchmarks/benchmarks/linear_model.py:142
ElasticNetBenchmark software healthy asv_benchmarks/benchmarks/linear_model.py:178
LassoBenchmark software healthy asv_benchmarks/benchmarks/linear_model.py:219
RandomForestClassifierBenchmark software healthy asv_benchmarks/benchmarks/ensemble.py:16
GradientBoostingClassifierBenchmark software healthy asv_benchmarks/benchmarks/ensemble.py:56
HistGradientBoostingClassifierBenchmark software healthy asv_benchmarks/benchmarks/ensemble.py:95
PCABenchmark software healthy asv_benchmarks/benchmarks/decomposition.py:8
DictionaryLearningBenchmark software healthy asv_benchmarks/benchmarks/decomposition.py:33
MiniBatchDictionaryLearningBenchmark software healthy asv_benchmarks/benchmarks/decomposition.py:67
PairwiseDistancesBenchmark software healthy asv_benchmarks/benchmarks/metrics.py:7
KNeighborsClassifierBenchmark software healthy asv_benchmarks/benchmarks/neighbors.py:8
TSNEBenchmark software healthy asv_benchmarks/benchmarks/manifold.py:7
SVCBenchmark software healthy asv_benchmarks/benchmarks/svm.py:8
KMeansBenchmark software healthy asv_benchmarks/benchmarks/cluster.py:8
MiniBatchKMeansBenchmark software healthy asv_benchmarks/benchmarks/cluster.py:57
Benchmark software healthy asv_benchmarks/benchmarks/common.py:81
Estimator software healthy asv_benchmarks/benchmarks/common.py:120
Predictor software healthy asv_benchmarks/benchmarks/common.py:201
Transformer software healthy asv_benchmarks/benchmarks/common.py:230
CrossValidationBenchmark software healthy asv_benchmarks/benchmarks/model_selection.py:9
GridSearchBenchmark software healthy asv_benchmarks/benchmarks/model_selection.py:41
KernelRidge software healthy sklearn/kernel_ridge.py:21
DiscriminantAnalysisPredictionMixin software healthy sklearn/discriminant_analysis.py:177
LinearDiscriminantAnalysis software healthy sklearn/discriminant_analysis.py:255
QuadraticDiscriminantAnalysis software healthy sklearn/discriminant_analysis.py:841
_strict_mul_mixin software healthy sklearn/conftest.py:284
_strict_coo_matrix software healthy sklearn/conftest.py:335
_strict_bsr_matrix software healthy sklearn/conftest.py:338
_strict_csr_matrix software healthy sklearn/conftest.py:341
_strict_csc_matrix software healthy sklearn/conftest.py:344
_strict_dok_matrix software healthy sklearn/conftest.py:347
_strict_lil_matrix software healthy sklearn/conftest.py:350
_strict_dia_matrix software healthy sklearn/conftest.py:353
_ConstantPredictor software healthy sklearn/multiclass.py:127
OneVsRestClassifier software healthy sklearn/multiclass.py:202
OneVsOneClassifier software healthy sklearn/multiclass.py:678
OutputCodeClassifier software healthy sklearn/multiclass.py:1043
UnsetMetadataPassedError software healthy sklearn/exceptions.py:21
NotFittedError software healthy sklearn/exceptions.py:46
ConvergenceWarning software healthy sklearn/exceptions.py:68
DataConversionWarning software healthy sklearn/exceptions.py:76
DataDimensionalityWarning software healthy sklearn/exceptions.py:94
EfficiencyWarning software healthy sklearn/exceptions.py:108
FitFailedWarning software healthy sklearn/exceptions.py:119
SkipTestWarning software healthy sklearn/exceptions.py:131
UndefinedMetricWarning software healthy sklearn/exceptions.py:140

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

Label Layer Status Path
.devcontainer software healthy .devcontainer
.circleci software healthy .circleci
benchmarks software healthy benchmarks
asv_benchmarks software healthy asv_benchmarks
benchmarks software healthy asv_benchmarks/benchmarks
build_tools software healthy build_tools
github software healthy build_tools/github
circle software healthy build_tools/circle
wheels software healthy build_tools/wheels
sklearn software healthy sklearn
compose software healthy sklearn/compose
tests software healthy sklearn/compose/tests
datasets software healthy sklearn/datasets
tests software healthy sklearn/datasets/tests
descr software healthy sklearn/datasets/descr
images software healthy sklearn/datasets/images
covariance software healthy sklearn/covariance
tests software healthy sklearn/covariance/tests
tests software healthy sklearn/tests
decomposition software healthy sklearn/decomposition
tests software healthy sklearn/decomposition/tests
_loss software healthy sklearn/_loss
tests software healthy sklearn/_loss/tests
externals software healthy sklearn/externals
_packaging software healthy sklearn/externals/_packaging
array_api_extra software healthy sklearn/externals/array_api_extra
_lib software healthy sklearn/externals/array_api_extra/_lib
_utils software healthy sklearn/externals/array_api_extra/_lib/_utils
array_api_compat software healthy sklearn/externals/array_api_compat
dask software healthy sklearn/externals/array_api_compat/dask
array software healthy sklearn/externals/array_api_compat/dask/array
numpy software healthy sklearn/externals/array_api_compat/numpy
common software healthy sklearn/externals/array_api_compat/common
cupy software healthy sklearn/externals/array_api_compat/cupy
torch software healthy sklearn/externals/array_api_compat/torch
_scipy software healthy sklearn/externals/_scipy
sparse software healthy sklearn/externals/_scipy/sparse
csgraph software healthy sklearn/externals/_scipy/sparse/csgraph
_numpydoc software healthy sklearn/externals/_numpydoc
mixture software healthy sklearn/mixture
tests software healthy sklearn/mixture/tests
ensemble software healthy sklearn/ensemble
tests software healthy sklearn/ensemble/tests
_hist_gradient_boosting software healthy sklearn/ensemble/_hist_gradient_boosting
tests software healthy sklearn/ensemble/_hist_gradient_boosting/tests
experimental software healthy sklearn/experimental
tests software healthy sklearn/experimental/tests
gaussian_process software healthy sklearn/gaussian_process
tests software healthy sklearn/gaussian_process/tests
utils software healthy sklearn/utils

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

Label Layer Status Path
label-remover cicd healthy .github/workflows/cuda-label-remover.yml
analyze cicd healthy .github/workflows/codeql.yml
labeler cicd healthy .github/workflows/labeler-title-regex.yml
update_tracking_issue cicd healthy .github/workflows/update_tracking_issue.yml
check cicd healthy .github/workflows/check-changelog.yml
check_build_trigger cicd healthy .github/workflows/emscripten.yml
build_wasm_wheel cicd healthy .github/workflows/emscripten.yml
upload-wheels cicd healthy .github/workflows/emscripten.yml
lint cicd healthy .github/workflows/lint.yml
post_comment cicd healthy .github/workflows/autoclose-comment.yml
update_lock_files cicd healthy .github/workflows/update-lock-files.yml
label-blank-issues cicd healthy .github/workflows/label-blank-issue.yml
codespell cicd healthy .github/workflows/codespell.yml
circleci_artifacts_redirector_job cicd healthy .github/workflows/artifact-redirector.yml
triage cicd healthy .github/workflows/labeler-module.yml
triage_file_extensions cicd healthy .github/workflows/labeler-module.yml
add_not_ready_for_PR_warning cicd healthy .github/workflows/not-ready-for-pr-warning.yml
remove_not_ready_for_PR_warning cicd healthy .github/workflows/not-ready-for-pr-warning.yml
build_wheel cicd healthy .github/workflows/cuda-ci.yml
tests cicd healthy .github/workflows/cuda-ci.yml
bot-comment cicd healthy .github/workflows/bot-lint-comment.yml
autoclose cicd healthy .github/workflows/autoclose-schedule.yml
check-sdist cicd healthy .github/workflows/check-sdist.yml
update-tracker cicd healthy .github/workflows/check-sdist.yml
lint cicd healthy .github/workflows/unit-tests.yml
retrieve-commit-message cicd healthy .github/workflows/unit-tests.yml
retrieve-selected-tests cicd healthy .github/workflows/unit-tests.yml
unit-tests cicd healthy .github/workflows/unit-tests.yml
free-threaded cicd healthy .github/workflows/unit-tests.yml
scipy-dev cicd healthy .github/workflows/unit-tests.yml
debian-32bit cicd healthy .github/workflows/unit-tests.yml
check-job-statuses cicd healthy .github/workflows/unit-tests.yml
publish cicd healthy .github/workflows/publish_pypi.yml
post_comment cicd healthy .github/workflows/needs-decision.yml
check_build_trigger cicd healthy .github/workflows/wheels.yml
build_wheels cicd healthy .github/workflows/wheels.yml
update-tracker cicd healthy .github/workflows/wheels.yml
build_sdist cicd healthy .github/workflows/wheels.yml
upload_anaconda cicd healthy .github/workflows/wheels.yml

Label Layer Status Path
gha::cuda-label-remover cicd healthy .github/workflows/cuda-label-remover.yml
gha::codeql cicd healthy .github/workflows/codeql.yml
gha::labeler-title-regex cicd healthy .github/workflows/labeler-title-regex.yml
gha::update_tracking_issue cicd healthy .github/workflows/update_tracking_issue.yml
gha::check-changelog cicd healthy .github/workflows/check-changelog.yml
gha::emscripten cicd healthy .github/workflows/emscripten.yml
gha::lint cicd healthy .github/workflows/lint.yml
gha::autoclose-comment cicd healthy .github/workflows/autoclose-comment.yml
gha::update-lock-files cicd healthy .github/workflows/update-lock-files.yml
gha::label-blank-issue cicd healthy .github/workflows/label-blank-issue.yml
gha::codespell cicd healthy .github/workflows/codespell.yml
gha::artifact-redirector cicd healthy .github/workflows/artifact-redirector.yml
gha::labeler-module cicd healthy .github/workflows/labeler-module.yml
gha::not-ready-for-pr-warning cicd healthy .github/workflows/not-ready-for-pr-warning.yml
gha::cuda-ci cicd healthy .github/workflows/cuda-ci.yml
gha::bot-lint-comment cicd healthy .github/workflows/bot-lint-comment.yml
gha::autoclose-schedule cicd healthy .github/workflows/autoclose-schedule.yml
gha::check-sdist cicd healthy .github/workflows/check-sdist.yml
gha::unit-tests cicd healthy .github/workflows/unit-tests.yml
gha::publish_pypi cicd healthy .github/workflows/publish_pypi.yml
gha::needs-decision cicd healthy .github/workflows/needs-decision.yml
gha::wheels cicd healthy .github/workflows/wheels.yml
circleci cicd healthy .circleci/config.yml

Label Layer Status Path
CCACHE_DIR cicd healthy
SCIKIT_LEARN_NIGHTLY_UPLOAD_TOKEN cicd healthy
BOT_GITHUB_TOKEN cicd healthy
GITHUB_TOKEN cicd healthy
JOB_NAME cicd healthy
COVERAGE cicd healthy
SCIKIT_LEARN_STAGING_UPLOAD_TOKEN cicd healthy
CIRCLECI_TOKEN cicd healthy
CHECK_PYTEST_SOFT_DEPENDENCY cicd healthy
CODECOV_TOKEN cicd healthy

Label Layer Status Path
port:02 network healthy .github/workflows/autoclose-schedule.yml
port:30 network healthy .github/workflows/unit-tests.yml

Label Layer Status Path
repobility-clone-_i6jk96m software healthy /tmp/repobility-clone-_i6jk96m

Label Layer Status Path
127.0.0.1 network healthy sklearn/utils/_repr_html/tests/test_js.py

Label Layer Status Path
gpu (detected) hardware healthy build_tools/update_environments_and_lock_files.py

Label Layer Status Path
auth::maint_tools/update_tracking_issue.py security healthy maint_tools/update_tracking_issue.py
API access

This page is publicly accessible at: https://repobility.com/scan/a5f73a3d-9c26-4983-8ec3-040adfc69698/

To check status programmatically (no auth required):

curl -s https://repobility.com/api/v1/public/scan/a5f73a3d-9c26-4983-8ec3-040adfc69698/
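
The same unauthenticated request can be made from Python with the standard library. This is a minimal sketch: the URL pattern is taken from the curl command above, but the response being a JSON document is an assumption (this page does not show the payload), and `scan_status_url` and `fetch_scan_status` are hypothetical helper names.

```python
import json
import urllib.request

# Base path taken from the curl example; everything else here is illustrative.
API_BASE = "https://repobility.com/api/v1/public/scan"


def scan_status_url(scan_token):
    """Build the public status URL for a scan token (trailing slash included)."""
    return f"{API_BASE}/{scan_token}/"


def fetch_scan_status(scan_token, timeout=10.0):
    """GET the public scan status and decode it.

    Assumes the endpoint returns JSON; json.loads raises ValueError
    (json.JSONDecodeError) if it does not.
    """
    with urllib.request.urlopen(scan_status_url(scan_token), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_scan_status("a5f73a3d-9c26-4983-8ec3-040adfc69698")` would issue the same request as the curl command. Since the response schema is not described here, it is safer to inspect the returned object before relying on any particular field.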

Important: please don't re-submit the same URL repeatedly. The submission endpoint is idempotent, so re-submitting the same git URL returns this same scan_token rather than starting a new scan. To re-scan this repo, sign up free and use the dashboard.