Scan timing: clone 3.28s · analysis 9.69s · 9.2 MB · GitHub API rate-limit (preflight)
https://github.com/LiveBench/LiveBench
· scanned 2026-06-05 21:06 UTC (4 days, 11 hours ago)
· 10 languages
475 raw signals (295 security + 180 graph) 32nd percentile · Python · medium (20-100K LoC) System graph score 85 (lower by 43)
Last scanned 4 days, 11 hours ago · v2 · 232 actionable findings from 2 signal sources. 153 repeated signals grouped for readability. Security checks, system graph analysis, and verified AI-agent feedback are merged into one review queue.
| Component | Sub-score | Weight | Contribution |
|---|---|---|---|
structure_score |
60.0 | 0.15 | 9.00 |
security_score |
30.0 | 0.25 | 7.50 |
testing_score |
20.0 | 0.20 | 4.00 |
documentation_score |
81.0 | 0.15 | 12.15 |
practices_score |
40.0 | 0.15 | 6.00 |
code_quality |
27.3 | 0.10 | 2.73 |
| Overall | 1.00 | 41.4 |
Showing 197 of 232 actionable findings. 385 raw detector signals were grouped into reader-sized issues. Click TP / FP to vote on a finding's accuracy — votes adjust the confidence weighting and improve detection across the platform.
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/process_results/math/AMPS_Hard/utils.py:49, 98 (2 hits)livebench/code_runner/eval/__init__.py:240livebench/if_runner/instruction_following_eval/instructions.py:162livebench/process_results/math/olympiad/utils.py:63livebench/process_results/util.py:7livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/agentic_code_runner/minisweagent/agents/interactive.py:73livebench/agentic_code_runner/minisweagent/run/run_batch.py:209livebench/agentic_code_runner/minisweagent/run_inference.py:189livebench/agentic_code_runner/eval/harness/repos/javascript/axios/axios.py:59livebench/agentic_code_runner/eval/harness/repos/javascript/sveltejs/svelte.py:51livebench/agentic_code_runner/eval/harness/repos/typescript/ant_design/ant_design.py:52livebench/agentic_code_runner/minisweagent/environments/local.py:23
livebench/agentic_code_runner/minisweagent/environments/docker.py:106
livebench/if_runner/ifbench/evaluation_lib.py:45livebench/if_runner/instruction_following_eval/evaluation_main.py:191livebench/scripts/answer_csv_to_jsonl.py:11livebench/process_results/reasoning/logic_with_navigation/utils.py:28
livebench/agentic_code_runner/eval/harness/repos/c/mruby/mruby.py:423
livebench/agentic_code_runner/minisweagent/run/batch_progress.py:111, 140, 155, 174, 175, 176, 178, 181, +1 more (9 hits)livebench/agentic_code_runner/minisweagent/agents/interactive.py:47, 57, 58, 63, 75, 86, 87 (7 hits)livebench/code_runner/eval/utils.py:191, 192, 195, 196 (4 hits)livebench/agentic_code_runner/minisweagent/agents/replay.py:62, 78, 79 (3 hits)livebench/agentic_code_runner/minisweagent/environments/docker.py:110livebench/agentic_code_runner/minisweagent/run/run_batch.py:48livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/agentic_code_runner/eval/harness/instance.py:56livebench/agentic_code_runner/eval/harness/repos/c/OpenMathLib/OpenBLAS.py:229livebench/agentic_code_runner/eval/harness/repos/c/facebook/zstd.py:230livebench/agentic_code_runner/eval/harness/repos/c/fluent/fluentbit.py:282livebench/agentic_code_runner/eval/harness/repos/c/jqlang/jq.py:237livebench/agentic_code_runner/eval/harness/repos/c/libgit2/libgit2.py:402livebench/agentic_code_runner/eval/harness/repos/c/libsdlorg/SDL.py:229livebench/agentic_code_runner/eval/harness/repos/c/mruby/mruby.py:368livebench/agentic_code_runner/eval/harness/report.py:216
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/eval/__init__.py:158
Exec used
livebench/process_results/math/integrals_with_game/utils.py:122
livebench/process_results/data_analysis/tablereformat/utils.py:15
livebench/agentic_code_runner/minisweagent/environments/docker.py:106livebench/agentic_code_runner/minisweagent/environments/local.py:23livebench/code_runner/eval/utils.py:201livebench/lcb_runner/evaluation/compute_code_generation_metrics.py:29livebench/scripts/check_grading_flakiness.py:111livebench/scripts/edit_questions.py:138livebench/scripts/check_grading_flakiness.py:43
livebench/scripts/inspect_agentic_traj.py:141, 144, 181 (3 hits)livebench/code_runner/eval/__init__.py:182, 346 (2 hits)livebench/model/completions.py:231, 524 (2 hits)livebench/scripts/check_grading_flakiness.py:45, 56 (2 hits)livebench/scripts/edit_questions.py:144, 184 (2 hits)livebench/scripts/replay_agent_trajectory.py:79, 373 (2 hits)livebench/agentic_code_runner/minisweagent/run_inference.py:233livebench/code_runner/eval/utils.py:236livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/agentic_code_runner/eval/harness/report.py:303livebench/lcb_runner/evaluation/compute_code_generation_metrics.py:157livebench/lcb_runner/evaluation/pass_k_utils.py:26livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/agentic_code_runner/eval/harness/repos/javascript/Automattic/mongoose.py:98livebench/agentic_code_runner/eval/harness/repos/javascript/axios/axios.py:60livebench/agentic_code_runner/eval/harness/repos/javascript/sveltejs/svelte.py:52livebench/agentic_code_runner/eval/harness/repos/typescript/ant_design/ant_design.py:58livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/if_runner/instruction_following_eval/requirements.txt:1, 2, 3, 4 (4 hits)livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/agentic_code_runner/minisweagent/environments/docker.py:106
Subprocess shell true
livebench/agentic_code_runner/minisweagent/environments/extra/swerex_docker.py:33
Subprocess shell true
livebench/agentic_code_runner/minisweagent/environments/local.py:25
Subprocess shell true
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/code_runner/requirements_eval.txt
livebench/agentic_code_runner/eval/harness/repos/c/valkey_io/valkey.py:7, 18, 25, 98, 175 (5 hits)livebench/agentic_code_runner/eval/harness/repos/c/ponylang/ponyc.py:7, 18, 25, 342 (4 hits)livebench/agentic_code_runner/eval/harness/repos/c/redis/redis.py:1, 18, 175, 211 (4 hits)livebench/agentic_code_runner/eval/harness/repos/c/libgit2/libgit2.py:7, 18, 72 (3 hits)livebench/agentic_code_runner/eval/harness/repos/c/mruby/mruby.py:7, 125, 174 (3 hits)livebench/agentic_code_runner/eval/harness/repos/c/jqlang/jq.py:1, 18 (2 hits)livebench/agentic_code_runner/eval/harness/repos/c/libsdlorg/SDL.py:7, 87 (2 hits)livebench/agentic_code_runner/eval/harness/repos/c/php/phpsrc.py:7, 88 (2 hits)livebench/code_runner/requirements_eval.txt
livebench/lcb_runner/evaluation/compute_code_generation_metrics.py:29
Debug true
livebench/scripts/check_grading_flakiness.py:111
Debug true
livebench/scripts/edit_questions.py:138
Debug true
repo-level (12 hits)repo-level (3 hits)repo-level (2 hits)livebench/agentic_code_runner/eval/harness/run_evaluation.py:566
livebench/code_runner/eval/__init__.py:51
livebench/show_livebench_result.py:226
livebench/code_runner/eval/__init__.py:254
livebench/agentic_code_runner/eval/harness/instance.py:33
livebench/code_runner/eval/__init__.py:101
livebench/common.py:404
livebench/common.py:395
livebench/gen_ground_truth_judgment.py:507
livebench/agentic_code_runner/minisweagent/run/batch_progress.py:183
livebench/agentic_code_runner/minisweagent/run/run_batch.py:100
livebench/scripts/syntax_error_finder.py:119
livebench/code_runner/eval/utils.py:268
livebench/agentic_code_runner/eval/utils/fs_utils.py:36
livebench/scripts/rerun_failed_questions.py:77
livebench/agentic_code_runner/eval/harness/run_evaluation.py:725
livebench/scripts/check_question_variance.py:59
livebench/code_runner/eval/utils.py:204
livebench/code_runner/eval/utils.py:146
livebench/code_runner/eval/utils.py:198
livebench/code_runner/eval/utils.py:158
livebench/code_runner/eval/utils.py:164
livebench/code_runner/eval/utils.py:170
livebench/code_runner/eval/utils.py:152
livebench/code_runner/eval/__init__.py:351
livebench/code_runner/eval/__init__.py:341
livebench/code_runner/eval/__init__.py:112
livebench/agentic_code_runner/minisweagent/run/run_batch.py:75
This page is publicly accessible at:
https://repobility.com/scan/285d8c54-1310-4654-8c87-9d14ef632d84/
To check status programmatically (no auth required):
curl -s https://repobility.com/api/v1/public/scan/285d8c54-1310-4654-8c87-9d14ef632d84/
Important — please don't re-submit the same URL repeatedly. The submission endpoint is idempotent: re-submitting the same git URL returns this same scan_token, not a new one. To re-scan this repo, sign up free and use the dashboard.