Public scan — anyone with this URL can view this analysis. Sign up to track your own repos privately, run scheduled re-scans, and get AI fix prompts via your dashboard.
119 of your 194 findings came from Repobility's proprietary detections. ✓ Repobility tags below mark them.

Scan timing: clone 16.12s · analysis 9.45s · 21.6 MB · GitHub API rate-limit (preflight)

opendatalab/MinerU

https://github.com/opendatalab/MinerU · scanned 2026-06-05 08:44 UTC (5 days, 20 hours ago) · 10 languages

442 raw signals (186 security + 256 graph) 41st percentile · Python · medium (20-100K LoC) System graph score 79 (lower by 27)

UNIFIED Repobility · multi-layer engine · AI coders

Complete repo analysis

Last scanned 5 days, 20 hours ago · v2 · 159 actionable findings from 2 signal sources. 155 repeated signals grouped for readability. Security checks, system graph analysis, and verified AI-agent feedback are merged into one review queue.

JSON
Score breakdown â 2026-05-18-v5
Component Sub-score Weight Contribution
structure_score 60.0 0.15 9.00
security_score 55.4 0.25 13.85
testing_score 25.0 0.20 5.00
documentation_score 69.0 0.15 10.35
practices_score 70.0 0.15 10.50
code_quality 30.9 0.10 3.09
Overall 1.00 51.8
Severity distribution — click a segment to filter
Active filters: excluding tests × Reset all

All 2918 nodes from the latest scan, grouped by kind. Each node is a unit the engine identified (file, function, endpoint, table…). Most users won't need this view — it's primarily for debugging the engine's graph extraction or for AI agents that want to enumerate the project structure.

LabelLayerStatusPath
get_version software healthy update_version.py:6
write_version_to_commons software healthy update_version.py:20
collect_input_files software healthy demo/demo.py:16
build_form_data software healthy demo/demo.py:44
format_status_message software healthy demo/demo.py:75
prepare_local_api_temp_dir software healthy demo/demo.py:81
run_demo software healthy demo/demo.py:93
on_status_update software dead demo/demo.py:170
main software healthy demo/demo.py:202
get_covrage software healthy tests/get_coverage.py:7
delete_file software healthy tests/clean_coverage.py:8
test_pipeline_with_two_config software healthy tests/unittest/test_e2e.py:23
run_pipeline_parse software healthy tests/unittest/test_e2e.py:74
on_doc_ready software healthy tests/unittest/test_e2e.py:88
write_infer_result software healthy tests/unittest/test_e2e.py:108
validate_html software healthy tests/unittest/test_e2e.py:144
assert_content software healthy tests/unittest/test_e2e.py:152
normalizeMineruLocale software healthy mineru/resources/gradio_app.js:normalizeMineruLocale
resolveMineruLocale software healthy mineru/resources/gradio_app.js:resolveMineruLocale
localizeMineruCustomText software healthy mineru/resources/gradio_app.js:localizeMineruCustomText
getOfficePreviewNoticeIgnored software healthy mineru/resources/gradio_app.js:getOfficePreviewNoticeIgnored
setOfficePreviewNoticeIgnored software healthy mineru/resources/gradio_app.js:setOfficePreviewNoticeIgnored
findOfficePreviewNotices software healthy mineru/resources/gradio_app.js:findOfficePreviewNotices
applyOfficePreviewNoticePreference software healthy mineru/resources/gradio_app.js:applyOfficePreviewNoticePref…
refreshMineruCustomHtml software healthy mineru/resources/gradio_app.js:refreshMineruCustomHtml
findButton software healthy mineru/resources/gradio_app.js:findButton
findPopover software healthy mineru/resources/gradio_app.js:findPopover
findUploadFileInput software healthy mineru/resources/gradio_app.js:findUploadFileInput
getUploadAcceptedTypes software healthy mineru/resources/gradio_app.js:getUploadAcceptedTypes
fileMatchesAcceptedType software healthy mineru/resources/gradio_app.js:fileMatchesAcceptedType
name software healthy mineru/resources/gradio_app.js:name
type software healthy mineru/resources/gradio_app.js:type
buildClipboardFileName software healthy mineru/resources/gradio_app.js:buildClipboardFileName
normalizeClipboardFile software healthy mineru/resources/gradio_app.js:normalizeClipboardFile
collectClipboardFiles software healthy mineru/resources/gradio_app.js:collectClipboardFiles
createUploadFileList software healthy mineru/resources/gradio_app.js:createUploadFileList
assignClipboardFileToUpload software healthy mineru/resources/gradio_app.js:assignClipboardFileToUpload
uploadClipboardFile software healthy mineru/resources/gradio_app.js:uploadClipboardFile
positionAdvancedDropdowns software healthy mineru/resources/gradio_app.js:positionAdvancedDropdowns
supportsHoverPopover software healthy mineru/resources/gradio_app.js:supportsHoverPopover
cancelPopoverTimers software healthy mineru/resources/gradio_app.js:cancelPopoverTimers
clearLegacyPopoverDisplay software healthy mineru/resources/gradio_app.js:clearLegacyPopoverDisplay
applyOpenPopoverStyle software healthy mineru/resources/gradio_app.js:applyOpenPopoverStyle
applyClosedPopoverStyle software healthy mineru/resources/gradio_app.js:applyClosedPopoverStyle
queueDropdownPosition software healthy mineru/resources/gradio_app.js:queueDropdownPosition
positionPopover software healthy mineru/resources/gradio_app.js:positionPopover
openPopover software healthy mineru/resources/gradio_app.js:openPopover
closePopover software healthy mineru/resources/gradio_app.js:closePopover
scheduleHoverOpen software healthy mineru/resources/gradio_app.js:scheduleHoverOpen
scheduleHoverClose software healthy mineru/resources/gradio_app.js:scheduleHoverClose

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

LabelLayerStatusPath
UploadAsset software healthy mineru/cli/api_client.py:439
ServerHealth software healthy mineru/cli/api_client.py:445
SubmitResponse software healthy mineru/cli/api_client.py:452
TaskStatusSnapshot software healthy mineru/cli/api_client.py:461
LocalAPIServer software healthy mineru/cli/api_client.py:466
ReusableLocalAPIServer software healthy mineru/cli/api_client.py:598
InputDocument software healthy mineru/cli/client.py:53
PlannedTask software healthy mineru/cli/client.py:62
TaskExecutionProgress software healthy mineru/cli/client.py:69
VisualizationContext software healthy mineru/cli/client.py:78
TaskFailure software healthy mineru/cli/client.py:89
LiveTaskStatusState software healthy mineru/cli/client.py:96
LiveAwareStderrSink software healthy mineru/cli/client.py:104
LiveTaskStatusRenderer software healthy mineru/cli/client.py:134
RouterSettings software healthy mineru/cli/router.py:304
StagedUpload software healthy mineru/cli/router.py:331
MultipartPayload software healthy mineru/cli/router.py:339
RouterTaskRecord software healthy mineru/cli/router.py:355
ManagedLocalServer software healthy mineru/cli/router.py:389
WorkerState software healthy mineru/cli/router.py:503
WorkerPool software healthy mineru/cli/router.py:544
RouterTaskRegistry software healthy mineru/cli/router.py:832
UpstreamSubmissionUnavailable software healthy mineru/cli/router.py:1033
UpstreamSubmissionRejected software healthy mineru/cli/router.py:1037
GradioConcurrencyWaitSnapshot software healthy mineru/cli/gradio_app.py:50
_LimiterState software healthy mineru/cli/gradio_app.py:58
GradioRequestConcurrencyLimiter software healthy mineru/cli/gradio_app.py:64
StatusPanelState software healthy mineru/cli/gradio_app.py:440
HybridDependencyError software healthy mineru/cli/common.py:51
StoredUpload software healthy mineru/cli/fast_api.py:134
AsyncParseTask software healthy mineru/cli/fast_api.py:141
TaskWaitAbortedError software healthy mineru/cli/fast_api.py:197
AsyncTaskManager software healthy mineru/cli/fast_api.py:922
VisualizationJob software healthy mineru/cli/visualization.py:14
VisualizationResult software healthy mineru/cli/visualization.py:23
ParseRequestOptions software healthy mineru/cli/api_request.py:18
OcrConfidence software healthy mineru/utils/ocr_utils.py:9
OfficeRichTextSegment software healthy mineru/utils/office_rich_text.py:10
RowMetrics software healthy mineru/utils/table_merge.py:18
RowSignature software healthy mineru/utils/table_merge.py:26
RenderedCellSegment software healthy mineru/utils/table_merge.py:39
RowScanResult software healthy mineru/utils/table_merge.py:46
TableMergeState software healthy mineru/utils/table_merge.py:55
BlockType software healthy mineru/utils/enum_class.py:4
ContentType software healthy mineru/utils/enum_class.py:51
ContentTypeV2 software healthy mineru/utils/enum_class.py:62
MakeMode software healthy mineru/utils/enum_class.py:89
ModelPath software healthy mineru/utils/enum_class.py:96
SplitFlag software healthy mineru/utils/enum_class.py:110
ImageType software healthy mineru/utils/enum_class.py:115

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

LabelLayerStatusPath
mineru.template.json software healthy mineru.template.json
update_version.py software healthy update_version.py
mkdocs.yml software healthy mkdocs.yml
README.md software healthy README.md
LICENSE.md software healthy LICENSE.md
README_zh-CN.md software healthy README_zh-CN.md
pyproject.toml software healthy pyproject.toml
MinerU_CLA.md software healthy MinerU_CLA.md
SECURITY.md software healthy SECURITY.md
compose.yaml software healthy docker/compose.yaml
mlu.Dockerfile software healthy docker/china/mlu.Dockerfile
ppu.Dockerfile software healthy docker/china/ppu.Dockerfile
corex.Dockerfile software healthy docker/china/corex.Dockerfile
gcu.Dockerfile software healthy docker/china/gcu.Dockerfile
npu.Dockerfile software healthy docker/china/npu.Dockerfile
kxpu.Dockerfile software healthy docker/china/kxpu.Dockerfile
Dockerfile software healthy docker/china/Dockerfile
dcu.Dockerfile software healthy docker/china/dcu.Dockerfile
musa.Dockerfile software healthy docker/china/musa.Dockerfile
maca.Dockerfile software healthy docker/china/maca.Dockerfile
Dockerfile software healthy docker/global/Dockerfile
demo.py software healthy demo/demo.py
get_coverage.py software healthy tests/get_coverage.py
clean_coverage.py software healthy tests/clean_coverage.py
test_e2e.py software healthy tests/unittest/test_e2e.py
README.md software healthy projects/README.md
README_zh-CN.md software healthy projects/README_zh-CN.md
requirements.txt software healthy docs/requirements.txt
index.md software healthy docs/zh/index.md
index.md software healthy docs/zh/demo/index.md
output_files.md software healthy docs/zh/reference/output_files.md
index.md software healthy docs/zh/reference/index.md
changelog.md software healthy docs/zh/reference/changelog.md
model_source.md software healthy docs/zh/usage/model_source.md
index.md software healthy docs/zh/usage/index.md
quick_usage.md software healthy docs/zh/usage/quick_usage.md
advanced_cli_parameters.md software healthy docs/zh/usage/advanced_cli_parameters.md
cli_tools.md software healthy docs/zh/usage/cli_tools.md
FastGPT.md software healthy docs/zh/usage/plugin/FastGPT.md
Dify.md software healthy docs/zh/usage/plugin/Dify.md
ModelWhale.md software healthy docs/zh/usage/plugin/ModelWhale.md
DingTalk.md software healthy docs/zh/usage/plugin/DingTalk.md
RagFlow.md software healthy docs/zh/usage/plugin/RagFlow.md
Coze.md software healthy docs/zh/usage/plugin/Coze.md
n8n.md software healthy docs/zh/usage/plugin/n8n.md
Sider.md software healthy docs/zh/usage/plugin/Sider.md
Cherry_Studio.md software healthy docs/zh/usage/plugin/Cherry_Studio.md
DataFlow.md software healthy docs/zh/usage/plugin/DataFlow.md
BISHENG.md software healthy docs/zh/usage/plugin/BISHENG.md
IluvatarCorex.md software healthy docs/zh/usage/acceleration_cards/IluvatarCorex.md

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

LabelLayerStatusPath
docker software healthy docker
china software healthy docker/china
global software healthy docker/global
demo software healthy demo
tests software healthy tests
unittest software healthy tests/unittest
projects software healthy projects
docs software healthy docs
zh software healthy docs/zh
demo software healthy docs/zh/demo
reference software healthy docs/zh/reference
usage software healthy docs/zh/usage
plugin software healthy docs/zh/usage/plugin
acceleration_cards software healthy docs/zh/usage/acceleration_cards
quick_start software healthy docs/zh/quick_start
faq software healthy docs/zh/faq
en software healthy docs/en
demo software healthy docs/en/demo
reference software healthy docs/en/reference
usage software healthy docs/en/usage
quick_start software healthy docs/en/quick_start
faq software healthy docs/en/faq
mineru software healthy mineru
resources software healthy mineru/resources
cli software healthy mineru/cli
utils software healthy mineru/utils
model software healthy mineru/model
layout software healthy mineru/model/layout
mfr software healthy mineru/model/mfr
pp_formulanet_plus_m software healthy mineru/model/mfr/pp_formulanet_plus_m
unimernet software healthy mineru/model/mfr/unimernet
unimernet_hf software healthy mineru/model/mfr/unimernet/unimernet_hf
unimer_mbart software healthy mineru/model/mfr/unimernet/unimernet_hf/unimer_mbart
unimer_swin software healthy mineru/model/mfr/unimernet/unimernet_hf/unimer_swin
vlm software healthy mineru/model/vlm
table software healthy mineru/model/table
cls software healthy mineru/model/table/cls
rec software healthy mineru/model/table/rec
slanet_plus software healthy mineru/model/table/rec/slanet_plus
unet_table software healthy mineru/model/table/rec/unet_table
utils software healthy mineru/model/utils
pytorchocr software healthy mineru/model/utils/pytorchocr
utils software healthy mineru/model/utils/pytorchocr/utils
resources software healthy mineru/model/utils/pytorchocr/utils/resources
postprocess software healthy mineru/model/utils/pytorchocr/postprocess
modeling software healthy mineru/model/utils/pytorchocr/modeling
architectures software healthy mineru/model/utils/pytorchocr/modeling/architectures
necks software healthy mineru/model/utils/pytorchocr/modeling/necks
heads software healthy mineru/model/utils/pytorchocr/modeling/heads
backbones software healthy mineru/model/utils/pytorchocr/modeling/backbones

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

LabelLayerStatusPath
on-failure cicd healthy .github/workflows/rerun.yml
build cicd healthy .github/workflows/mkdocs.yml
CLAAssistant cicd healthy .github/workflows/cla.yml
update-version cicd healthy .github/workflows/python-package.yml
check-install cicd healthy .github/workflows/python-package.yml
build cicd healthy .github/workflows/python-package.yml
release cicd healthy .github/workflows/python-package.yml
cli-test cicd healthy .github/workflows/cli.yml

LabelLayerStatusPath
image::docker/china/Dockerfile hardware healthy docker/china/Dockerfile
image::docker/global/Dockerfile hardware healthy docker/global/Dockerfile
mineru-openai-server hardware healthy docker/compose.yaml
mineru-api hardware healthy docker/compose.yaml
mineru-router hardware healthy docker/compose.yaml
mineru-gradio hardware healthy docker/compose.yaml

LabelLayerStatusPath
gha::rerun cicd healthy .github/workflows/rerun.yml
gha::mkdocs cicd healthy .github/workflows/mkdocs.yml
gha::cla cicd healthy .github/workflows/cla.yml
gha::python-package cicd healthy .github/workflows/python-package.yml
gha::cli cicd healthy .github/workflows/cli.yml

LabelLayerStatusPath
port:30000 network healthy docker/compose.yaml
port:8000 network healthy docker/compose.yaml
port:8002 network healthy docker/compose.yaml
port:7860 network healthy docker/compose.yaml

LabelLayerStatusPath
PYPI_TOKEN cicd healthy
GITHUB_TOKEN cicd healthy
RELEASE_TOKEN cicd healthy
FEISHU_WEBHOOK_URL cicd healthy

LabelLayerStatusPath
4.11.0.86 network healthy pyproject.toml
0.10.1.1 network healthy pyproject.toml
127.0.0.1 network healthy demo/demo.py

LabelLayerStatusPath
repobility-clone-xde099rz software healthy /tmp/repobility-clone-xde099rz

LabelLayerStatusPath
gpu (detected) hardware healthy docker/compose.yaml
For AI agents: Voting guide (TP/FP) MCP manifest Stdio wrapper SARIF Integrate Findings queue Vote TP/FP on findings to calibrate the engine.
For AI agents + API integrations
Email me when this repo regresses
Free. We re-scan periodically; new criticals → your inbox. No signup required for the scan itself.
API access

This page is publicly accessible at: https://repobility.com/scan/307eb78d-dd1b-4d20-a65e-ea4268f41aeb/

To check status programmatically (no auth required):

curl -s https://repobility.com/api/v1/public/scan/307eb78d-dd1b-4d20-a65e-ea4268f41aeb/

Important — please don't re-submit the same URL repeatedly. The submission endpoint is idempotent: re-submitting the same git URL returns this same scan_token, not a new one. To re-scan this repo, sign up free and use the dashboard.