
cocoindex-io/cocoindex

https://github.com/cocoindex-io/cocoindex · scanned 2026-05-15 03:47 UTC (3 weeks ago) · 10 languages

204 findings (93 legacy + 111 scanner) · 84th percentile · Python · medium (20-100K LoC) · Scanner says 87 (lower by 6)

UNIFIED Repobility · multi-layer engine · AI coders

Complete repo analysis

Last scanned 3 weeks ago · v1 · 88 findings from 1 source. Findings combine the legacy security pipeline, the multi-layer engine (atlas, wiring, flows, ranked), and verified AI agent contributions.

Score breakdown: 2026-05-18-v5 calibration-aware
Component Sub-score Weight Contribution
structure_score 60.0 0.15 9.00
security_score 86.1 0.25 21.52
testing_score 95.0 0.20 19.00
documentation_score 86.6 0.15 12.99
practices_score 80.0 0.15 12.00
code_quality 63.8 0.10 6.38
Overall 1.00 80.9
Calibrated penalty buckets (security_score): agent: 1.1 · authz: 2.1 · threat: 10.8
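
The Overall figure in the breakdown above is consistent with a plain weighted sum of the sub-scores. The following is a minimal sketch that reproduces that arithmetic from the table values. The names `COMPONENTS` and `overall_score` are illustrative and not part of the scanner, and rounding to one decimal is an assumption about how the page displays the result.

```python
# Sub-scores and weights copied from the score breakdown table.
# The structure and names here are hypothetical, not the engine's actual code.
COMPONENTS = {
    "structure_score": (60.0, 0.15),
    "security_score": (86.1, 0.25),
    "testing_score": (95.0, 0.20),
    "documentation_score": (86.6, 0.15),
    "practices_score": (80.0, 0.15),
    "code_quality": (63.8, 0.10),
}


def overall_score(components):
    """Return the weighted sum of sub-scores; weights must total 1.0."""
    total_weight = sum(weight for _, weight in components.values())
    if abs(total_weight - 1.0) > 1e-9:
        raise ValueError(f"weights sum to {total_weight}, expected 1.0")
    return sum(score * weight for score, weight in components.values())


if __name__ == "__main__":
    # Print each component's contribution, then the rounded total.
    for name, (score, weight) in COMPONENTS.items():
        print(f"{name:<20} {score:5.1f} x {weight:.2f} = {score * weight:.3f}")
    print(f"Overall: {overall_score(COMPONENTS):.1f}")
```

Rounded to one decimal, the weighted sum comes to 80.9, matching the Overall row. How the calibrated penalty buckets feed into security_score is not stated on the page, so the sketch takes the 86.1 sub-score as given.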

All 3820 nodes from the latest scan, grouped by kind. Each node is a unit the engine identified (file, function, endpoint, table…). Most users won't need this view — it's primarily for debugging the engine's graph extraction or for AI agents that want to enumerate the project structure.

Label Layer Status Path
parse_args software healthy benchmarks/file_summarization/python/runner.py:30
build_rust_binary software healthy benchmarks/file_summarization/python/runner.py:67
run_language software healthy benchmarks/file_summarization/python/runner.py:75
run_trial software healthy benchmarks/file_summarization/python/runner.py:133
aggregate_results software healthy benchmarks/file_summarization/python/runner.py:199
print_table software healthy benchmarks/file_summarization/python/runner.py:249
main software healthy benchmarks/file_summarization/python/runner.py:288
__init__ software healthy benchmarks/file_summarization/python/common.py:249
update software healthy benchmarks/file_summarization/python/common.py:252
hexdigest software healthy benchmarks/file_summarization/python/common.py:257
workload_profile software healthy benchmarks/file_summarization/python/common.py:261
scaled_count software healthy benchmarks/file_summarization/python/common.py:268
fnv1a64_bytes software healthy benchmarks/file_summarization/python/common.py:272
fnv1a64_text software healthy benchmarks/file_summarization/python/common.py:278
fnv1a64_hex software dead benchmarks/file_summarization/python/common.py:282
canonical_json_bytes software healthy benchmarks/file_summarization/python/common.py:286
tree_digest software healthy benchmarks/file_summarization/python/common.py:295
reset_dir software healthy benchmarks/file_summarization/python/common.py:308
write_text software healthy benchmarks/file_summarization/python/common.py:314
build_phrase software healthy benchmarks/file_summarization/python/common.py:319
build_line_block software healthy benchmarks/file_summarization/python/common.py:333
build_comment_block software healthy benchmarks/file_summarization/python/common.py:348
slugify_heading software healthy benchmarks/file_summarization/python/common.py:365
pick_language_from_suffix software healthy benchmarks/file_summarization/python/common.py:379
tokenize_ascii_words software healthy benchmarks/file_summarization/python/common.py:389
top_tokens_from_counts software healthy benchmarks/file_summarization/python/common.py:408
split_into_sections software healthy benchmarks/file_summarization/python/common.py:413
flush software healthy benchmarks/file_summarization/python/common.py:420
is_boundary software healthy benchmarks/file_summarization/python/common.py:426
analyze_section software healthy benchmarks/file_summarization/python/common.py:480
_summarize_file software healthy benchmarks/file_summarization/python/common.py:526
summarize_collection software healthy benchmarks/file_summarization/python/common.py:549
collection_to_jsonable software healthy benchmarks/file_summarization/python/common.py:584
file_report_to_jsonable software healthy benchmarks/file_summarization/python/common.py:588
sync_output_tree software healthy benchmarks/file_summarization/python/common.py:607
generate_dataset software healthy benchmarks/file_summarization/python/common.py:708
apply_edit_mutation software healthy benchmarks/file_summarization/python/common.py:721
apply_shape_mutation software healthy benchmarks/file_summarization/python/common.py:782
_mutable_files software healthy benchmarks/file_summarization/python/common.py:811
_add_shape_file software healthy benchmarks/file_summarization/python/common.py:819
_generate_codebase_dataset software healthy benchmarks/file_summarization/python/common.py:853
_generate_docs_dataset software healthy benchmarks/file_summarization/python/common.py:934
_render_rust_file software healthy benchmarks/file_summarization/python/common.py:961
_render_python_file software healthy benchmarks/file_summarization/python/common.py:999
_render_markdown_page software healthy benchmarks/file_summarization/python/common.py:1035
_render_toml_file software healthy benchmarks/file_summarization/python/common.py:1076
run_state software healthy benchmarks/file_summarization/python/benchmark.py:51
collection_kind software healthy benchmarks/file_summarization/python/benchmark.py:57
file_patterns software healthy benchmarks/file_summarization/python/benchmark.py:61
extract_sections software healthy benchmarks/file_summarization/python/benchmark.py:68

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

Label Layer Status Path
WorkloadProfile software healthy benchmarks/file_summarization/python/common.py:139
SectionInput software healthy benchmarks/file_summarization/python/common.py:205
SectionAnalysis software healthy benchmarks/file_summarization/python/common.py:214
FileSummary software healthy benchmarks/file_summarization/python/common.py:227
CollectionSummary software healthy benchmarks/file_summarization/python/common.py:237
Fnv1a64 software healthy benchmarks/file_summarization/python/common.py:248
BenchMetrics software healthy benchmarks/file_summarization/python/benchmark.py:26
RunState software healthy benchmarks/file_summarization/python/benchmark.py:40
DocEmbedding software healthy examples/amazon_s3_embedding/main.py:72
FunctionInfo software healthy examples/multi_codebase_summarization/models.py:6
ClassInfo software healthy examples/multi_codebase_summarization/models.py:19
CodebaseInfo software healthy examples/multi_codebase_summarization/models.py:45
DocEmbedding software healthy examples/gdrive_text_embedding/main.py:60
TopicsResponse software healthy examples/hn_trending_topics/models.py:6
Comment software healthy examples/hn_trending_topics/main.py:51
Thread software healthy examples/hn_trending_topics/main.py:59
HnMessage software healthy examples/hn_trending_topics/main.py:74
HnTopic software healthy examples/hn_trending_topics/main.py:87
TableTargets software healthy examples/hn_trending_topics/main.py:200
DocEmbedding software healthy examples/text_embedding_lancedb/main.py:47
AuthorModel software healthy examples/paper_metadata/models.py:6
PaperMetadataModel software healthy examples/paper_metadata/models.py:14
PaperBasicInfo software healthy examples/paper_metadata/main.py:75
PaperMetadataRow software healthy examples/paper_metadata/main.py:81
AuthorPaperRow software healthy examples/paper_metadata/main.py:90
MetadataEmbeddingRow software healthy examples/paper_metadata/main.py:96
CodeEmbedding software healthy examples/code_embedding_lancedb/main.py:47
Meeting software healthy examples/meeting_notes_graph_neo4j/main.py:102
Person software healthy examples/meeting_notes_graph_neo4j/main.py:110
Task software healthy examples/meeting_notes_graph_neo4j/main.py:115
AttendedRel software healthy examples/meeting_notes_graph_neo4j/main.py:120
ExtractedPerson software healthy examples/meeting_notes_graph_neo4j/main.py:139
ExtractedTask software healthy examples/meeting_notes_graph_neo4j/main.py:145
ExtractedMeeting software healthy examples/meeting_notes_graph_neo4j/main.py:155
MeetingExtraction software healthy examples/meeting_notes_graph_neo4j/main.py:232
DocEmbedding software healthy examples/text_embedding/main.py:63
SourceProduct software healthy examples/postgres_source/main.py:47
OutputProduct software healthy examples/postgres_source/main.py:56
Meeting software healthy examples/meeting_notes_graph_falkordb/main.py:93
Person software healthy examples/meeting_notes_graph_falkordb/main.py:101
Task software healthy examples/meeting_notes_graph_falkordb/main.py:106
AttendedRel software healthy examples/meeting_notes_graph_falkordb/main.py:111
ExtractedPerson software healthy examples/meeting_notes_graph_falkordb/main.py:130
ExtractedTask software healthy examples/meeting_notes_graph_falkordb/main.py:136
ExtractedMeeting software healthy examples/meeting_notes_graph_falkordb/main.py:146
MeetingExtraction software healthy examples/meeting_notes_graph_falkordb/main.py:223
DocEmbedding software healthy examples/oci_object_storage_embedding/main.py:126
TranscriptChunk software healthy examples/entire_session_search/models.py:7
SessionInfo software healthy examples/entire_session_search/models.py:15
ChunkInput software healthy examples/entire_session_search/models.py:23

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

Label Layer Status Path
Cargo.toml software healthy Cargo.toml
README.md software healthy README.md
CLAUDE.md software healthy CLAUDE.md
CONTRIBUTING.md software healthy CONTRIBUTING.md
.pre-commit-config.yaml software healthy .pre-commit-config.yaml
pyproject.toml software healthy pyproject.toml
CODE_OF_CONDUCT.md software healthy CODE_OF_CONDUCT.md
opencode.json software healthy opencode.json
about.toml software healthy about.toml
ruff.toml software healthy ruff.toml
README.md software healthy benchmarks/file_summarization/README.md
run.sh software healthy benchmarks/file_summarization/run.sh
BENCHMARK_REPORT.md software healthy benchmarks/file_summarization/BENCHMARK_REPORT.md
Cargo.toml software healthy benchmarks/file_summarization/rust/Cargo.toml
main.rs software healthy benchmarks/file_summarization/rust/src/main.rs
pyproject.toml software healthy benchmarks/file_summarization/python/pyproject.toml
runner.py software healthy benchmarks/file_summarization/python/runner.py
common.py software healthy benchmarks/file_summarization/python/common.py
benchmark.py software healthy benchmarks/file_summarization/python/benchmark.py
package.json software healthy docs/package.json
package-lock.json software healthy docs/package-lock.json
tsconfig.json software healthy docs/tsconfig.json
content.config.ts software warning docs/src/content.config.ts
consts.ts software healthy docs/src/consts.ts
README.md software healthy docs/src/components/diagrams/README.md
diagrams.css software healthy docs/src/components/diagrams/diagrams.css
multi-codebase-summarization.md software healthy docs/src/content/example-posts/multi-codebase-summarization…
pdf-to-markdown.md software healthy docs/src/content/example-posts/pdf-to-markdown.md
globals.css software healthy docs/src/styles/globals.css
Cargo.toml software healthy rust/ops_text/Cargo.toml
lib.rs software healthy rust/ops_text/src/lib.rs
pattern_matcher.rs software healthy rust/ops_text/src/pattern_matcher.rs
output_positions.rs software healthy rust/ops_text/src/output_positions.rs
prog_langs.rs software healthy rust/ops_text/src/prog_langs.rs
by_separators.rs software healthy rust/ops_text/src/split/by_separators.rs
mod.rs software healthy rust/ops_text/src/split/mod.rs
recursive.rs software healthy rust/ops_text/src/split/recursive.rs
Cargo.toml software healthy rust/utils/Cargo.toml
error.rs software healthy rust/utils/src/error.rs
lib.rs software healthy rust/utils/src/lib.rs
yaml_ser.rs software healthy rust/utils/src/yaml_ser.rs
bytes_decode.rs software healthy rust/utils/src/bytes_decode.rs
http.rs software healthy rust/utils/src/http.rs
concur_control.rs software healthy rust/utils/src/concur_control.rs
slow_warn.rs software healthy rust/utils/src/slow_warn.rs
prelude.rs software healthy rust/utils/src/prelude.rs
str_sanitize.rs software healthy rust/utils/src/str_sanitize.rs
batching.rs software healthy rust/utils/src/batching.rs
deser.rs software healthy rust/utils/src/deser.rs
retryable.rs software healthy rust/utils/src/retryable.rs

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

Label Layer Status Path
benchmarks software healthy benchmarks
file_summarization software healthy benchmarks/file_summarization
rust software healthy benchmarks/file_summarization/rust
src software healthy benchmarks/file_summarization/rust/src
python software healthy benchmarks/file_summarization/python
docs software healthy docs
src software healthy docs/src
components software healthy docs/src/components
diagrams software healthy docs/src/components/diagrams
content software healthy docs/src/content
example-posts software healthy docs/src/content/example-posts
styles software healthy docs/src/styles
rust software healthy rust
ops_text software healthy rust/ops_text
src software healthy rust/ops_text/src
split software healthy rust/ops_text/src/split
utils software healthy rust/utils
src software healthy rust/utils/src
py software healthy rust/py
src software healthy rust/py/src
py_utils software healthy rust/py_utils
src software healthy rust/py_utils/src
core software healthy rust/core
tests software healthy rust/core/tests
src software healthy rust/core/src
state software healthy rust/core/src/state
inspect software healthy rust/core/src/inspect
engine software healthy rust/core/src/engine
telemetry software healthy rust/core/src/telemetry
cocoindex software healthy rust/cocoindex
src software healthy rust/cocoindex/src
llm software healthy rust/cocoindex/src/llm
sdk software healthy rust/sdk
cocoindex_macros software healthy rust/sdk/cocoindex_macros
src software healthy rust/sdk/cocoindex_macros/src
cocoindex software healthy rust/sdk/cocoindex
tests software healthy rust/sdk/cocoindex/tests
src software healthy rust/sdk/cocoindex/src
benches software healthy rust/sdk/cocoindex/benches
dev software healthy dev
examples software healthy examples
amazon_s3_embedding software healthy examples/amazon_s3_embedding
multi_codebase_summarization software healthy examples/multi_codebase_summarization
image_search software healthy examples/image_search
frontend software healthy examples/image_search/frontend
src software healthy examples/image_search/frontend/src
text_embedding_turbopuffer software healthy examples/text_embedding_turbopuffer
markdown_files software healthy examples/text_embedding_turbopuffer/markdown_files
gdrive_text_embedding software healthy examples/gdrive_text_embedding
hn_trending_topics software healthy examples/hn_trending_topics

Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.

Label Layer Status Path
.env in repo security healthy examples/multi_codebase_summarization/.env
.env in repo security healthy examples/image_search/.env
.env in repo security healthy examples/gdrive_text_embedding/.env
.env in repo security healthy examples/hn_trending_topics/.env
.env in repo security healthy examples/files_transform/.env
.env in repo security healthy examples/text_embedding_lancedb/.env
.env in repo security healthy examples/paper_metadata/.env
.env in repo security healthy examples/code_embedding_lancedb/.env
.env in repo security healthy examples/pdf_to_markdown/.env
.env in repo security healthy examples/text_embedding/.env
.env in repo security healthy examples/postgres_source/.env
.env in repo security healthy examples/patient_intake_extraction_baml/.env
.env in repo security healthy examples/entire_session_search/.env
.env in repo security healthy examples/text_embedding_qdrant/.env
.env in repo security healthy examples/code_embedding/.env
.env in repo security healthy examples/image_search_colpali/.env
.env in repo security healthy examples/patient_intake_extraction_dspy/.env
.env in repo security healthy examples/pdf_embedding/.env
.env in repo security healthy examples/audio_to_text/.env
password_literal::python/tests/connectors/test_neo4j_target… security healthy python/tests/connectors/test_neo4j_target.py

Label Layer Status Path
deploy cicd healthy .github/workflows/_docs_release.yml
test-build cicd healthy .github/workflows/docs_test.yml
e2e-type-check cicd healthy .github/workflows/e2e_type_check.yml
link-check cicd healthy .github/workflows/links.yml
generate-3p-notices cicd healthy .github/workflows/release.yml
build cicd healthy .github/workflows/release.yml
test-abi3 cicd healthy .github/workflows/release.yml
test-free-threaded cicd healthy .github/workflows/release.yml
sdist cicd healthy .github/workflows/release.yml
release cicd healthy .github/workflows/release.yml
release-docs cicd healthy .github/workflows/release.yml
build-test cicd healthy .github/workflows/CI.yml
deploy cicd healthy .github/workflows/docs_release.yml
fast-check cicd healthy .github/workflows/fast_check.yml
build-test cicd healthy .github/workflows/_test.yml
validate-3p-notices cicd healthy .github/workflows/_test.yml
autoupdate cicd healthy .github/workflows/prek_autoupdate.yml

Label Layer Status Path
gha::_docs_release cicd healthy .github/workflows/_docs_release.yml
gha::docs_test cicd healthy .github/workflows/docs_test.yml
gha::e2e_type_check cicd healthy .github/workflows/e2e_type_check.yml
gha::links cicd healthy .github/workflows/links.yml
gha::release cicd healthy .github/workflows/release.yml
gha::CI cicd healthy .github/workflows/CI.yml
gha::docs_release cicd healthy .github/workflows/docs_release.yml
gha::fast_check cicd healthy .github/workflows/fast_check.yml
gha::_test cicd healthy .github/workflows/_test.yml
gha::prek_autoupdate cicd healthy .github/workflows/prek_autoupdate.yml

Label Layer Status Path
postgres data healthy pyproject.toml
sqlite data healthy pyproject.toml
redis data healthy docs/package-lock.json
postgresql data healthy examples/hn_trending_topics/models.py
mysql data healthy python/tests/connectors/test_doris_target.py

Label Layer Status Path
port:7474 network healthy dev/neo4j.yaml
port:7687 network healthy dev/neo4j.yaml
port:5432 network healthy dev/postgres.yaml
port:5173 network healthy examples/image_search/frontend/vite.config.js

Label Layer Status Path
vps::azure hardware healthy docs/package-lock.json
vps::aws hardware healthy rust/utils/Cargo.toml
vps::gcp hardware healthy examples/text_embedding_turbopuffer/.env.example

Label Layer Status Path
source_products data healthy examples/postgres_source/prepare_source_data.sql
IF data healthy python/cocoindex/connectors/doris/_target.py

Label Layer Status Path
App frontend healthy examples/image_search/frontend/src/App.jsx
App frontend healthy examples/image_search_colpali/frontend/src/App.jsx

Label Layer Status Path
repobility-clone-5ktct4ua software healthy /tmp/repobility-clone-5ktct4ua

Label Layer Status Path
GET /search api healthy examples/image_search/api.py

Label Layer Status Path
kafka data healthy pyproject.toml

Label Layer Status Path
127.0.0.1 network healthy rust/core/src/telemetry/mod.rs

Label Layer Status Path
gpu (detected) hardware healthy examples/pdf_embedding/main.py

Label Layer Status Path
GH_PAGES_DEPLOY cicd healthy
For AI agents + API integrations
API access

This page is publicly accessible at: https://repobility.com/scan/ecebda09-0cac-4f20-b198-ea898eaf9be2/

To check status programmatically (no auth required):

curl -s https://repobility.com/api/v1/public/scan/ecebda09-0cac-4f20-b198-ea898eaf9be2/
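
The same status check can be scripted. The following is a minimal sketch using only the Python standard library. The helper names `scan_status_url` and `fetch_scan_status` are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page, so the response is returned as parsed without relying on any particular field.

```python
import json
import urllib.request

# Base path taken from the curl command above.
PUBLIC_SCAN_BASE = "https://repobility.com/api/v1/public/scan/"


def scan_status_url(scan_token: str) -> str:
    """Build the public status URL for a scan token (trailing slash kept)."""
    return f"{PUBLIC_SCAN_BASE}{scan_token}/"


def fetch_scan_status(scan_token: str, timeout: float = 10.0):
    """Fetch the scan status without authentication.

    Assumes the endpoint responds with JSON; the schema is not documented
    here, so callers should inspect the result before relying on any key.
    """
    with urllib.request.urlopen(scan_status_url(scan_token), timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    token = "ecebda09-0cac-4f20-b198-ea898eaf9be2"
    print(scan_status_url(token))
    # Uncomment to perform the network request:
    # print(fetch_scan_status(token))
```

Polling this read-only URL is distinct from re-submitting the repository: it only reads the existing scan and does not trigger a new one.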

Important — please don't re-submit the same URL repeatedly. The submission endpoint is idempotent: re-submitting the same git URL returns this same scan_token, not a new one. To re-scan this repo, sign up free and use the dashboard.