https://github.com/cocoindex-io/cocoindex · scanned 2026-05-15 03:47 UTC (3 weeks ago) · 10 languages
204 findings (93 legacy + 111 scanner) · 84th percentile · Python · medium (20-100K LoC) · Scanner says 87 (lower by 6)
Last scanned 3 weeks ago · v1 · 88 findings from 1 source. Findings combine the legacy security pipeline, the multi-layer engine (atlas, wiring, flows, ranked), and verified AI agent contributions.
| Component | Sub-score | Weight | Contribution |
|---|---|---|---|
| structure_score | 60.0 | 0.15 | 9.00 |
| security_score | 86.1 | 0.25 | 21.52 |
| testing_score | 95.0 | 0.20 | 19.00 |
| documentation_score | 86.6 | 0.15 | 12.99 |
| practices_score | 80.0 | 0.15 | 12.00 |
| code_quality | 63.8 | 0.10 | 6.38 |
| Overall | | 1.00 | 80.9 |
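The Contribution column suggests that each contribution is the sub-score multiplied by its weight, and that the overall score is the sum of those contributions. The sketch below checks that arithmetic against the table values. It assumes a plain weighted sum; the scanner's actual formula is not shown on this page, and the names `COMPONENTS` and `overall_score` are illustrative only.

```python
# Sub-scores and weights copied from the score table.
# Assumption: the overall score is a plain weighted sum of the sub-scores.
COMPONENTS = {
    "structure_score": (60.0, 0.15),
    "security_score": (86.1, 0.25),
    "testing_score": (95.0, 0.20),
    "documentation_score": (86.6, 0.15),
    "practices_score": (80.0, 0.15),
    "code_quality": (63.8, 0.10),
}


def overall_score(components):
    """Return the weighted sum of (sub_score, weight) pairs."""
    return sum(score * weight for score, weight in components.values())


total = overall_score(COMPONENTS)
weight_sum = sum(weight for _, weight in COMPONENTS.values())

# Per-component contributions, rounded the way the table displays them.
for name, (score, weight) in COMPONENTS.items():
    print(f"{name:22s} {score:5.1f} x {weight:.2f} = {score * weight:6.2f}")
print(f"weights sum to {weight_sum:.2f}, overall = {total:.1f}")
```

Under that assumption the weights sum to 1.00 and the total rounds to the 80.9 shown in the Overall row.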
agent: 1.1 · authz: 2.1 · threat: 10.8
All 3820 nodes from the latest scan, grouped by kind. Each node is a unit the engine identified (file, function, endpoint, table…). Most users won't need this view; it is primarily for debugging the engine's graph extraction or for AI agents that want to enumerate the project structure.
| Label | Layer | Status | Path |
|---|---|---|---|
| parse_args | software | healthy | benchmarks/file_summarization/python/runner.py:30 |
| build_rust_binary | software | healthy | benchmarks/file_summarization/python/runner.py:67 |
| run_language | software | healthy | benchmarks/file_summarization/python/runner.py:75 |
| run_trial | software | healthy | benchmarks/file_summarization/python/runner.py:133 |
| aggregate_results | software | healthy | benchmarks/file_summarization/python/runner.py:199 |
| print_table | software | healthy | benchmarks/file_summarization/python/runner.py:249 |
| main | software | healthy | benchmarks/file_summarization/python/runner.py:288 |
| `__init__` | software | healthy | benchmarks/file_summarization/python/common.py:249 |
| update | software | healthy | benchmarks/file_summarization/python/common.py:252 |
| hexdigest | software | healthy | benchmarks/file_summarization/python/common.py:257 |
| workload_profile | software | healthy | benchmarks/file_summarization/python/common.py:261 |
| scaled_count | software | healthy | benchmarks/file_summarization/python/common.py:268 |
| fnv1a64_bytes | software | healthy | benchmarks/file_summarization/python/common.py:272 |
| fnv1a64_text | software | healthy | benchmarks/file_summarization/python/common.py:278 |
| fnv1a64_hex | software | dead | benchmarks/file_summarization/python/common.py:282 |
| canonical_json_bytes | software | healthy | benchmarks/file_summarization/python/common.py:286 |
| tree_digest | software | healthy | benchmarks/file_summarization/python/common.py:295 |
| reset_dir | software | healthy | benchmarks/file_summarization/python/common.py:308 |
| write_text | software | healthy | benchmarks/file_summarization/python/common.py:314 |
| build_phrase | software | healthy | benchmarks/file_summarization/python/common.py:319 |
| build_line_block | software | healthy | benchmarks/file_summarization/python/common.py:333 |
| build_comment_block | software | healthy | benchmarks/file_summarization/python/common.py:348 |
| slugify_heading | software | healthy | benchmarks/file_summarization/python/common.py:365 |
| pick_language_from_suffix | software | healthy | benchmarks/file_summarization/python/common.py:379 |
| tokenize_ascii_words | software | healthy | benchmarks/file_summarization/python/common.py:389 |
| top_tokens_from_counts | software | healthy | benchmarks/file_summarization/python/common.py:408 |
| split_into_sections | software | healthy | benchmarks/file_summarization/python/common.py:413 |
| flush | software | healthy | benchmarks/file_summarization/python/common.py:420 |
| is_boundary | software | healthy | benchmarks/file_summarization/python/common.py:426 |
| analyze_section | software | healthy | benchmarks/file_summarization/python/common.py:480 |
| _summarize_file | software | healthy | benchmarks/file_summarization/python/common.py:526 |
| summarize_collection | software | healthy | benchmarks/file_summarization/python/common.py:549 |
| collection_to_jsonable | software | healthy | benchmarks/file_summarization/python/common.py:584 |
| file_report_to_jsonable | software | healthy | benchmarks/file_summarization/python/common.py:588 |
| sync_output_tree | software | healthy | benchmarks/file_summarization/python/common.py:607 |
| generate_dataset | software | healthy | benchmarks/file_summarization/python/common.py:708 |
| apply_edit_mutation | software | healthy | benchmarks/file_summarization/python/common.py:721 |
| apply_shape_mutation | software | healthy | benchmarks/file_summarization/python/common.py:782 |
| _mutable_files | software | healthy | benchmarks/file_summarization/python/common.py:811 |
| _add_shape_file | software | healthy | benchmarks/file_summarization/python/common.py:819 |
| _generate_codebase_dataset | software | healthy | benchmarks/file_summarization/python/common.py:853 |
| _generate_docs_dataset | software | healthy | benchmarks/file_summarization/python/common.py:934 |
| _render_rust_file | software | healthy | benchmarks/file_summarization/python/common.py:961 |
| _render_python_file | software | healthy | benchmarks/file_summarization/python/common.py:999 |
| _render_markdown_page | software | healthy | benchmarks/file_summarization/python/common.py:1035 |
| _render_toml_file | software | healthy | benchmarks/file_summarization/python/common.py:1076 |
| run_state | software | healthy | benchmarks/file_summarization/python/benchmark.py:51 |
| collection_kind | software | healthy | benchmarks/file_summarization/python/benchmark.py:57 |
| file_patterns | software | healthy | benchmarks/file_summarization/python/benchmark.py:61 |
| extract_sections | software | healthy | benchmarks/file_summarization/python/benchmark.py:68 |
Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.
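Each row in these node tables carries a label, a layer, a status, and a path that sometimes ends in `:line`. A minimal sketch of how an agent might load such rows and pick out the ones that are not `healthy` is shown below. The `Node` class and `parse_row` helper are hypothetical names introduced for illustration; they are not part of the scanner's JSON payload, whose schema is not shown on this page.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Node:
    """One table row (hypothetical shape, not the scanner's own schema)."""
    label: str
    layer: str
    status: str
    path: str
    line: Optional[int]


def parse_row(label: str, layer: str, status: str, location: str) -> Node:
    """Split a 'path:line' location; line is None when no numeric suffix exists."""
    path, sep, tail = location.rpartition(":")
    if sep and tail.isdigit():
        return Node(label, layer, status, path, int(tail))
    return Node(label, layer, status, location, None)


# Three rows copied from the tables on this page.
rows = [
    ("parse_args", "software", "healthy",
     "benchmarks/file_summarization/python/runner.py:30"),
    ("fnv1a64_hex", "software", "dead",
     "benchmarks/file_summarization/python/common.py:282"),
    ("Cargo.toml", "software", "healthy", "Cargo.toml"),
]

nodes = [parse_row(*row) for row in rows]
flagged = [node for node in nodes if node.status != "healthy"]

for node in flagged:
    print(f"{node.status}: {node.label} at {node.path}:{node.line}")
```

Filtering on status this way surfaces entries such as the `dead` function `fnv1a64_hex` without reading every row by hand.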
| Label | Layer | Status | Path |
|---|---|---|---|
| WorkloadProfile | software | healthy | benchmarks/file_summarization/python/common.py:139 |
| SectionInput | software | healthy | benchmarks/file_summarization/python/common.py:205 |
| SectionAnalysis | software | healthy | benchmarks/file_summarization/python/common.py:214 |
| FileSummary | software | healthy | benchmarks/file_summarization/python/common.py:227 |
| CollectionSummary | software | healthy | benchmarks/file_summarization/python/common.py:237 |
| Fnv1a64 | software | healthy | benchmarks/file_summarization/python/common.py:248 |
| BenchMetrics | software | healthy | benchmarks/file_summarization/python/benchmark.py:26 |
| RunState | software | healthy | benchmarks/file_summarization/python/benchmark.py:40 |
| DocEmbedding | software | healthy | examples/amazon_s3_embedding/main.py:72 |
| FunctionInfo | software | healthy | examples/multi_codebase_summarization/models.py:6 |
| ClassInfo | software | healthy | examples/multi_codebase_summarization/models.py:19 |
| CodebaseInfo | software | healthy | examples/multi_codebase_summarization/models.py:45 |
| DocEmbedding | software | healthy | examples/gdrive_text_embedding/main.py:60 |
| TopicsResponse | software | healthy | examples/hn_trending_topics/models.py:6 |
| Comment | software | healthy | examples/hn_trending_topics/main.py:51 |
| Thread | software | healthy | examples/hn_trending_topics/main.py:59 |
| HnMessage | software | healthy | examples/hn_trending_topics/main.py:74 |
| HnTopic | software | healthy | examples/hn_trending_topics/main.py:87 |
| TableTargets | software | healthy | examples/hn_trending_topics/main.py:200 |
| DocEmbedding | software | healthy | examples/text_embedding_lancedb/main.py:47 |
| AuthorModel | software | healthy | examples/paper_metadata/models.py:6 |
| PaperMetadataModel | software | healthy | examples/paper_metadata/models.py:14 |
| PaperBasicInfo | software | healthy | examples/paper_metadata/main.py:75 |
| PaperMetadataRow | software | healthy | examples/paper_metadata/main.py:81 |
| AuthorPaperRow | software | healthy | examples/paper_metadata/main.py:90 |
| MetadataEmbeddingRow | software | healthy | examples/paper_metadata/main.py:96 |
| CodeEmbedding | software | healthy | examples/code_embedding_lancedb/main.py:47 |
| Meeting | software | healthy | examples/meeting_notes_graph_neo4j/main.py:102 |
| Person | software | healthy | examples/meeting_notes_graph_neo4j/main.py:110 |
| Task | software | healthy | examples/meeting_notes_graph_neo4j/main.py:115 |
| AttendedRel | software | healthy | examples/meeting_notes_graph_neo4j/main.py:120 |
| ExtractedPerson | software | healthy | examples/meeting_notes_graph_neo4j/main.py:139 |
| ExtractedTask | software | healthy | examples/meeting_notes_graph_neo4j/main.py:145 |
| ExtractedMeeting | software | healthy | examples/meeting_notes_graph_neo4j/main.py:155 |
| MeetingExtraction | software | healthy | examples/meeting_notes_graph_neo4j/main.py:232 |
| DocEmbedding | software | healthy | examples/text_embedding/main.py:63 |
| SourceProduct | software | healthy | examples/postgres_source/main.py:47 |
| OutputProduct | software | healthy | examples/postgres_source/main.py:56 |
| Meeting | software | healthy | examples/meeting_notes_graph_falkordb/main.py:93 |
| Person | software | healthy | examples/meeting_notes_graph_falkordb/main.py:101 |
| Task | software | healthy | examples/meeting_notes_graph_falkordb/main.py:106 |
| AttendedRel | software | healthy | examples/meeting_notes_graph_falkordb/main.py:111 |
| ExtractedPerson | software | healthy | examples/meeting_notes_graph_falkordb/main.py:130 |
| ExtractedTask | software | healthy | examples/meeting_notes_graph_falkordb/main.py:136 |
| ExtractedMeeting | software | healthy | examples/meeting_notes_graph_falkordb/main.py:146 |
| MeetingExtraction | software | healthy | examples/meeting_notes_graph_falkordb/main.py:223 |
| DocEmbedding | software | healthy | examples/oci_object_storage_embedding/main.py:126 |
| TranscriptChunk | software | healthy | examples/entire_session_search/models.py:7 |
| SessionInfo | software | healthy | examples/entire_session_search/models.py:15 |
| ChunkInput | software | healthy | examples/entire_session_search/models.py:23 |
Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.
| Label | Layer | Status | Path |
|---|---|---|---|
| Cargo.toml | software | healthy | Cargo.toml |
| README.md | software | healthy | README.md |
| CLAUDE.md | software | healthy | CLAUDE.md |
| CONTRIBUTING.md | software | healthy | CONTRIBUTING.md |
| .pre-commit-config.yaml | software | healthy | .pre-commit-config.yaml |
| pyproject.toml | software | healthy | pyproject.toml |
| CODE_OF_CONDUCT.md | software | healthy | CODE_OF_CONDUCT.md |
| opencode.json | software | healthy | opencode.json |
| about.toml | software | healthy | about.toml |
| ruff.toml | software | healthy | ruff.toml |
| README.md | software | healthy | benchmarks/file_summarization/README.md |
| run.sh | software | healthy | benchmarks/file_summarization/run.sh |
| BENCHMARK_REPORT.md | software | healthy | benchmarks/file_summarization/BENCHMARK_REPORT.md |
| Cargo.toml | software | healthy | benchmarks/file_summarization/rust/Cargo.toml |
| main.rs | software | healthy | benchmarks/file_summarization/rust/src/main.rs |
| pyproject.toml | software | healthy | benchmarks/file_summarization/python/pyproject.toml |
| runner.py | software | healthy | benchmarks/file_summarization/python/runner.py |
| common.py | software | healthy | benchmarks/file_summarization/python/common.py |
| benchmark.py | software | healthy | benchmarks/file_summarization/python/benchmark.py |
| package.json | software | healthy | docs/package.json |
| package-lock.json | software | healthy | docs/package-lock.json |
| tsconfig.json | software | healthy | docs/tsconfig.json |
| content.config.ts | software | warning | docs/src/content.config.ts |
| consts.ts | software | healthy | docs/src/consts.ts |
| README.md | software | healthy | docs/src/components/diagrams/README.md |
| diagrams.css | software | healthy | docs/src/components/diagrams/diagrams.css |
| multi-codebase-summarization.md | software | healthy | docs/src/content/example-posts/multi-codebase-summarization… |
| pdf-to-markdown.md | software | healthy | docs/src/content/example-posts/pdf-to-markdown.md |
| globals.css | software | healthy | docs/src/styles/globals.css |
| Cargo.toml | software | healthy | rust/ops_text/Cargo.toml |
| lib.rs | software | healthy | rust/ops_text/src/lib.rs |
| pattern_matcher.rs | software | healthy | rust/ops_text/src/pattern_matcher.rs |
| output_positions.rs | software | healthy | rust/ops_text/src/output_positions.rs |
| prog_langs.rs | software | healthy | rust/ops_text/src/prog_langs.rs |
| by_separators.rs | software | healthy | rust/ops_text/src/split/by_separators.rs |
| mod.rs | software | healthy | rust/ops_text/src/split/mod.rs |
| recursive.rs | software | healthy | rust/ops_text/src/split/recursive.rs |
| Cargo.toml | software | healthy | rust/utils/Cargo.toml |
| error.rs | software | healthy | rust/utils/src/error.rs |
| lib.rs | software | healthy | rust/utils/src/lib.rs |
| yaml_ser.rs | software | healthy | rust/utils/src/yaml_ser.rs |
| bytes_decode.rs | software | healthy | rust/utils/src/bytes_decode.rs |
| http.rs | software | healthy | rust/utils/src/http.rs |
| concur_control.rs | software | healthy | rust/utils/src/concur_control.rs |
| slow_warn.rs | software | healthy | rust/utils/src/slow_warn.rs |
| prelude.rs | software | healthy | rust/utils/src/prelude.rs |
| str_sanitize.rs | software | healthy | rust/utils/src/str_sanitize.rs |
| batching.rs | software | healthy | rust/utils/src/batching.rs |
| deser.rs | software | healthy | rust/utils/src/deser.rs |
| retryable.rs | software | healthy | rust/utils/src/retryable.rs |
Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.
| Label | Layer | Status | Path |
|---|---|---|---|
| benchmarks | software | healthy | benchmarks |
| file_summarization | software | healthy | benchmarks/file_summarization |
| rust | software | healthy | benchmarks/file_summarization/rust |
| src | software | healthy | benchmarks/file_summarization/rust/src |
| python | software | healthy | benchmarks/file_summarization/python |
| docs | software | healthy | docs |
| src | software | healthy | docs/src |
| components | software | healthy | docs/src/components |
| diagrams | software | healthy | docs/src/components/diagrams |
| content | software | healthy | docs/src/content |
| example-posts | software | healthy | docs/src/content/example-posts |
| styles | software | healthy | docs/src/styles |
| rust | software | healthy | rust |
| ops_text | software | healthy | rust/ops_text |
| src | software | healthy | rust/ops_text/src |
| split | software | healthy | rust/ops_text/src/split |
| utils | software | healthy | rust/utils |
| src | software | healthy | rust/utils/src |
| py | software | healthy | rust/py |
| src | software | healthy | rust/py/src |
| py_utils | software | healthy | rust/py_utils |
| src | software | healthy | rust/py_utils/src |
| core | software | healthy | rust/core |
| tests | software | healthy | rust/core/tests |
| src | software | healthy | rust/core/src |
| state | software | healthy | rust/core/src/state |
| inspect | software | healthy | rust/core/src/inspect |
| engine | software | healthy | rust/core/src/engine |
| telemetry | software | healthy | rust/core/src/telemetry |
| cocoindex | software | healthy | rust/cocoindex |
| src | software | healthy | rust/cocoindex/src |
| llm | software | healthy | rust/cocoindex/src/llm |
| sdk | software | healthy | rust/sdk |
| cocoindex_macros | software | healthy | rust/sdk/cocoindex_macros |
| src | software | healthy | rust/sdk/cocoindex_macros/src |
| cocoindex | software | healthy | rust/sdk/cocoindex |
| tests | software | healthy | rust/sdk/cocoindex/tests |
| src | software | healthy | rust/sdk/cocoindex/src |
| benches | software | healthy | rust/sdk/cocoindex/benches |
| dev | software | healthy | dev |
| examples | software | healthy | examples |
| amazon_s3_embedding | software | healthy | examples/amazon_s3_embedding |
| multi_codebase_summarization | software | healthy | examples/multi_codebase_summarization |
| image_search | software | healthy | examples/image_search |
| frontend | software | healthy | examples/image_search/frontend |
| src | software | healthy | examples/image_search/frontend/src |
| text_embedding_turbopuffer | software | healthy | examples/text_embedding_turbopuffer |
| markdown_files | software | healthy | examples/text_embedding_turbopuffer/markdown_files |
| gdrive_text_embedding | software | healthy | examples/gdrive_text_embedding |
| hn_trending_topics | software | healthy | examples/hn_trending_topics |
Showing first 50 of this kind. Full payload available via the JSON button at the top of the page.
| Label | Layer | Status | Path |
|---|---|---|---|
| .env in repo | security | healthy | examples/multi_codebase_summarization/.env |
| .env in repo | security | healthy | examples/image_search/.env |
| .env in repo | security | healthy | examples/gdrive_text_embedding/.env |
| .env in repo | security | healthy | examples/hn_trending_topics/.env |
| .env in repo | security | healthy | examples/files_transform/.env |
| .env in repo | security | healthy | examples/text_embedding_lancedb/.env |
| .env in repo | security | healthy | examples/paper_metadata/.env |
| .env in repo | security | healthy | examples/code_embedding_lancedb/.env |
| .env in repo | security | healthy | examples/pdf_to_markdown/.env |
| .env in repo | security | healthy | examples/text_embedding/.env |
| .env in repo | security | healthy | examples/postgres_source/.env |
| .env in repo | security | healthy | examples/patient_intake_extraction_baml/.env |
| .env in repo | security | healthy | examples/entire_session_search/.env |
| .env in repo | security | healthy | examples/text_embedding_qdrant/.env |
| .env in repo | security | healthy | examples/code_embedding/.env |
| .env in repo | security | healthy | examples/image_search_colpali/.env |
| .env in repo | security | healthy | examples/patient_intake_extraction_dspy/.env |
| .env in repo | security | healthy | examples/pdf_embedding/.env |
| .env in repo | security | healthy | examples/audio_to_text/.env |
| password_literal::python/tests/connectors/test_neo4j_target… | security | healthy | python/tests/connectors/test_neo4j_target.py |
| Label | Layer | Status | Path |
|---|---|---|---|
| deploy | cicd | healthy | .github/workflows/_docs_release.yml |
| test-build | cicd | healthy | .github/workflows/docs_test.yml |
| e2e-type-check | cicd | healthy | .github/workflows/e2e_type_check.yml |
| link-check | cicd | healthy | .github/workflows/links.yml |
| generate-3p-notices | cicd | healthy | .github/workflows/release.yml |
| build | cicd | healthy | .github/workflows/release.yml |
| test-abi3 | cicd | healthy | .github/workflows/release.yml |
| test-free-threaded | cicd | healthy | .github/workflows/release.yml |
| sdist | cicd | healthy | .github/workflows/release.yml |
| release | cicd | healthy | .github/workflows/release.yml |
| release-docs | cicd | healthy | .github/workflows/release.yml |
| build-test | cicd | healthy | .github/workflows/CI.yml |
| deploy | cicd | healthy | .github/workflows/docs_release.yml |
| fast-check | cicd | healthy | .github/workflows/fast_check.yml |
| build-test | cicd | healthy | .github/workflows/_test.yml |
| validate-3p-notices | cicd | healthy | .github/workflows/_test.yml |
| autoupdate | cicd | healthy | .github/workflows/prek_autoupdate.yml |
| Label | Layer | Status | Path |
|---|---|---|---|
| gha::_docs_release | cicd | healthy | .github/workflows/_docs_release.yml |
| gha::docs_test | cicd | healthy | .github/workflows/docs_test.yml |
| gha::e2e_type_check | cicd | healthy | .github/workflows/e2e_type_check.yml |
| gha::links | cicd | healthy | .github/workflows/links.yml |
| gha::release | cicd | healthy | .github/workflows/release.yml |
| gha::CI | cicd | healthy | .github/workflows/CI.yml |
| gha::docs_release | cicd | healthy | .github/workflows/docs_release.yml |
| gha::fast_check | cicd | healthy | .github/workflows/fast_check.yml |
| gha::_test | cicd | healthy | .github/workflows/_test.yml |
| gha::prek_autoupdate | cicd | healthy | .github/workflows/prek_autoupdate.yml |
| Label | Layer | Status | Path |
|---|---|---|---|
| postgres | data | healthy | pyproject.toml |
| sqlite | data | healthy | pyproject.toml |
| redis | data | healthy | docs/package-lock.json |
| postgresql | data | healthy | examples/hn_trending_topics/models.py |
| mysql | data | healthy | python/tests/connectors/test_doris_target.py |
| Label | Layer | Status | Path |
|---|---|---|---|
| port:7474 | network | healthy | dev/neo4j.yaml |
| port:7687 | network | healthy | dev/neo4j.yaml |
| port:5432 | network | healthy | dev/postgres.yaml |
| port:5173 | network | healthy | examples/image_search/frontend/vite.config.js |
| Label | Layer | Status | Path |
|---|---|---|---|
| vps::azure | hardware | healthy | docs/package-lock.json |
| vps::aws | hardware | healthy | rust/utils/Cargo.toml |
| vps::gcp | hardware | healthy | examples/text_embedding_turbopuffer/.env.example |
| Label | Layer | Status | Path |
|---|---|---|---|
| source_products | data | healthy | examples/postgres_source/prepare_source_data.sql |
| IF | data | healthy | python/cocoindex/connectors/doris/_target.py |
| Label | Layer | Status | Path |
|---|---|---|---|
| App | frontend | healthy | examples/image_search/frontend/src/App.jsx |
| App | frontend | healthy | examples/image_search_colpali/frontend/src/App.jsx |
| Label | Layer | Status | Path |
|---|---|---|---|
| repobility-clone-5ktct4ua | software | healthy | /tmp/repobility-clone-5ktct4ua |
| Label | Layer | Status | Path |
|---|---|---|---|
| GET /search | api | healthy | examples/image_search/api.py |
| Label | Layer | Status | Path |
|---|---|---|---|
| kafka | data | healthy | pyproject.toml |
| Label | Layer | Status | Path |
|---|---|---|---|
| 127.0.0.1 | network | healthy | rust/core/src/telemetry/mod.rs |
| Label | Layer | Status | Path |
|---|---|---|---|
| gpu (detected) | hardware | healthy | examples/pdf_embedding/main.py |
| Label | Layer | Status | Path |
|---|---|---|---|
| GH_PAGES_DEPLOY | cicd | healthy | — |
This page is publicly accessible at:
https://repobility.com/scan/ecebda09-0cac-4f20-b198-ea898eaf9be2/
To check status programmatically (no auth required):
curl -s https://repobility.com/api/v1/public/scan/ecebda09-0cac-4f20-b198-ea898eaf9be2/
Important: please don't re-submit the same URL repeatedly. The submission endpoint is idempotent: re-submitting the same git URL returns this same scan_token, not a new one. To re-scan this repo, sign up free and use the dashboard.
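The status check above can also be scripted. The sketch below builds the public status URL from a scan token and fetches it with Python's standard library. It assumes the endpoint returns a JSON body, which this page does not confirm, and the helper names `status_url` and `fetch_status` are illustrative only. It only reads status and does not re-submit the repository.

```python
import json
import urllib.request

# URL pattern taken from the curl command on this page.
BASE_URL = "https://repobility.com/api/v1/public/scan"
SCAN_TOKEN = "ecebda09-0cac-4f20-b198-ea898eaf9be2"


def status_url(scan_token: str) -> str:
    """Build the public, unauthenticated status URL for a scan token."""
    return f"{BASE_URL}/{scan_token}/"


def fetch_status(scan_token: str, timeout: float = 10.0):
    """GET the status endpoint once.

    Assumption: the response body is JSON; its fields are not documented here,
    so the parsed object is returned as-is.
    """
    with urllib.request.urlopen(status_url(scan_token), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Print the URL only; call fetch_status(SCAN_TOKEN) to perform the request.
print(status_url(SCAN_TOKEN))
```

Calling `fetch_status(SCAN_TOKEN)` performs a single read-only request, in line with the note above about not re-submitting the same URL.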