Public scan — anyone with this URL can view this analysis. Sign up to track your own repos privately, run scheduled re-scans, and get AI fix prompts via your dashboard.
126 of your 215 findings came from Repobility's proprietary detections. ✓ Repobility tags below mark them.

Scan timing: clone 7.34s · analysis 17.45s · 17.9 MB · GitHub API rate-limit (preflight)

deepset-ai/haystack-core-integrations

https://github.com/deepset-ai/haystack-core-integrations · scanned 2026-06-05 13:26 UTC (5 days, 6 hours ago) · 10 languages

642 raw signals (198 security + 444 graph) 17th percentile · Python · large (100-500K LoC) System graph score 79 (lower by 16)

UNIFIED Repobility · multi-layer engine · AI coders

Complete repo analysis

Last scanned 5 days, 6 hours ago · v2 · 206 actionable findings from 2 signal sources. 210 repeated signals grouped for readability. Security checks, system graph analysis, and verified AI-agent feedback are merged into one review queue.

JSON
Score breakdown â 2026-05-18-v5
Component Sub-score Weight Contribution
structure_score 40.0 0.15 6.00
security_score 37.0 0.25 9.25
testing_score 100.0 0.20 20.00
documentation_score 90.0 0.15 13.50
practices_score 74.0 0.15 11.10
code_quality 31.5 0.10 3.15
Overall 1.00 63.0
Severity distribution — click a segment to filter
Active filters: excluding tests × Reset all
Scan summary Quality grade C+ (63/100). Dimensions: security 37, maintainability 40. 198 findings (39 security). 195,457 lines analyzed.

Showing 135 of 206 actionable findings. 416 raw detector signals were grouped into reader-sized issues. Click TP / FP to vote on a finding's accuracy — votes adjust the confidence weighting and improve detection across the platform.

critical Security checks cicd CI/CD security conf 0.96 4 occurrences Compose service contains a literal secret environment value
Literal secrets in Compose files are committed to source and exposed through container inspection.
3 files, 4 locations
integrations/supabase/docker-compose.yml:1, 15 (2 hits)
integrations/opensearch/docker-compose.yml:1
integrations/oracle/docker-compose.yml:1
CI/CD securitycontainers
critical Security checks security secrets conf 0.95 Discovered a potential authorization token provided in a curl command header, which could compromise the curl accessed resource.
Gitleaks detected a committed secret or credential pattern.
.github/workflows/supabase.yml:137
high Security checks quality Quality conf 1.00 ✓ Repobility 3 occurrences Missing import: `operator` used but not imported
The file uses `operator.something(...)` but never imports `operator`. This raises NameError at runtime the first time the line executes.
3 files, 3 locations
integrations/arcadedb/src/haystack_integrations/document_stores/arcadedb/filters.py:29
integrations/cohere/src/haystack_integrations/components/embedders/cohere/embedding_types.py:33
integrations/falkordb/src/haystack_integrations/document_stores/falkordb/document_store.py:828
critical Security checks security secrets conf 0.95 Uncovered a JSON Web Token, which may lead to unauthorized access to web applications and sensitive user data.
Gitleaks detected a committed secret or credential pattern.
.github/workflows/supabase.yml:137
low Security checks cicd CI/CD security conf 0.35 ✓ Repobility 25 occurrences Workflow references repository secrets in a pull_request workflow
Fork pull_request runs do not receive normal repository secrets on GitHub Actions. Review this as a reliability/intent signal, not as direct fork-secret exfiltration. Raise severity only for pull_request_target or another trusted-context path that runs untrusted PR code with secrets.
12 files, 22 locations
.github/workflows/arcadedb.yml:30, 70, 147 (3 hits)
.github/workflows/mcp.yml:32, 33, 155 (3 hits)
.github/workflows/nvidia.yml:32, 33, 145 (3 hits)
.github/workflows/paddleocr.yml:32, 33, 143 (3 hits)
.github/workflows/cohere.yml:32, 142 (2 hits)
.github/workflows/firecrawl.yml:28, 137 (2 hits)
.github/workflows/amazon_sagemaker.yml:141
.github/workflows/falkordb.yml:154
CI/CD securityworkflow secretsGitHub Actions
critical System graph security Secrets conf 1.00 Possible secret in integrations/alloydb/src/haystack_integrations/document_stores/alloydb/document_store.py
Detected pattern matching password_literal. Rotate the credential and move to a secret manager.
integrations/alloydb/src/haystack_integrations/document_stores/alloydb/document_store.py:91
critical System graph security Secrets conf 1.00 Possible secret in integrations/snowflake/src/haystack_integrations/components/retrievers/snowflake/auth.py
Detected pattern matching password_literal. Rotate the credential and move to a secret manager.
integrations/snowflake/src/haystack_integrations/components/retrievers/snowflake/auth.py:25
critical System graph security Secrets conf 1.00 Possible secret in integrations/snowflake/src/haystack_integrations/components/retrievers/snowflake/snowflake_table_retriever.py
Detected pattern matching password_literal. Rotate the credential and move to a secret manager.
integrations/snowflake/src/haystack_integrations/components/retrievers/snowflake/snowflake_table_retriever.py:238
critical System graph security Secrets conf 1.00 Possible secret in integrations/weaviate/src/haystack_integrations/document_stores/weaviate/auth.py
Detected pattern matching password_literal. Rotate the credential and move to a secret manager.
integrations/weaviate/src/haystack_integrations/document_stores/weaviate/auth.py:27
high Security checks quality Quality conf 1.00 ✓ Repobility 22 occurrences `self._client` used but never assigned in __init__
Method `_ensure_initialized` of class `ChromaDocumentStore` reads `self._client`, but no assignment to it exists in __init__ (and no class-level fallback). This raises AttributeError the first time the method runs against an instance.
lines 141, 189, 398, 410, 426, 432, 446, 452, +14 more
integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:141, 189, 398, 410, 426, 432, 446, 452, +14 more (22 hits)
high Security checks cicd CI/CD security conf 0.90 Database service has no persistent data volume
Database containers store data in the writable container layer unless a volume or bind mount is attached to the image's data directory. Recreating the container can lose state.
integrations/opensearch/docker-compose.yml:1 CI/CD securitycontainers
high Security checks cicd CI/CD security conf 0.90 Database service has no persistent data volume
Database containers store data in the writable container layer unless a volume or bind mount is attached to the image's data directory. Recreating the container can lose state.
integrations/elasticsearch/docker-compose.yml:1 CI/CD securitycontainers
high Security checks cicd CI/CD security conf 0.84 3 occurrences Database service publishes a host port
Publishing database ports to the host increases exposure. Internal Compose networking usually only needs expose, not ports.
3 files, 3 locations
integrations/elasticsearch/docker-compose.yml:1
integrations/opensearch/docker-compose.yml:1
integrations/oracle/docker-compose.yml:1
CI/CD securitycontainers
high Security checks software dependencies conf 0.90 ✓ Repobility Dockerfile FROM `postgres:17-bookworm` not pinned by digest
`FROM postgres:17-bookworm` resolves the tag at build time. The registry CAN re-push a different image for the same tag, so every build is potentially different. Production images should pin to `image@sha256:...` for reproducibility + supply-chain integrity.
integrations/supabase/Dockerfile.pgroonga:1
high Security checks software dependencies conf 0.90 ✓ Repobility 5 occurrences Workflow container/services image `arcadedata/arcadedb:latest` unpinned
`container/services image: arcadedata/arcadedb:latest` without `@sha256:...` pulls a mutable tag at workflow-run time. Treat workflow container references with the same supply-chain discipline as Dockerfile FROM lines.
5 files, 5 locations
.github/workflows/arcadedb.yml:67
.github/workflows/falkordb.yml:64
.github/workflows/pgvector.yml:64
.github/workflows/supabase.yml:65
.github/workflows/valkey.yml:65
high System graph cicd CI/CD security conf 1.00 pull_request_target workflow appears to check out untrusted PR code
pull_request_target runs with base-repo privileges. Checking out PR head code in that context can expose repository tokens or secrets to attacker-controlled code.
.github/workflows/CI_docstring_labeler.yml CI/CD securitySupply chainGithub actions
medium Security checks quality Quality conf 1.00 [SEC125] AI placeholder credential left in source (your-api-key-here style): AI coding assistants frequently emit placeholder credentials shaped like `API_KEY = "your-api-key-here"` instead of pulling from env. These get committed verbatim — production code with a literal placeholder string is a near-certain bug, and the value also leaks what credential type the system expects to authentication crawlers. CWE-1188. Distinctive AI footprint: the exact phrase shape `your-X-here` is uncommon in hand
Replace with env lookup: `API_KEY = os.environ['SERVICE_API_KEY']`. Move actual key to a secret manager. Add a startup check that the env var is non-empty so missing config fails loudly instead of shipping the placeholder.
integrations/huggingface_api/src/haystack_integrations/components/embedders/huggingface_api/text_embedder.py:44
low Security checks quality Error handling conf 0.55 ✓ Repobility 3 occurrences Broad exception handler needs review
This handler catches Exception/BaseException. It is actionable when it swallows errors without logging, re-raising, or returning a structured error. Handlers that intentionally convert exceptions into typed error results should not be treated as high risk.
2 files, 3 locations
integrations/weaviate/src/haystack_integrations/document_stores/weaviate/document_store.py:1360, 1454 (2 hits)
integrations/mistral/src/haystack_integrations/components/converters/mistral/ocr_document_converter.py:428
Error handlingquality
medium Security checks cicd CI/CD security conf 0.94 Compose service `falkordb` image uses the latest tag
The latest tag is mutable and can change without a code review, producing different images from the same source.
integrations/falkordb/docker-compose.yml:1 CI/CD securitycontainers
medium Security checks cicd CI/CD security conf 0.84 Database data bind mount is inside the Docker build context
Keeping live database files under the repository/build context can leak data into Docker builds, slow context loading, and make accidental commits more likely.
integrations/oracle/docker-compose.yml:1 CI/CD securitycontainers
medium Security checks cicd CI/CD security conf 0.74 Database service has no persistent data volume
Database containers store data in the writable container layer unless a volume or bind mount is attached to the image's data directory. Recreating the container can lose state.
integrations/supabase/docker-compose.yml:1 CI/CD securitycontainers
medium Security checks cicd CI/CD security conf 0.90 Docker build context has no .dockerignore
Without .dockerignore, build context can include source history, local env files, dependencies, and generated artifacts.
.dockerignore CI/CD securitycontainers
high Security checks cicd CI/CD security conf 0.82 Docker final stage has no non-root USER
Docker images run as root unless the image or Dockerfile switches to a non-root user.
integrations/supabase/Dockerfile.pgroonga:1 CI/CD securitycontainers
medium Security checks software dependencies conf 0.90 Python package `openai` is 1 major version(s) behind (1.6.1 -> 2.41.0)
`openai==1.6.1` is 1 major version(s) behind the latest stable release on PyPI (2.41.0). Pinned-but-stale Python dependencies drift away from upstream security and bugfix releases. This is the version-currency signal Dependabot raises.
integrations/astra/examples/requirements.txt:3
medium Security checks software dependencies conf 0.90 Python package `sentence_transformers` is 3 major version(s) behind (2.2.2 -> 5.5.1)
`sentence_transformers==2.2.2` is 3 major version(s) behind the latest stable release on PyPI (5.5.1). Pinned-but-stale Python dependencies drift away from upstream security and bugfix releases. This is the version-currency signal Dependabot raises.
integrations/astra/examples/requirements.txt:2
high Security checks software dependencies conf 0.70 Remote install command pipes network code directly to a shell
Agent helper projects often publish one-line installers. `curl | sh` style commands are convenient, but they bypass review unless the script is pinned, signed, or checksum-verified.
.github/workflows/ollama.yml:75
high Security checks software dependencies conf 0.70 Remote install command pipes network code directly to a shell
Agent helper projects often publish one-line installers. `curl | sh` style commands are convenient, but they bypass review unless the script is pinned, signed, or checksum-verified.
.github/workflows/llama_stack.yml:72
medium Security checks software dependencies conf 0.90 ✓ Repobility 4 occurrences requirements.txt: `haystack-ai` has no version pin
Unpinned pip requirement means every fresh install may resolve a different version. Newer releases can introduce malicious code (typosquats, account compromises). Reproducible installs need exact pins.
2 files, 4 locations
integrations/langfuse/example/requirements.txt:1, 2, 3 (3 hits)
integrations/astra/examples/requirements.txt:1
medium System graph cicd CI/CD security conf 1.00 72 occurrences GitHub Actions workflow grants broad write permissions
CI tokens with write permissions increase blast radius when an action, dependency, or PR workflow is compromised. Prefer job-level least-privilege permissions.
12 files, 12 locations
.github/workflows/CI_pypi_release.yml
.github/workflows/aimlapi.yml
.github/workflows/alloydb.yml
.github/workflows/amazon_bedrock.yml
.github/workflows/amazon_sagemaker.yml
.github/workflows/amazon_textract.yml
.github/workflows/anthropic.yml
.github/workflows/arcadedb.yml
CI/CD securitySupply chainGithub actions
medium System graph quality Integrity conf 1.00 Network/subprocess call without timeout or try/except — integrations/google_ai/src/haystack_integrations/components/generators/google_ai/gemini.py:54
`requests.get(...)` here lacks both a `timeout=` arg and an enclosing try/except. This is exactly the class of bug that took down our git-clone earlier (HTTP/2 stream cancel surfaced as a fatal). Add a `timeout=` and wrap in try/except, or use a wrapper that retries.
runtime safetyRobustness
medium System graph quality Integrity conf 1.00 Network/subprocess call without timeout or try/except — integrations/google_vertex/src/haystack_integrations/components/generators/google_vertex/captioner.py:31
`requests.get(...)` here lacks both a `timeout=` arg and an enclosing try/except. This is exactly the class of bug that took down our git-clone earlier (HTTP/2 stream cancel surfaced as a fatal). Add a `timeout=` and wrap in try/except, or use a wrapper that retries.
runtime safetyRobustness
medium System graph quality Integrity conf 1.00 Network/subprocess call without timeout or try/except — integrations/libreoffice/src/haystack_integrations/components/converters/libreoffice/converter.py:298
`subprocess.run(...)` here lacks both a `timeout=` arg and an enclosing try/except. This is exactly the class of bug that took down our git-clone earlier (HTTP/2 stream cancel surfaced as a fatal). Add a `timeout=` and wrap in try/except, or use a wrapper that retries.
runtime safetyRobustness
high Security checks cicd CI/CD security conf 0.56 5 occurrences Compose service does not declare a runtime user
If the image does not define USER internally, this service may run as root.
5 files, 5 locations
integrations/falkordb/docker-compose.yml:1
integrations/oracle/docker-compose.yml:1
integrations/supabase/docker-compose.yml:1
integrations/vespa/docker-compose.yml:1
integrations/weaviate/docker-compose.yml:1
CI/CD securitycontainers
high Security checks cicd CI/CD security conf 0.62 4 occurrences Compose service lacks no-new-privileges hardening
no-new-privileges prevents processes from gaining additional privileges through setuid binaries or file capabilities.
4 files, 4 locations
integrations/falkordb/docker-compose.yml:1
integrations/oracle/docker-compose.yml:1
integrations/vespa/docker-compose.yml:1
integrations/weaviate/docker-compose.yml:1
CI/CD securitycontainers
low Security checks cicd CI/CD security conf 0.72 Dockerfile installs recommended OS packages
Installing recommended packages often pulls in unnecessary runtime surface area.
integrations/supabase/Dockerfile.pgroonga:3 CI/CD securitycontainers
low Security checks quality Quality conf 0.60 30 occurrences Duplicated implementation block across source files
Duplicate implementation blocks are maintenance debt. Keep them visible, but they are not a high-severity defect unless the duplicated logic is security-sensitive or drifting.
12 files, 19 locations
integrations/chonkie/src/haystack_integrations/components/preprocessors/chonkie/token_splitter.py:60, 80, 82 (3 hits)
integrations/amazon_bedrock/src/haystack_integrations/components/generators/amazon_bedrock/generator.py:149, 254 (2 hits)
integrations/anthropic/src/haystack_integrations/components/generators/anthropic/chat/vertex_chat_generator.py:87, 97 (2 hits)
integrations/chonkie/src/haystack_integrations/components/preprocessors/chonkie/sentence_splitter.py:82, 102 (2 hits)
integrations/elasticsearch/src/haystack_integrations/components/retrievers/elasticsearch/inference_sparse_retriever.py:67, 73 (2 hits)
integrations/elasticsearch/src/haystack_integrations/components/retrievers/elasticsearch/sparse_embedding_retriever.py:74, 78 (2 hits)
integrations/alloydb/src/haystack_integrations/components/retrievers/alloydb/keyword_retriever.py:60
integrations/amazon_bedrock/src/haystack_integrations/components/embedders/amazon_bedrock/document_image_embedder.py:41
duplicationquality
low System graph hardware Coverage conf 1.00 Containers defined but no K8s/orchestration manifest found
Repo has Dockerfiles/compose but no Kubernetes/Nomad manifests. If the target deployment is K8s, the manifests may live in a separate ops repo.
Deployment
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/alloydb/examples/embedding_retrieval.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/alloydb/examples/hybrid_retrieval.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/amazon_bedrock/examples/bedrock_ranker_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/amazon_bedrock/examples/chatgenerator_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/amazon_bedrock/examples/embedders_generator_with_rag_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/amazon_bedrock/examples/s3_downloader_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/amazon_textract/examples/analyze_document_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/amazon_textract/examples/queries_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/amazon_textract/examples/text_extraction_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/arcadedb/examples/embedding_retrieval.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/chroma/example/example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/cohere/examples/cohere_embedding.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/cohere/examples/cohere_generation.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/cohere/examples/cohere_ranker.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/deepeval/example/example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/docling/src/docling_haystack/converter.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/github/src/haystack_integrations/prompts/github/context_prompt.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/github/src/haystack_integrations/prompts/github/file_editor_prompt.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/github/src/haystack_integrations/prompts/github/issue_commenter_prompt.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/github/src/haystack_integrations/prompts/github/issue_viewer_prompt.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/github/src/haystack_integrations/prompts/github/pr_creator_prompt.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/github/src/haystack_integrations/prompts/github/repo_forker_prompt.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/github/src/haystack_integrations/prompts/github/repo_viewer_prompt.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/github/src/haystack_integrations/prompts/github/system_prompt.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/google_genai/examples/chatgenerator_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/llama_cpp/examples/llama_cpp_chat_generator_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/llama_cpp/examples/rag_pipeline_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/meta_llama/examples/rag_with_llama.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/mistral/examples/indexing_pipeline.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/mistral/examples/streaming_chat_with_rag.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/mongodb_atlas/examples/embedding_retrieval.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/mongodb_atlas/examples/hybrid_retrieval.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/nvidia/examples/chat_generator_with_structured_outputs.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/opensearch/src/haystack_integrations/document_stores/opensearch/opensearch_scripts.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/optimum/example/example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/oracle/src/haystack_integrations/document_stores/oracle/__about__.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/pgvector/examples/embedding_retrieval.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/pgvector/examples/hybrid_retrieval.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/pinecone/examples/example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/presidio/src/haystack_integrations/components/common/presidio/utils.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/pyversity/examples/pipeline.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/pyversity/examples/standalone.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/ragas/example/evaluation_from_pipeline_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/ragas/example/evaluation_with_components_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/snowflake/example/text2sql_example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/stackit/examples/chat_as_component.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/stackit/examples/chat_as_pipeline.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/stackit/examples/streaming_chat_with_rag.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/valkey/examples/basic_usage.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph software Dead code candidate conf 1.00 File has no detected symbols: integrations/valkey/examples/example.py
Source file with no class/function declarations — possible config, dead code, or scratch file.
low System graph quality Integrity conf 1.00 Near-duplicate function bodies in 111 places
Functions with the same first-5-line body hash: integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:to_dict, integrations/chroma/src/haystack_integrations/components/retrievers/chroma/retriever.py:to_dict, integrations/chroma/src/haystack_integrations/components/r…
duplicatesduplication
low System graph quality Integrity conf 1.00 Near-duplicate function bodies in 14 places
Functions with the same first-5-line body hash: integrations/nvidia/src/haystack_integrations/components/embedders/nvidia/text_embedder.py:from_dict, integrations/nvidia/src/haystack_integrations/components/embedders/nvidia/document_embedder.py:from_dict, integrations/libreoffice/src/haystack_integ…
duplicatesduplication
low System graph quality Integrity conf 1.00 Near-duplicate function bodies in 15 places
Functions with the same first-5-line body hash: integrations/mistral/src/haystack_integrations/components/generators/mistral/chat/chat_generator.py:to_dict, integrations/stackit/src/haystack_integrations/components/generators/stackit/chat/chat_generator.py:to_dict, integrations/nvidia/src/haystack_…
duplicatesduplication
low System graph quality Integrity conf 1.00 8 occurrences Near-duplicate function bodies in 2 places
Functions with the same first-5-line body hash: integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:delete_documents, integrations/mongodb_atlas/src/haystack_integrations/document_stores/mongodb_atlas/document_store.py:delete_documents This is *the* AI-coder fail…
8 occurrences
repo-level (8 hits)
duplicatesduplication
low System graph quality Integrity conf 1.00 Near-duplicate function bodies in 3 places
Functions with the same first-5-line body hash: integrations/nvidia/src/haystack_integrations/components/embedders/nvidia/text_embedder.py:default_model, integrations/nvidia/src/haystack_integrations/components/embedders/nvidia/document_embedder.py:default_model, integrations/nvidia/src/haystack_in…
duplicatesduplication
low System graph quality Integrity conf 1.00 3 occurrences Near-duplicate function bodies in 4 places
Functions with the same first-5-line body hash: integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:get_metadata_field_unique_values, integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:get_metadata_field_unique_values_async, int…
3 occurrences
repo-level (3 hits)
duplicatesduplication
low System graph quality Integrity conf 1.00 2 occurrences Near-duplicate function bodies in 6 places
Functions with the same first-5-line body hash: integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:count_documents_by_filter, integrations/alloydb/src/haystack_integrations/document_stores/alloydb/document_store.py:count_documents_by_filter, integrations/pgvector…
2 occurrences
repo-level (2 hits)
duplicatesduplication
low System graph quality Integrity conf 1.00 Near-duplicate function bodies in 7 places
Functions with the same first-5-line body hash: integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:write_documents, integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:write_documents_async, integrations/arcadedb/src/haystack_in…
duplicatesduplication
low System graph quality Integrity conf 1.00 Near-duplicate function bodies in 89 places
Functions with the same first-5-line body hash: integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:from_dict, integrations/chroma/src/haystack_integrations/components/retrievers/chroma/retriever.py:from_dict, integrations/chroma/src/haystack_integrations/componen…
duplicatesduplication
low System graph quality Integrity conf 1.00 Near-duplicate function bodies in 9 places
Functions with the same first-5-line body hash: integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:delete_by_filter, integrations/alloydb/src/haystack_integrations/document_stores/alloydb/document_store.py:delete_by_filter, integrations/mongodb_atlas/src/haystack…
duplicatesduplication
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `data_copy` in integrations/mcp/src/haystack_integrations/tools/mcp/mcp_tool.py:639
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `docs_copy` in integrations/optimum/tests/test_optimum_document_embedder.py:364
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `Splade_PP_en_v1` in integrations/fastembed/src/haystack_integrations/components/embedders/fastembed/fastembed_sparse_document_embedder.py:27
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `Splade_PP_en_v1` in integrations/fastembed/src/haystack_integrations/components/embedders/fastembed/fastembed_sparse_text_embedder.py:29
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `Splade_PP_en_v1` in integrations/fastembed/tests/test_fastembed_sparse_document_embedder.py:22
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `Splade_PP_en_v1` in integrations/fastembed/tests/test_fastembed_sparse_text_embedder.py:22
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `test_invert_condition_without_operator_returns_copy` in integrations/weaviate/tests/test_filters.py:49
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `test_run_paths_deprecated` in integrations/docling/tests/test_converter.py:272
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph quality Integrity conf 1.00 Old/deprecated-named symbol `test_url_validation_not_ending_with_v1` in integrations/nvidia/tests/test_utils.py:28
Names with suffixes like `_old`, `_v1`, `_deprecated` usually indicate replaced-but-not-removed code (typical AI-coder leftover). Confirm and delete, or rename if it's the active version.
old markerDead code
low System graph software Dead code conf 1.00 Possibly dead Python function: create_hnsw_index_async
No callers detected by AST scan in this repo. Could be exported for external callers or a framework handler.
integrations/oracle/src/haystack_integrations/document_stores/oracle/document_store.py:291
low System graph software Dead code conf 1.00 Possibly dead Python function: create_keyword_index
No callers detected by AST scan in this repo. Could be exported for external callers or a framework handler.
integrations/oracle/src/haystack_integrations/document_stores/oracle/document_store.py:260
low System graph software Dead code conf 1.00 11 occurrences Possibly dead Python function: delete_by_filter_async
No callers detected by AST scan in this repo. Could be exported for external callers or a framework handler.
11 files, 11 locations
integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py:764
integrations/elasticsearch/src/haystack_integrations/document_stores/elasticsearch/document_store.py:1006
integrations/mongodb_atlas/src/haystack_integrations/document_stores/mongodb_atlas/document_store.py:739
integrations/opensearch/src/haystack_integrations/document_stores/opensearch/document_store.py:901
integrations/oracle/src/haystack_integrations/document_stores/oracle/document_store.py:612
integrations/pgvector/src/haystack_integrations/document_stores/pgvector/document_store.py:1088
integrations/pinecone/src/haystack_integrations/document_stores/pinecone/document_store.py:441
integrations/qdrant/src/haystack_integrations/document_stores/qdrant/document_store.py:560
low System graph software Dead code conf 1.00 Possibly dead Python function: invoke_tool
No callers detected by AST scan in this repo. Could be exported for external callers or a framework handler.
integrations/mcp/src/haystack_integrations/tools/mcp/mcp_toolset.py:338
low System graph software Dead code conf 1.00 Possibly dead Python function: store
No callers detected by AST scan in this repo. Could be exported for external callers or a framework handler.
integrations/mem0/src/haystack_integrations/tools/mem0/writer_tool.py:114
low System graph software Dead code conf 1.00 Possibly dead Python function: stream_callback
No callers detected by AST scan in this repo. Could be exported for external callers or a framework handler.
integrations/anthropic/example/prompt_caching.py:28
low System graph quality Complexity conf 1.00 Very large file: integrations/alloydb/src/haystack_integrations/document_stores/alloydb/document_store.py (1312 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/amazon_bedrock/tests/test_chat_generator.py (1586 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/amazon_bedrock/tests/test_chat_generator_utils.py (2141 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/amazon_bedrock/tests/test_generator.py (2037 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/anthropic/tests/test_chat_generator.py (1618 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/chroma/src/haystack_integrations/document_stores/chroma/document_store.py (1451 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/elasticsearch/src/haystack_integrations/document_stores/elasticsearch/document_store.py (2030 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/elasticsearch/tests/test_document_store.py (1380 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/google_genai/tests/test_chat_generator_utils.py (1219 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/huggingface_api/tests/test_chat_generator.py (1724 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/llama_cpp/tests/test_chat_generator.py (1466 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/mcp/src/haystack_integrations/tools/mcp/mcp_tool.py (1385 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/mistral/tests/test_mistral_chat_generator.py (1278 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/mongodb_atlas/src/haystack_integrations/document_stores/mongodb_atlas/document_store.py (1249 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/ollama/tests/test_chat_generator.py (1661 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/opensearch/src/haystack_integrations/document_stores/opensearch/document_store.py (2241 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/opensearch/tests/test_document_store.py (1811 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/pgvector/src/haystack_integrations/document_stores/pgvector/document_store.py (2038 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/qdrant/src/haystack_integrations/document_stores/qdrant/document_store.py (2589 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/valkey/src/haystack_integrations/document_stores/valkey/document_store.py (1568 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/valkey/tests/test_document_store.py (1342 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/weaviate/src/haystack_integrations/document_stores/weaviate/document_store.py (1617 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
low System graph quality Complexity conf 1.00 Very large file: integrations/weaviate/tests/test_document_store.py (1283 lines)
Files with >800 lines often hide complexity hotspots and discourage tests.
For AI agents: Voting guide (TP/FP) MCP manifest Stdio wrapper SARIF Integrate Findings queue Vote TP/FP on findings to calibrate the engine.
For AI agents + API integrations
Email me when this repo regresses
Free. We re-scan periodically; new criticals → your inbox. No signup required for the scan itself.
API access

This page is publicly accessible at: https://repobility.com/scan/45850fae-6fe7-4e53-90fd-1a07f571b585/

To check status programmatically (no auth required):

curl -s https://repobility.com/api/v1/public/scan/45850fae-6fe7-4e53-90fd-1a07f571b585/

Important — please don't re-submit the same URL repeatedly. The submission endpoint is idempotent: re-submitting the same git URL returns this same scan_token, not a new one. To re-scan this repo, sign up free and use the dashboard.