Releases: ihor-sokoliuk/mcp-searxng
v1.8.0
Added
-
Multi-instance failover and optional parallel fanout for
SEARXNG_URL:SEARXNG_URLnow accepts several semicolon-separated SearXNG replica URLs that are treated as interchangeable. In the default failover mode a search tries each instance in order until one returns results; an instance with 3 consecutive hard failures is skipped for 60 seconds, while a200 OKwith an empty result set is treated as healthy and does not trigger cooldown. Set the newSEARXNG_FANOUT=trueto instead query all healthy instances in parallel and merge results — deduplicated by canonical URL, keeping the highest-scoring copy and ordered by descending score. A single-URLSEARXNG_URLbehaves exactly as before, so no configuration change is required. (FEAT-047, #128) -
Capability discovery aggregated across all instances for filter guidance:
searxng_instance_infoand thecategories/enginessearch parameters now aggregate live/configcapabilities from every reachable configured instance instead of a single one. The tool reportscommoncategories and engines (supported on every reachable instance, so safe for consistent multi-instance results) alongside best-effortavailablevalues, keeping filter guidance accurate when replicas differ in their enabled engines. A/configendpoint that fails is skipped for about 60 seconds, or retried immediately whensearxng_instance_infois called withrefresh=true. (FEAT-048, #130)
Fixed
-
safesearchaccepted as a string enum and honoring the instance default when omitted:safesearchis now declared as a string enum ("0","1","2") so MCP clients that send every tool argument as a string — notably Gemini and Antigravity — no longer fail schema validation. The schema default was also dropped, so omittingsafesearchnow falls back to each instance's server-side default instead of forcing a value. (BUG-006, #127) -
Docker Compose HTTP transport reachable from the host: The HTTP transport in the provided
docker-composesetup now binds to0.0.0.0instead of a loopback address, so the mapped port is reachable from the host rather than only from inside the container.
Full Changelog: v1.7.2...v1.8.0
v1.7.2
Security
- Container image now runs as a non-root user (UID 1000): The published Docker image previously ran as
root, so Kubernetes deployments using therunAsNonRoot: truepod security context were rejected at admission. The image now sets a numericUSER 1000(thenodeaccount already present in thenode:lts-alpinebase), which satisfiesrunAsNonRootwithout an additionalrunAsUseroverride and reduces the container's blast radius. No configuration change is required. (Reported by @nogweii, #122)
Full Changelog: v1.7.1...v1.7.2
v1.7.1
Security
- DNS-resolved private-address SSRF in
web_url_readblocked (GHSA-mrvx-jmjw-vggc): The URL reader previously validated only the literal hostname string, so a public-looking hostname that DNS-resolves to a private, loopback, or link-local address (for example a domain pointing at127.0.0.1/10.0.0.0/8or a cloud metadata endpoint like169.254.169.254) bypassed the SSRF guard. Direct (no-proxy) reads now validate every resolved DNS answer before connecting and pin the connection to the validated address, closing the DNS-rebinding window. TheMCP_HTTP_ALLOW_PRIVATE_URLS=trueopt-out still applies. When a URL-reader proxy is configured the proxy performs DNS resolution, so those deployments must rely on egress/firewall controls (documented inSECURITY.md). - Unbounded response-body read in
web_url_readcapped (GHSA-xcqx-9jf5-w339): The page-size limit was advisory only — a server using chunked transfer encoding, a failing/absent HEAD response, or a body larger than its reportedContent-Lengthcould force the entire response into memory (denial of service). The body is now read through a bounded stream that enforcesURL_READ_MAX_CONTENT_LENGTH_BYTES(default 5 MB) against the decompressed size and stops once the cap is exceeded, before any conversion or caching.
Full Changelog: v1.7.0...v1.7.1
v1.7.0
✨ Added
- HTML-search fallback (
SEARXNG_HTML_FALLBACK=true) — opt-in compatibility mode for SearXNG instances that disable JSON output. When a search hits a403/404or a non-JSON response, it is automatically retried withoutformat=jsonand results (title, URL, snippet) are parsed from the regular HTML results page and markedsourceFormat: "html". Triggers strictly on format rejections — never on401,5xx, network, or timeout errors. Enabling JSON on a SearXNG instance you control remains the recommended setup (see the README troubleshooting section).
🔒 Security
undici→ 7.28.0 — resolves two HIGH advisories affecting 7.0.0–7.27.2: GHSA-vmh5-mc38-953g (TLS certificate validation bypass in the SOCKS5ProxyAgent) and GHSA-pr7r-676h-xcf6 (cross-user information disclosure via shared-cache whitespace bypass).form-data→ 4.0.6 — clears a CRLF-injection advisory (GHSA-hmw2-7cc7-3qxx) in the test toolchain.
Full Changelog: v1.6.0...v1.7.0
v1.6.0
This release rolls up everything since v1.4.0. Note: 1.5.0 was published to npm and Docker Hub on 2026-06-12 but never received a GitHub release — those changes are included below alongside the new 1.6.0 work.
✨ Added
enginesparameter onsearxng_web_search— a comma-separated list (e.g.google,bing,duckduckgo) routes a search to specific SearXNG engines instead of the category defaults.- Validated & normalized
categories/engines— values are trimmed and matched case-insensitively against the connected instance's live/config, and canonical names are sent to SearXNG. Unknown values are rejected up front with the available options listed, fixing silent search degradation from miscased names. - Configurable URL cache controls —
CACHE_TTL_MS(default 24 h) andCACHE_MAX_ENTRIES(default 500). - Bounded URL cache eviction — entries track hit counts and use LFU eviction with oldest-entry tie-breaking.
searxng_suggestionstool — returns search autocomplete suggestions from the instance.searxng_instance_infotool — discovers instance capabilities (engines, categories, languages, safe-search).- JSON response format —
searxng_web_searchacceptsresponse_format("text"|"json") for programmatic result processing. - Search metadata in text output — answers, spelling corrections, infoboxes, and suggestions surface alongside ranked results.
🔧 Changed
- URL cache TTL default raised from 60 s to 24 h within a running server (entries still expire/evict).
🐛 Fixed
- Metadata (answers, corrections, infoboxes) is preserved in text output even when
min_scorefilters out all web results. - Unresponsive engines are no longer listed in text output.
searxng_suggestionsandsearxng_instance_infonow route through the configured search proxy and default TLS dispatcher.
🔒 Security
- Least-privilege Docker workflow permissions —
security-events: writeis isolated to a dedicated image-scan job in both the publish and rebuild workflows, withid-token: writeconfined to the publish/sign job and workflow-level permissions kept read-only. - Patched bundled
hono— pinned the transitivehonodependency to ≥ 4.12.25 (npmoverrides) to resolve CVE-2026-54290 (CORS middleware origin reflection) in the published Docker image.
🏗️ Build / CI
- Added a CI workflow running lint plus unit and integration tests on every pull request and push to
main.
Full Changelog: v1.4.0...v1.6.0
v1.5.0
Backfilled release —
1.5.0was published to npm and Docker Hub on 2026-06-12 but the GitHub release was missed at the time.
✨ Added
searxng_suggestionstool — returns search autocomplete suggestions from the SearXNG instance.searxng_instance_infotool — discovers the connected instance's capabilities (enabled engines, supported categories, available languages, safe-search settings).- JSON response format —
searxng_web_searchaccepts aresponse_formatparameter ("text"|"json");"json"returns raw structured data for programmatic processing. - Search metadata in text output —
searxng_web_searchtext responses now include answers, spelling corrections, infoboxes, and autocomplete suggestions when the instance returns them.
🐛 Fixed
- Metadata (answers, corrections, infoboxes) is preserved in text output even when
min_scorefilters out all web results. - Unresponsive engines are no longer listed in text output.
searxng_suggestionsandsearxng_instance_inforequests route through the configured search proxy and default TLS dispatcher.
Full Changelog: v1.4.0...v1.5.0
v1.4.0
Added
-
Result count control:
num_resultsparameter onsearxng_web_search(1–20) lets callers request only as many results as they need.SEARXNG_MAX_RESULTSenv var sets an operator-level hard cap that applies even whennum_resultsis omitted — useful for reducing token spend across all callers. -
Token budget limits:
SEARXNG_MAX_RESULT_CHARSenv var truncates each search result snippet to a character limit (appending…) before returning.URL_READ_MAX_CHARSenv var sets a defaultmaxLengthfor URL reads when the caller omits it — both controls are recommended for local models with small context windows. -
HEAD preflight for URL reader: A fast HEAD request is made before every URL fetch to check
Content-Length. If the server reports a size aboveURL_READ_MAX_CONTENT_LENGTH_BYTES(default 5 MB), the download is blocked and a descriptive message withreadHeadings/sectionpagination hints is returned instead of downloading an unbounded body. -
categoriesparameter onsearxng_web_search: Routes searches to specific SearXNG categories —general,news,images,videos,it,science,files,social media. Omitting the parameter uses the SearXNG instance default (general). -
Configurable search defaults:
SEARXNG_DEFAULT_LANGUAGEandSEARXNG_DEFAULT_SAFESEARCHenv vars set operator-level defaults for language and safe-search level. Per-call parameters still take precedence. InvalidSEARXNG_DEFAULT_SAFESEARCHvalues (not0,1, or2) are logged and ignored. -
Configurable timeouts:
SEARXNG_TIMEOUT_MScontrols the search request timeout andFETCH_TIMEOUT_MScontrols the URL reader fetch timeout (both default to10000ms). -
Lite tool schemas (
SEARXNG_LITE_TOOLS=true): When set, registers minimalquery-only andurl-only tool schemas instead of the full parameter list. Reduces context overhead for local models with small context windows while still forwarding any extra arguments the caller provides.
Security
- Pinned the npm trusted publishing installer step in the publish workflow to a full commit SHA to guard against tag-swap supply-chain attacks.
Full Changelog: v1.3.4...v1.4.0
v1.3.4
Security
- Docker images are now signed with Cosign (keyless OIDC). Verify a published image with:
cosign verify docker.io/isokoliuk/mcp-searxng:latest \ --certificate-identity-regexp 'https://github.com/ihor-sokoliuk/mcp-searxng/.github/workflows/docker-publish.yml@.*' \ --certificate-oidc-issuer https://token.actions.githubusercontent.com - Expanded fuzz test coverage: search parameter handling and URL read arguments are now fuzz-tested on every CI run.
- Tightened GitHub Actions workflow permissions to least-privilege and switched to reproducible
npm ciinstalls in the publish pipeline.
Full Changelog: v1.3.3...v1.3.4
v1.3.3
Fixed
test:coveragescript now enforces the coverage threshold mechanically.- Gitignored AI process artifacts (plans, drafts) so they can never be committed.
Security
- Docker base image (
node:lts-alpine) is now pinned by digest and bumped automatically via Dependabot. - Added a weekly rebuild workflow: when upstream patches the base image, the published Docker image is rebuilt from the latest release tag, re-scanned with Trivy, and republished under the same version tags. Published images now embed the
org.opencontainers.image.base.digestOCI label for auditability.
Full Changelog: v1.3.2...v1.3.3
v1.3.2
Fixed
- Expanded
SearXNGWebresponse interface to include all fields returned by the API. - Search requests now use
AbortControllerto enforce the configured timeout and prevent hanging.
Security
- Pinned all GitHub Actions workflow steps to full commit SHAs to guard against tag-swap supply-chain attacks.
- Added CodeQL static analysis, Trivy Docker image scanning, and ClusterFuzzLite continuous fuzzing.
- Added Dependabot for automated npm and GitHub Actions dependency updates.
- Verified
mcp-publisherbinary integrity with SHA-256 checksum before use.
Full Changelog: v1.3.1...v1.3.2