What’s New in Higress 2.0? 30 Updates Including RAG MCP Server and Performance Fixes
The Higress 2.0 release introduces 30 changes—13 new features such as a RAG MCP server and ECDS‑based configuration refactor, 7 bug fixes, 5 refactorings, documentation updates and a test improvement—providing developers with enhanced knowledge‑management capabilities, more stable routing, and clearer documentation for cloud‑native service‑mesh environments.
Release Overview
This release bundles 30 updates (including 2 critical ones) for the Higress cloud‑native gateway, covering new functionality, bug fixes, refactoring, documentation, and testing improvements.
Key New Features
1. RAG MCP Server
Background : Modern applications need efficient knowledge‑management and retrieval. The RAG (Retrieval‑Augmented Generation) approach combines vector search with LLM generation. This PR adds a Model Context Protocol (MCP) server that provides knowledge‑block creation, block management, keyword search, and chat capabilities.
Core Functions :
Knowledge management – create knowledge blocks from text.
Block management – list and delete blocks.
Search – keyword‑based retrieval.
Chat – send messages and receive LLM responses.
The implementation uses external libraries such as github.com/dlclark/regexp2, github.com/milvus-io/milvus-sdk-go/v2, and github.com/pkoukk/tiktoken-go for regex, vector DB, and token encoding.
Configuration Steps :
Enable the MCP server in higress-config and set the path and options.
Configure RAG parameters (splitter type, chunk size, overlap, top_k, threshold).
Set LLM provider credentials (API key, model name).
Set embedding provider credentials.
Configure the vector‑database connection.
rag:
splitter:
type: "recursive"
chunk_size: 500
chunk_overlap: 50
top_k: 5
threshold: 0.5
llm:
provider: "openai"
api_key: "your-llm-api-key"
model: "gpt-3.5-turbo"
embedding:
provider: "openai"
api_key: "your-embedding-api-key"
model: "text-embedding-ada-002"
vectordb:
provider: "milvus"
host: "localhost"
port: 19530
collection: "test_collection"Value : Provides a complete knowledge‑management pipeline, improving efficiency and accuracy for large‑scale text processing.
2. ECDS Configuration Refactor
Problem : Directly embedding Golang filter configuration in HTTP_FILTER caused map ordering issues and connection‑drain when the configuration changed.
Solution : Split the configuration into two parts— HTTP_FILTER now only references a config_discovery filter, while the actual Golang filter settings are moved to EXTENSION_CONFIG. The refactor introduces constructMcpSessionStruct and constructMcpServerStruct to produce an EXTENSION_CONFIG -compatible structure and updates unit tests accordingly.
Benefit : Eliminates connection‑drain during config updates, improves stability, and simplifies future configuration changes.
Bug Fixes
Fix Authorization header overwrite issue (#2904) – ensures the original Authorization header is preserved.
Support AttackLevel field in MultiModalGuard (#2938) – restores security checks for high‑risk prompts.
Correct JSON unmarshalling for Claude API array‑content responses (#2898) – prevents parsing errors.
Resolve SSE chunk handling for Claude streaming (#2889) – prevents connection blockage.
Fix HTTP request handling that could drop the original Authorization header (#2904).
Refactoring and Optimizations
Removed duplicate think tags in Bedrock and Vertex plugins (#2933) – reduces code redundancy.
Optimized host matching by replacing regex with simple string operations and port stripping (#2890) – improves performance.
Updated matchDomain to use HostMatcher structure for faster matching.
General code cleanup and removal of unused fields in MCP server implementation.
Documentation Updates
Fixed broken link in README_JP.md and added missing sections in README.md (#2915).
Improved English and Chinese docs for hmac-auth-apisix plugin (#2912).
Corrected syntax errors across README files (#2880).
Added detailed usage instructions for the RAG MCP server and ECDS refactor.
Testing Improvements
Extended test coverage for ai-security-guard component, adding new cases and adjusting existing logic (#2928).
Release Statistics
🚀 New Features: 13 items
🐛 Bug Fixes: 7 items
♻️ Refactoring/Optimizations: 5 items
📚 Documentation Updates: 4 items
🧪 Test Improvements: 1 item
Overall, this release enhances Higress’s knowledge‑management capabilities, stabilizes configuration handling, and provides clearer documentation, delivering a more flexible and reliable cloud‑native gateway experience.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
Alibaba Cloud Native
We publish cloud-native tech news, curate in-depth content, host regular events and live streams, and share Alibaba product and user case studies. Join us to explore and share the cloud-native insights you need.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
