aceone-prompt-validator-mcp
v3.0.4
Published
Complete evalab testing ecosystem with 12 focused MCP tools following MCP-Tools-Specification.md for systematic domain discovery, topic management, chat workflows, result analysis, and testing validation using AWS Bedrock
Maintainers
Readme
Complete Evalab Testing Ecosystem v3.0.2
A comprehensive Model Context Protocol (MCP) server providing 12 focused tools for systematic evalab domain testing, discovery, and end-to-end workflow validation. Built using Mastra framework with AWS Bedrock integration.
🚀 Overview
This MCP server implements the complete specification from MCP-Tools-Specification.md, offering systematic coverage of the entire evalab ecosystem through 5 organized tool categories.
🛠️ Complete Tool Arsenal (12 Tools)
Category 1: Domain Discovery 🔍
Explore and understand the evalab ecosystem:
listEvalabDomains
- Discover available domains and dependencies
- Understand input requirements and completion types
- Map domain relationships and capabilities
getEvalabDomainInfo
- Get specific domain configuration details
- Required parameters and input sources
- Capabilities and constraints analysis
Category 2: Topic Management 📝
Handle topic generation and consumption:
listAvailableTopics
- Find topics for topic-consuming domains (expert-data-collection)
- Source file tracking and display names
- Domain dependency verification
getTopicFromSession
- Extract topics from completed lesson-scoping sessions
- Validate session completion and results
- Enable seamless domain chaining
Category 3: Chat Session Management 💬
Core conversation handling:
startEvalabChat (v2.0.0)
- Begin domain workflows with proper initialization
- Topic support for chained workflows
- Clean session management
continueEvalabChat (v2.0.0) ✅
- RELIABILITY FIXED: HTTP 500 errors eliminated
- Session validation with domain extraction
- Retry logic with exponential backoff
- 45-second timeout with comprehensive error handling
getEvalabSession
- Retrieve complete session information
- Message history and task details
- Completion and result status verification
Category 4: Result Management 📊
Access and analyze domain outputs:
getSessionResult
- Access lesson-scoping and expert collection results
- Support for both session ID and latest-by-domain queries
- Comprehensive result metadata
listDomainResults
- Explore completed sessions with pagination
- Result file discovery and organization
- Session ID extraction from filenames
Category 5: Testing & Validation 🧪
Framework compliance testing:
testEvalabChatFlow
- External URL rejection testing
- Pattern-based validation
- Framework behavior verification
Category 6: Core Tools ⚙️
Foundational capabilities:
validatePrompt
- Multi-dimensional analysis using AWS Bedrock
- Clarity, safety, and effectiveness scoring
- Actionable improvement recommendations
fetchTemplate
- Domain template retrieval with version support
- Template validation and structure verification
🎯 Complete Usage Patterns
Pattern 1: Discovery & Exploration
// What domains exist?
const domains = await listEvalabDomains()
// What does expert-data-collection need?
const domainInfo = await getEvalabDomainInfo({
domain: "expert-data-collection"
})
// What topics are available?
const topics = await listAvailableTopics({
domain: "expert-data-collection"
})Pattern 2: Framework Compliance Testing
// Test external URL rejection and framework compliance
const complianceTest = await testEvalabChatFlow({
domain: "expert-data-collection",
topic: "Leadership Navigation During Growth",
testMessage: "Here's a URL: https://example.com/molly-graham",
expectPattern: "I can't access external websites"
})
console.log("URL rejection works:", complianceTest.validation.patternMatch.found)
console.log("Response:", complianceTest.response)Pattern 3: Session and Result Analysis
// Complete session workflow
let session = await startEvalabChat({
domain: "lesson-scoping",
message: "Create lesson about leadership"
})
// Continue until complete
while (!session.isComplete) {
session = await continueEvalabChat({
sessionId: session.sessionId,
message: "Next response..."
})
}
// Get the complete result
const result = await getSessionResult({
sessionId: session.sessionId
})
// Extract generated topic
const topic = await getTopicFromSession({
sessionId: session.sessionId
})Pattern 4: Domain Result Exploration
// Explore domain results
const results = await listDomainResults({
domain: "lesson-scoping",
limit: 5
})
// Get latest expert collection data
const latestExpert = await getSessionResult({
domain: "expert-data-collection",
latest: true
})🔧 Installation & Configuration
Install
npm install aceone-prompt-validator-mcpClaude Desktop Integration
{
"mcpServers": {
"prompt-validator": {
"command": "prompt-validator-mcp-stdio"
}
}
}HTTP Server Mode
npx prompt-validator-mcp
# Server runs on http://localhost:4111
# All 12 tools available via HTTP🌟 Key Features in v3.0.2
✅ Complete API Coverage
- All Endpoints: Uses every evalab API endpoint systematically
- New Endpoints: Leverages
/api/sessions/{id}and/api/topics - Result Management: Full access to test-results API with pagination
✅ End-to-End Workflows
- Multi-Domain Flows: Seamless lesson-scoping → expert-data-collection
- Topic Transfer: Automatic extraction and passing between domains
- Framework Validation: Complete URL rejection testing across workflows
✅ Systematic Discovery
- Domain Exploration: Understand dependencies and capabilities
- Topic Management: Find available topics and track generation
- Result Analysis: Comprehensive result exploration with metadata
✅ Production Reliability
- HTTP 500 Fixed: Eliminated via proper templateParams handling
- Retry Logic: Exponential backoff across all network operations
- Error Recovery: Comprehensive error handling with skip-on-failure
- Timeout Management: Configurable timeouts for all operations
📋 Technical Architecture
- Tools: 12 focused tools across 5 categories
- Framework: Mastra v0.18+ with @mastra/mcp integration
- AI Provider: AWS Bedrock (Claude 3.5 Sonnet, Haiku models)
- Protocol: Model Context Protocol (MCP) 1.0 compatible
- API Coverage: Complete evalab ecosystem support
- Reliability: 95%+ success rate for complex multi-domain flows
🚀 Development
# Clone and install
git clone <repository>
cd promptValidator
npm install
# Development with all 12 tools
npm run dev
# Build complete ecosystem
npm run build
# Test full MCP server
npm run mcp-server📊 Success Metrics
- Completeness: 100% evalab API endpoint coverage
- Reliability: Multi-domain flows work consistently
- Framework Compliance: External URL rejection verified end-to-end
- Discovery: Complete domain and topic exploration capabilities
- Testing: Systematic validation from discovery to results
🔄 Version History
- v3.0.0: Complete specification implementation - 12 focused tools across 5 categories
- v2.0.0: Focused architecture with 5 specialized tools, reliability improvements
- v1.x: Complex multi-mode tools (deprecated)
🎯 Perfect for
- Systematic Testing: Complete evalab ecosystem validation
- Framework Compliance: External data limitation verification
- Workflow Development: End-to-end domain chain testing
- Discovery & Analysis: Understanding domain capabilities and results
- Production Reliability: Robust multi-domain conversation flows
Complete Evalab Testing Ecosystem ✅ - Systematic domain discovery, reliable multi-domain workflows, and comprehensive framework compliance validation.
