Multi-source async competitive intelligence engine for AI training data ecosystems with watermark-driven incremental scanning & anomaly detection. CLI + MCP ready.