EDV uses multiple heterogeneous agents to generate diverse solution approaches, an independent verifier, and a consensus mechanism to filter out erroneous experiences before they are stored.
LLM agents can commit early to an incorrect interpretation without final answer correctness revealing this — hidden-state convergence enables early detection of this failure mode.