Mind2Web 2: A new era of “agent-based” web search
🧠 Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Agentic search is one of the most promising applications of modern AI. Imagine a virtual assistant that doesn’t just look up information for you but autonomously searches the web, navigates pages, finds facts, and returns well-structured answers with citations. That’s the idea behind tools like OpenAI’s Deep Research. But how do we evaluate whether such an AI is doing a good job? ...