Microsoft Research Blog

BenchmarkQED: Automated benchmarking of RAG systems

BenchmarkQED is an open-source toolkit for benchmarking RAG systems, with automated query generation, evaluation, and dataset preparation. Benchmarks run with it show that LazyGraphRAG outperforms standard methods, especially on complex, global queries.
