This document discusses evaluating measures for diversified search results. It introduces several existing measures for ad-hoc retrieval and diversified retrieval, and proposes some new measures. It describes using data from past NTCIR evaluations involving diversity search to compare these measures offline. The goal is to determine which measures best align with users' preferences for search result pages by collecting users' direct feedback on sample search results.