This document summarizes a study of how people comprehend illustrations that show physical actions from different perspectives. Participants viewed images of a man holding or swinging a bat, shown from different angles and heights, and for each image had to identify the matching overhead image. Images with canonical views (e.g., 1/3 side views) yielded somewhat higher accuracy than images with non-canonical views, but accuracy for non-canonical views was still high with more practice. The study suggests that, given more time, people can perform the mental rotation needed to comprehend images from different perspectives.