This document discusses building predictive models from large datasets using learning with counts and Azure Machine Learning. It provides an overview of the NYC taxi trip dataset, tools for working with large data like HDInsight and IPython notebooks, and building models with Azure ML Studio and the learning with counts algorithm. The learning with counts approach represents features as conditional counts that can be aggregated and used to train models in a scalable way for multi-entity domains. The document puts it all together by discussing how to use HDInsight for count aggregation, Azure ML for model training and deployment, and learning with counts as an intuitive large-scale machine learning solution.