Hadoop at LinkedIn has grown significantly over time, from 1 cluster with 20 nodes in 2008 to over 10 clusters with over 10,000 nodes now. The number of users and workflows has also increased dramatically. While hardware scaling is difficult, scaling human infrastructure and managing dependencies between data producers, consumers, and infrastructure providers is even harder. The Dali system aims to abstract away physical data details and make data easier to access and manage through a dataset API, views, and lineage tracking. Views allow decoupling data APIs from the underlying datasets and enable safe evolution of these APIs through versioning. Contracts expressed as logical constraints on views provide clear, understandable, and modifiable agreements between producers and consumers. This approach has helped large projects