Metadata describe a dataset so that it can be understood, re-used, and integrated with other datasets.

Information described in a metadata record includes where the data were collected, who is responsible for the dataset, why the dataset was created, and how the data are organized.
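As a rough illustration of the where, who, why, and how above, a metadata record could be sketched as a small Python data structure. The class name, field names, and sample values here are all hypothetical and do not follow any particular metadata standard.

```python
from dataclasses import dataclass, asdict


@dataclass
class MetadataRecord:
    """Hypothetical minimal metadata record; field names are illustrative only."""
    title: str
    collected_where: str    # where the data were collected
    responsible_party: str  # who is responsible for the dataset
    purpose: str            # why the dataset was created
    organization: str       # how the data are organized


# Made-up sample values, used only to show the shape of a record.
record = MetadataRecord(
    title="Stream temperature survey",
    collected_where="Example River, site A",
    responsible_party="Example Lab",
    purpose="Illustrate a metadata record",
    organization="One CSV file, one row per reading",
)

# A plain dict is convenient for serializing the record alongside the data.
record_dict = asdict(record)
```

Keeping the record as structured fields rather than free text is what lets other tools search, compare, and merge datasets by these attributes.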

protocol for now - we will come back to this in a future blog post.

Between our self-describing JSON Schemas and our self-describing JSONs, we have joined up all of our implicit knowledge about this JSON instance and its JSON Schema.
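A minimal sketch of how the two halves might join up, under assumptions not spelled out in this section: the schema is assumed to carry a "self" block with vendor, name, format, and version, and the instance is assumed to be an envelope with a "schema" reference and a "data" payload. The "iglu:" key layout, the helper names, and the com.example vendor are illustrative only.

```python
def schema_key(schema: dict) -> str:
    """Build a reference string from the schema's (assumed) "self" block."""
    s = schema["self"]
    return f"iglu:{s['vendor']}/{s['name']}/{s['format']}/{s['version']}"


def describe(schema: dict, data: dict) -> dict:
    """Wrap a payload so that it names the schema it claims to follow."""
    return {"schema": schema_key(schema), "data": data}


def matches(instance: dict, schema: dict) -> bool:
    """Check that an instance points at exactly this schema."""
    return instance.get("schema") == schema_key(schema)


# Hypothetical self-describing JSON Schema for an ad click event.
ad_click_schema = {
    "self": {
        "vendor": "com.example",
        "name": "ad_click",
        "format": "jsonschema",
        "version": "1-0-0",
    },
    "type": "object",
    "properties": {"ad_id": {"type": "string"}},
}

# The instance now carries its own schema reference alongside its data.
instance = describe(ad_click_schema, {"ad_id": "a-123"})
```

With this arrangement, a consumer holding only the instance can read its "schema" field to work out which schema to fetch, and `matches` confirms that the pairing is the intended one. Validating the payload against the schema body is a separate step that this sketch leaves out.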

At Snowplow we have been thinking a lot about how to add schemas to our data models, in place of the implicit data models and wiki-based tracker protocols that we have today.

Crucially, whatever we come up with must also work for Snowplow users, who want to be able to add schemas to their own unstructured events and custom contexts in Snowplow.

There are many uses for metadata, even beyond the simple discovery of datasets.

Metadata can be used for understanding data, analysis and synthesis, maintaining longevity of a dataset for an organization, tracking the progress of a research project, and demonstrating the return on investment for research at an institution.

A data model is a diagram (along with supporting documentation) that describes business terms (such as Order and Product) and the relationships between these terms (for example, a Product may appear on many Orders). Getting the data model right is important because it is the foundation for the database and development work.
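The Order and Product example can also be sketched in code. This is only an illustration of the relationship, not a replacement for the diagram; the class fields and the `orders_containing` helper are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class Product:
    """A business term: something that can be sold."""
    sku: str
    name: str


@dataclass
class Order:
    """A business term: a purchase that lists one or more products."""
    order_id: str
    products: List[Product] = field(default_factory=list)


def orders_containing(product: Product, orders: List[Order]) -> List[str]:
    """Return the IDs of every order on which the product appears."""
    return [o.order_id for o in orders if product in o.products]


widget = Product(sku="W-1", name="Widget")
gadget = Product(sku="G-1", name="Gadget")

orders = [
    Order(order_id="O-1", products=[widget, gadget]),
    Order(order_id="O-2", products=[widget]),
]

# The same Product appears on many Orders, as the model describes.
widget_orders = orders_containing(widget, orders)
```

In a relational database the same relationship would typically be stored in a separate linking table, which is one reason the model needs to be settled before development starts.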