Pro / Agency License

The Dataset Builder

Google operates a completely separate search engine called Google Dataset Search, used by researchers, scientists, and journalists. The SPP Dataset builder structures your tables, CSVs, and statistical reports to ensure they get indexed in this specialized directory.

1. Spatial and Temporal Coverage

Researchers filter data by *when* it was collected, and *where* it applies. Mapping these two fields correctly is crucial for visibility.

Temporal Coverage

Define the timeframe the data covers using ISO 8601 format. For a specific year, use 2023. For a range, use slashes: 2020-01-01/2023-12-31.

Spatial Coverage

Define the geographic area the data covers (e.g., "United States" or "London").

Pro Tip: Global Data Automation

If your dataset applies to the entire world, simply type the exact word Global into the Spatial Coverage field. The SPP Engine will automatically intercept this and translate it into a mathematically perfect GeoShape box covering the entire Earth for Google's crawlers.

2. File Distributions & Variables

A Dataset schema must point to the actual data files so users can download them, and it needs to explain what the data contains.

  • Distributions (Downloads): Use the repeater tool to link directly to your raw data files. Ensure you select the correct encoding format (e.g., text/csv, application/json, or application/pdf) and provide the direct URL.
  • Measured Variables: Use the Variables repeater to define what your data actually tracks. Think of these as the column headers in your spreadsheet (e.g., "Temperature", "Population Count").

3. Authority, Citations, & Licensing

If you want researchers to use your data, you must tell them how to cite it and under what license it is provided.

Use the Authority & Catalog section to provide the exact Citation text researchers should copy and paste. You should also provide the direct URL to your License agreement (SPP defaults this to the standard Creative Commons BY 4.0 license for convenience).