Search…
Submit or update Datasets

Dolt SQL Editor

You can use SQL statements to add new datasets right in Dolt. It's possible to generate these Insert statements from spreadsheets or write them manually.
Note: CSV uploads via the DoltHub UI are not working well for this process. Use at your own risk.

INSERT statement generator table

    1.
    Make a copy of the Dataset Submission Template and populate information about new datasets as you work.
    2.
    Use the Dataset Properties guide to find the appropriate IDs.
    3.
    Navigate to the Query table and note that each row generates a new SQL query.
    4.
    Fork the DoltHub Datasets repo. If you don't yet have an account, sign up here.
    5.
    Paste the queries individually into your fork's SQL console and run them.
    6.
    When you're done, create a new Pull Request against our repository by selecting your fork as the from repository.
    7.
    Join Dolt's Discord and ask in the #data-bounties channel for someone to approve it.

CLI

If you can improve these docs, click the link in the upper right to Edit on GitHub.
    2.
    Fork the project to your DoltHub account (you will need to create an account if you don't have one).
    3.
    Clone down your copy of the repository.
    1
    dolt clone <your account name>/datasets && cd datasets
    2
    dolt table export datasets > datasets.csv
    Copied!
    4.
    Open the datasets.csv file, make changes, and save. Leave id blank—UUIDs are generated automatically. Make sure you're not adding a URL that already exists.
    1
    dolt branch <branch name e.g. add-CA-counties>
    2
    dolt checkout <your branch>
    3
    dolt table import -u datasets datasets.csv
    Copied!
    5.
    Run this command to add your csv to your checked out branch.
    1
    dolt add .
    Copied!
    6.
    Push the commit.
    1
    dolt commit -m “<message e.g. added 5 Alameda County datasets>”
    2
    dolt push --set-upstream origin <your branch>
    Copied!
    8.
    Create a new Pull Request to merge your dataset into master. Select your fork as the from repository, and then your branch will appear as an option. Read more about DoltHub Pull Requests here.
Last modified 4mo ago