Label new Data Sources

Growing our Data Sources Database

Why we're here

Labeling potential Data Sources helps us make a more complete database! Our source collectors crawl the internet using a variety of strategies, and volunteers catalogue the sources they find. One day, we may also be able to train machine learning models to identify useful data.

For more background and a technical perspective, check out the Data Source Identification repo.

How to start

Reach out to [email protected] or use this volunteer form with "data labeling" selected. We'll give you instructions for accessing the labeling interface!

Content warning

This content contains annotations related to the criminal legal system, which may include discussions of topics such as crime, legal proceedings, incarceration, and violence. Additionally, part of our work is labeling websites which may appear to be related but do not pertain to the criminal legal system, and might contain anything at all. It's ok to take a break, or do something else.

Last updated

Was this helpful?