githubEdit

Resources for using data

Using & analyzing data

Common abbreviations in criminal legal system recordsarrow-up-right

QA process for data analysisarrow-up-right

CPE's guide on analyzing traffic stop dataarrow-up-right

RATHarrow-up-right by Kanaries for no-code exploratory data analysis

Splunkarrow-up-right is an intelligence analysis tool to make sense of vast amounts of data.

Collecting data

Compendium of state open records lawsarrow-up-right

ScrapingBee for headless websitesarrow-up-right

Best practices for data collectionarrow-up-right from Inspect Element

Finding undocumented APIsarrow-up-right from Inspect Element

Automatic DocumentCloud scrapingarrow-up-right which requires a verified MuckRock account. If you don't have one, you can use ours! Reach out.

The Berkeley COPWATCH "People's Database" guidearrow-up-right for creating a community accountability tool

Processing data

Crosswalkerarrow-up-right is for joining columns of text data that don’t match perfectly.

CJWorkbencharrow-up-right, a spreadsheet-like program with powerful tools for data journalism.

PDF processing and OCRarrow-up-right from Chad Day for NICAR

Useful file types: Parquet, SQLite, FlatGeobufarrow-up-right from Alex Garcia

Frictionless dataarrow-up-right for transforming and describing messy datasets, making them more interoperable, and creating a pipeline

Kyle Walker's guidearrow-up-right for using Python to do spatial mapping and analysis

Learning Python

Consuming APIs with Pythonarrow-up-right from Realpython

An intro to your first notebook using Python & Jupyterarrow-up-right from Palewire

A mini Python and Jupyter bootcamparrow-up-right from NICAR

Python scrapingarrow-up-right from Oxylabs

Last updated

Was this helpful?