Resources for using data

Using & analyzing data

Common abbreviations in criminal legal system records

QA process for data analysis

CPE's guide on analyzing traffic stop data

RATH by Kanaries for no-code exploratory data analysis

Splunk is an intelligence analysis tool to make sense of vast amounts of data.

Collecting data

Compendium of state open records laws

ScrapingBee for headless websites

Best practices for data collection from Inspect Element

Finding undocumented APIs from Inspect Element

Automatic DocumentCloud scraping which requires a verified MuckRock account. If you don't have one, you can use ours! Reach out.

Processing data

Crosswalker is for joining columns of text data that don’t match perfectly.

CJWorkbench, a spreadsheet-like program with powerful tools for data journalism.

PDF processing and OCR from Chad Day for NICAR

Useful file types: Parquet, SQLite, FlatGeobuf from Alex Garcia

Frictionless data for transforming and describing messy datasets, making them more interoperable, and creating a pipeline

Kyle Walker's guide for using Python to do spatial mapping and analysis

Learning Python

Consuming APIs with Python from Realpython

An intro to your first notebook using Python & Jupyter from Palewire

A mini Python and Jupyter bootcamp from NICAR

Python scraping from Oxylabs

Last updated