Resources for using data
Using & analyzing data
Common abbreviations in criminal legal system records
CPE's guide on analyzing traffic stop data
RATH by Kanaries for no-code exploratory data analysis
Splunk is an intelligence analysis tool to make sense of vast amounts of data.
Collecting data
Compendium of state open records laws
ScrapingBee for headless websites
Best practices for data collection from Inspect Element
Finding undocumented APIs from Inspect Element
Automatic DocumentCloud scraping which requires a verified MuckRock account. If you don't have one, you can use ours! Reach out.
Processing data
Crosswalker is for joining columns of text data that donβt match perfectly.
CJWorkbench, a spreadsheet-like program with powerful tools for data journalism.
PDF processing and OCR from Chad Day for NICAR
Useful file types: Parquet, SQLite, FlatGeobuf from Alex Garcia
Frictionless data for transforming and describing messy datasets, making them more interoperable, and creating a pipeline
Kyle Walker's guide for using Python to do spatial mapping and analysis
Learning Python
Consuming APIs with Python from Realpython
An intro to your first notebook using Python & Jupyter from Palewire
A mini Python and Jupyter bootcamp from NICAR
Python scraping from Oxylabs
Last updated