Links

Resources for working with data

Using & analyzing data

RATH by Kanaries for no-code exploratory data analysis

Collecting data

Best practices for data collection from Inspect Element
Finding undocumented APIs from Inspect Element
Automatic DocumentCloud scraping which requires a verified MuckRock account. If you don't have one, you can use ours! Reach out.

Processing data

Crosswalker is for joining columns of text data that don’t match perfectly.
CJWorkbench, a spreadsheet-like program with powerful tools for data journalism.
PDF processing and OCR from Chad Day for NICAR
Frictionless data for transforming and describing messy datasets, making them more interoperable, and creating a pipeline
Kyle Walker's guide for using Python to do spatial mapping and analysis

Learning Python

Consuming APIs with Python from Realpython