Data Extractions
Running Scrapers and using the generated Extractions.
Current status
We're still in the iteration and case study phase. To submit data:
1. Run a Scraper you wrote, or an approved Scraper from the Scrapers Repo, to get an Extraction.
2. Share your Extraction in Discord. We'll work with you to write and publish a case study.
3. We'll all learn from the experience, and brainstorm ways our tools could better facilitate your work.
4. Repeat!
Requirements for published Extractions
We always need to know when Extractions were collected.
We need to know which code was used to Scrape the Extraction.
We need to know from which Data Source the data was Extracted.
Since we have git version control on our scrapers, we can audit any Extraction to understand how it was gathered.
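As an illustration of the three requirements above, here is a minimal sketch of a metadata record that could travel with an Extraction. This is not an official PDAP format: the `ExtractionRecord` class, the `make_record` helper, the field names, the commit hash, and the example URL are all hypothetical and chosen only for this example.

```python
from dataclasses import dataclass, asdict
from datetime import datetime, timezone
import json


@dataclass(frozen=True)
class ExtractionRecord:
    """Hypothetical metadata stored alongside an Extraction."""
    collected_at: str    # when: ISO 8601 timestamp in UTC
    scraper_commit: str  # which code: git commit hash of the Scraper
    data_source: str     # which Data Source: identifier or URL


def make_record(scraper_commit, data_source, collected_at=None):
    """Build a record, refusing to proceed if a required fact is missing."""
    if not scraper_commit or not data_source:
        raise ValueError("scraper_commit and data_source are required")
    # Default to the current time if the caller does not supply one.
    when = collected_at or datetime.now(timezone.utc)
    if when.tzinfo is None:
        raise ValueError("collected_at must be timezone-aware")
    return ExtractionRecord(
        collected_at=when.astimezone(timezone.utc).isoformat(),
        scraper_commit=scraper_commit,
        data_source=data_source,
    )


record = make_record(
    scraper_commit="0123abc",  # placeholder, not a real commit
    data_source="https://example.com/police-incidents",  # placeholder URL
    collected_at=datetime(2021, 1, 1, 12, 0, tzinfo=timezone.utc),
)
print(json.dumps(asdict(record), indent=2))
```

Recording the commit hash rather than a Scraper name is what would make the git-based audit possible: checking out that commit shows the exact code that produced the Extraction.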
Last modified
10d ago