📖
Police Data Access Point Docs
pdap.ioGitHub
  • 👋Welcome
  • ⚡Activities
    • Label new Data Sources
      • Labeling events
    • Volunteer for Data Requests
    • Search for Data Sources
    • Publish data
    • Web scraping
    • FOIA requests
    • Advocate for open data
  • 🔬About
    • Search the PDAP database
    • Terms & definitions
      • What is a Data Source?
      • Terminology
    • Database details
      • Data Sources data dictionary
      • Agencies data dictionary
      • Requests data dictionary
      • Record Types taxonomy
      • Hidden properties
    • GitHub
    • Hugging Face
  • 📡API
    • Introduction/Getting Started
  • 🛠️Tools & Resources
    • Related projects
    • Resources for using data
    • Using LLMs like ChatGPT
  • 🔁Meta
    • Internal Tools (Retool)
    • Internal dev resources
      • GitHub issue template
      • GitHub pull request template
      • Product changes checklist
      • ☑️Production QA Checklist
      • Retool
    • Operations
      • Staff resources
        • Meeting Minutes
          • 2021-07-14
          • 2021-06-16
          • 2021-03-14
          • 2020 11-21 Tech Stack discussion
          • 2020-09-30 Leadership Cadence
          • 2020-10-14 Leadership Cadence
          • 2020-10-21 Leadership Cadence
          • 2020-10-28 Leadership Cadence
          • 2020-11-04 Leadership Cadence
          • 2020-11-12 Leadership Cadence
          • 2020-11-18 Leadership Cadence
          • 2020-11-25 Leadership Cadence Notes
          • 2020-12-02 Leadership Cadence Notes
          • 2020-12-09 Leadership Cadence Notes
          • 2020-12-12 Working Session
          • 2020-12-16 Leadership Cadence
          • 2020-12-30 Leadership Cadence Notes
          • 2021-01-06 Leadership Cadence Notes
          • 2021-01-13 Leadership Cadence Notes
          • 2021-01-20 Leadership Cadence Notes
          • 2021-01-27 Leadership Cadence Notes
          • 2021-02-03 Leadership Cadence Notes
          • 2021-02-10 Leadership Cadence Notes
          • 2021-02-17 Leadership Cadence Notes
          • 2021-02-24 Leadership Cadence Notes
          • 2021-03-03 Leadership Cadence Notes
          • 2021-03-10 Leadership Cadence Notes
          • 2021-03-16 Leadership Cadence Notes
          • 2021-03-27 database working session
          • 2021-03-31
          • 2020-12-1
          • 2021-01-23
          • 2021-04-10 Meeting notes
          • 2021-04-17 Meeting notes
          • 2021-04-21 Leadership Cadence
          • 2021-04-28 Leadership Cadence
          • 2021-05-05 Leadership Cadence
          • 2021-05-12 Leadership Cadence
          • 2021-05-19 Leadership Cadence
          • 2021-05-26 Leadership Cadence
          • 2021-06-02 Leadership Cadence
          • Decision log
        • Brand assets
      • Legal
        • Public records access laws & precedent
        • Legal Data Scraping
        • State Computer Crimes laws
      • Policy
        • Impartiality resolution
        • PDAP Access
        • PDAP Privacy Policy
        • Password Management
        • Personally Identifiable Information
    • Community calls
      • October 17, 2023
      • February 22, 2023
      • February 1, 2023
      • January 20, 2023
      • January 5, 2023
      • October 25, 2022
      • September 22, 2022
      • August 23, 2022
      • October 2, 2021
      • September 25, 2021
      • September 11, 2021
      • September 4, 2021
      • August 7, 2021
      • July 27 Dolt Bounty retro
      • July 17, 2021
      • July 10, 2021
      • June 26, 2021
      • June 19, 2021
      • June 12, 2021
      • June 5, 2021
      • May 1, 2021
      • April 24, 2021
    • Newsletter
    • Join our Discord
Powered by GitBook
On this page
  • Attendees
  • Topics

Was this helpful?

Edit on GitHub
  1. Meta
  2. Community calls

May 1, 2021

Attendees

  • Josh

  • Mitch

  • Jeff

  • Eddie

Topics

Topic

Notes

Downloadable scrapers package

Chain of custody

  • Where

  • When

  • Who

  • With what scraper code

What granularity do we want for audit history? Cellwise

We need auth—anything else is too easy to spoof

Data collision

Treat each timestamp as its own piece of data

Datasets

Maintain source info / keep it up to date

Auth

  • Keybase (infra)

  • Django built-in

  • GitHub for scraping (pub/privkey)

  • Medium-term SSO

Incentives

  • Dolt bounties

  • Paid scrapers—pay for a quick project (fiverr, freelancer)

    • Someone paid has ~guaranteed expertise. Collect feedback from professional consumers

$

We're covering a few licenses, but need more donations. Considering Patreon

Support

  1. volunteer time

  2. volunteer money

  3. introduce us

  4. use the data

Scraper utilities

Data centers

We have NY and SF data centers in DigitalOcean, but they don't talk to each other

Security

  1. perimeter

  2. secrets manager

Who can make data PR approvals?

Front end

PreviousJune 5, 2021NextApril 24, 2021

Last updated 4 years ago

Was this helpful?

Proof of concept:

, ACLU,

🔁
https://github.com/EricTurner3/pdap-intake-ui/tree/overhaul
EFF
NFOIC
https://pdap.ericturner.it/datamap/
https://pdap.ericturner.it/datamap/schema
https://pdap.atlassian.net/browse/PDAP-162
https://github.com/Police-Data-Accessibility-Project/gatsby-pdap-frontpage/pull/17