📖
Police Data Access Point Docs
pdap.ioGitHub
  • 👋Welcome
  • ⚡Activities
    • Label new Data Sources
      • Labeling events
    • Volunteer for Data Requests
    • Search for Data Sources
    • Publish data
    • Web scraping
    • FOIA requests
    • Advocate for open data
  • 🔬About
    • Search the PDAP database
    • Terms & definitions
      • What is a Data Source?
      • Terminology
    • Database details
      • Data Sources data dictionary
      • Agencies data dictionary
      • Requests data dictionary
      • Record Types taxonomy
      • Hidden properties
    • GitHub
    • Hugging Face
  • 📡API
    • Introduction/Getting Started
  • 🛠️Tools & Resources
    • Related projects
    • Resources for using data
    • Using LLMs like ChatGPT
  • 🔁Meta
    • Internal Tools (Retool)
    • Internal dev resources
      • GitHub issue template
      • GitHub pull request template
      • Product changes checklist
      • ☑️Production QA Checklist
      • Retool
    • Operations
      • Staff resources
        • Meeting Minutes
          • 2021-07-14
          • 2021-06-16
          • 2021-03-14
          • 2020 11-21 Tech Stack discussion
          • 2020-09-30 Leadership Cadence
          • 2020-10-14 Leadership Cadence
          • 2020-10-21 Leadership Cadence
          • 2020-10-28 Leadership Cadence
          • 2020-11-04 Leadership Cadence
          • 2020-11-12 Leadership Cadence
          • 2020-11-18 Leadership Cadence
          • 2020-11-25 Leadership Cadence Notes
          • 2020-12-02 Leadership Cadence Notes
          • 2020-12-09 Leadership Cadence Notes
          • 2020-12-12 Working Session
          • 2020-12-16 Leadership Cadence
          • 2020-12-30 Leadership Cadence Notes
          • 2021-01-06 Leadership Cadence Notes
          • 2021-01-13 Leadership Cadence Notes
          • 2021-01-20 Leadership Cadence Notes
          • 2021-01-27 Leadership Cadence Notes
          • 2021-02-03 Leadership Cadence Notes
          • 2021-02-10 Leadership Cadence Notes
          • 2021-02-17 Leadership Cadence Notes
          • 2021-02-24 Leadership Cadence Notes
          • 2021-03-03 Leadership Cadence Notes
          • 2021-03-10 Leadership Cadence Notes
          • 2021-03-16 Leadership Cadence Notes
          • 2021-03-27 database working session
          • 2021-03-31
          • 2020-12-1
          • 2021-01-23
          • 2021-04-10 Meeting notes
          • 2021-04-17 Meeting notes
          • 2021-04-21 Leadership Cadence
          • 2021-04-28 Leadership Cadence
          • 2021-05-05 Leadership Cadence
          • 2021-05-12 Leadership Cadence
          • 2021-05-19 Leadership Cadence
          • 2021-05-26 Leadership Cadence
          • 2021-06-02 Leadership Cadence
          • Decision log
        • Brand assets
      • Legal
        • Public records access laws & precedent
        • Legal Data Scraping
        • State Computer Crimes laws
      • Policy
        • Impartiality resolution
        • PDAP Access
        • PDAP Privacy Policy
        • Password Management
        • Personally Identifiable Information
    • Community calls
      • October 17, 2023
      • February 22, 2023
      • February 1, 2023
      • January 20, 2023
      • January 5, 2023
      • October 25, 2022
      • September 22, 2022
      • August 23, 2022
      • October 2, 2021
      • September 25, 2021
      • September 11, 2021
      • September 4, 2021
      • August 7, 2021
      • July 27 Dolt Bounty retro
      • July 17, 2021
      • July 10, 2021
      • June 26, 2021
      • June 19, 2021
      • June 12, 2021
      • June 5, 2021
      • May 1, 2021
      • April 24, 2021
    • Newsletter
    • Join our Discord
Powered by GitBook
On this page
  • 44RDCXDSDDDate
  • Participants
  • Discussion topics
  • Action items

Was this helpful?

Edit on GitHub
  1. Meta
  2. Operations
  3. Staff resources
  4. Meeting Minutes

2021-04-17 Meeting notes

Previous2021-04-10 Meeting notesNext2021-04-21 Leadership Cadence

Was this helpful?

44RDCXDSDDDate

17 Apr 2021

Participants

Mitch Miller

Jeff Joskisch

Discussion topics

Item

Notes

Dolt / Databases

  • Lots of changes being made in datasets

    • agencies

  • Does Dolt support SQL COPY

  • Richard has MongoDB creds for anyone who would like to experiment

  • How slow is Dolt? Breakingly? We’re going to have ~2 million records.

  • We can host a mirror on our server and run an instance if people want the same data quicker / without the dolt UI

Podcast fame

OCR

Tensorflow has been suggested

We may need a lot of training data, which we don’t have.

  • Is there a way we could slowly start to feed this stuff to tensorflow now?

Requirements:

  • We need to be able to comma delimit things on the way in

There’s no harm in the meantime with publishing unedited PDFs

  • may inspire contributors to help with OCR

FE

There are some folks ready to work on stuff for when we have data

For now it’s pretty small and people could clone it and run it locally

Miles is converting gatsby to JSX which will make iteration easier

mongodb

We should have a template if we’re using Mongo

Docker compose file

Mitch is working on a docker compose file for dolt and mongo, which will be helpful as we get our ETL framework together

Action items

Jeff was interviewed on releasing this week (wednesday) and mentioned PDAP

to ping Mitch in slack when sql-server POC is done (nearly)

policy / rationale for PII → docs (this is a high priority)

make shitty base tables from examples of other data types

Do meeting notes in Docs next time so they can be shared

🔁
Josh Chamberlain
Eric Turner
Richard Ji
Eric Turner
Josh Chamberlain
Josh Chamberlain
Josh Chamberlain
Privacy Please