# 2021-03-31

## Date <a href="#id-2021-03-31-date" id="id-2021-03-31-date"></a>

31 Mar 2021

## Participants <a href="#id-2021-03-31-participants" id="id-2021-03-31-participants"></a>

* [Former user (Deleted)](https://pdap.atlassian.net/wiki/people/5f8f95be40588b0077ed830a?ref=confluence)
* [Eddie Brown (Unlicensed)](https://pdap.atlassian.net/wiki/people/5fd63e354d2179006ecbcb80?ref=confluence)
* [Alec Akin](https://pdap.atlassian.net/wiki/people/60319bf02a42cc0069af9ac8?ref=confluence)
* [Richard Ji](https://pdap.atlassian.net/wiki/people/5f8f95be0e068b00766b6903?ref=confluence)

## Discussion topics <a href="#id-2021-03-31-discussiontopics" id="id-2021-03-31-discussiontopics"></a>

| Item                            | Notes                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
| ------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
| Dolt                            | <ul><li>works as POC</li><li>requires scraper</li><li>Richard is waiting for a bug fix from dolt and will merge his branch after that</li></ul>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
| Scrapers                        | <ul><li><p><a href="https://pdap.atlassian.net/wiki/people/5f8f95be40588b0077ed830a?ref=confluence">Former user (Deleted)</a> add “convert data to a known type or create a new type” to scraper instructions</p><ul><li>This is a good amount of work because all would need to be kept up to date</li></ul></li><li><p><a href="https://pdap.atlassian.net/wiki/people/5f8f95be40588b0077ed830a?ref=confluence">Former user (Deleted)</a>We should use <a href="https://github.com/Police-Data-Accessibility-Project/Scrapers">github</a> to store the DDL scripts and reference it in dolthub, should make a folder for them</p><ul><li>people would reference these → their columns would be kept up to date</li></ul></li><li><a href="https://pdap.atlassian.net/wiki/people/5f8f95be0e068b00766b6903?ref=confluence">Richard Ji</a> basic Python style guide for scrapers</li></ul> |
| <p>What data to collect<br></p> | <ul><li>We need a decision / definition of “what we really want”</li><li><p>The decision from last meeting is something like</p><ul><li>Accept all data that is up and out publicly</li><li><strong>Tier and sort and prioritize</strong> data based on good or consistent formatting, but <strong>accept all</strong> legal public data. Omission is not a good start to our process if our goal is “a source of truth for police data.”</li><li>Only surface PII</li><li>Classification should not be a barrier—we only <em>need</em> to classify what is</li></ul></li><li><a href="https://pdap.atlassian.net/wiki/people/5f8f95be40588b0077ed830a?ref=confluence">Former user (Deleted)</a> draft a policy → publish</li></ul>                                                                                                                                                        |
| Business / professional things  | <ul><li>Eddie working with Denice Ross on New Jersey data that’s not public. Someone needs to explain to Eddie what’s going on with New Jersey data → he can use that as more specific rationale for more investigation</li><li>Eddie ensuring our taxes are paid.</li></ul>                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |

## Action items <a href="#id-2021-03-31-actionitems" id="id-2021-03-31-actionitems"></a>

* [Alec Akin](https://pdap.atlassian.net/wiki/people/60319bf02a42cc0069af9ac8?ref=confluence) to dig deeper on what blockers were for getting New Jersey data + communicate to Eddie
* [Richard Ji](https://pdap.atlassian.net/wiki/people/5f8f95be0e068b00766b6903?ref=confluence) basic python style guide for beginner scrapers
