Police Data Access Point Docs
Data Sources data dictionary

Last updated 9 months ago

To see which options are available for select fields, consult the submission form.

Required properties for submission

submitted_name, submitter_contact_info, record_type, agency_supplied (+ other provenance properties, if "no")
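
As a rough illustration of the rule above, the following sketch lists which required properties a submission is still missing. It assumes a submission is a plain Python dict, and it assumes the additional provenance properties are supplying_entity, agency_originated, and originating_entity (the other properties in the Provenance section). The function name `missing_required` is hypothetical and not part of any PDAP tool.

```python
# Properties this page lists as required for every submission.
REQUIRED = (
    "submitted_name",
    "submitter_contact_info",
    "record_type",
    "agency_supplied",
)

# Assumption: the remaining Provenance properties, needed only when
# agency_supplied is "no" (False).
PROVENANCE_IF_NOT_AGENCY_SUPPLIED = (
    "supplying_entity",
    "agency_originated",
    "originating_entity",
)


def missing_required(submission: dict) -> list:
    """Return the names of required properties that are absent or empty."""

    def is_blank(key: str) -> bool:
        value = submission.get(key)
        # False is a legitimate boolean answer, so only None and "" count as blank.
        return value is None or value == ""

    missing = [key for key in REQUIRED if is_blank(key)]
    if submission.get("agency_supplied") is False:
        missing += [key for key in PROVENANCE_IF_NOT_AGENCY_SUPPLIED if is_blank(key)]
    return missing
```

An empty list means the submission carries everything this page marks as required; it does not check that the values themselves are valid options.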

What is it?

| Property | Type | Description |
| --- | --- | --- |
| name | string | Uses submitted_name if present; otherwise concatenates record_type + " for " + agency_described. The result can be malformed when one or both of those values are missing. |
| submitted_name | string | Required for individual Data Source submissions, for clarity. |
| record_type | string | |
| tags | array | Are there any keyword descriptors that might help people find this in a search? Try to limit tags to information that can't be captured in other properties. |
| description | string (textarea) | Information to give clarity and confidence about what this source is, how it was processed, and whether the person reading the description might want to use it. Especially important if the source is difficult to preview or categorize. |
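
The naming rule for the name property can be sketched as follows. This is only an illustration under stated assumptions: `derive_name` is a hypothetical helper, and agency_described is treated here as an already-resolved agency display name (a string) rather than the array of Airtable IDs it is stored as.

```python
def derive_name(submitted_name, record_type, agency_described):
    """Prefer submitted_name; otherwise build '<record_type> for <agency>'."""
    if submitted_name:
        return submitted_name
    # Missing parts are rendered as empty strings, which is how the
    # odd-looking names mentioned above can arise.
    return f"{record_type or ''} for {agency_described or ''}"


print(derive_name("Example submitted name", "Example Type", "Example Agency"))
# prints "Example submitted name"
print(derive_name(None, "Example Type", "Example Agency"))
# prints "Example Type for Example Agency"
print(repr(derive_name(None, "Example Type", None)))
# prints 'Example Type for ' (note the dangling " for ")
```

The last call shows why a submitted_name is worth providing: with either part missing, the concatenated name is incomplete.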

Agency

| Property | Type | Description |
| --- | --- | --- |
| agency_described | array (foreign key based on an agency's ID within Airtable) | To which criminal legal system agency or agencies does this Data Source refer? |
| agency_aggregation | array | If present, the Data Source describes multiple agencies. Can be an item like local or county. |
| state | string | 2-character ISO code, related to the associated Agency object, if present. |
| county | string | Related to the associated Agency object, if present. |
| municipality | string | Related to the associated Agency object, if present. |
| agency_type | string | Related to the associated Agency object, if present. |
| jurisdiction_type | array | Related to the associated Agency object, if present. What is the highest level of jurisdiction for the agency? Can be an item like local or county. |

Provenance

Where did it come from?

| Property | Type | Description |
| --- | --- | --- |
| agency_supplied | boolean | Is the relevant Agency also the entity supplying the data? This may be "no" if the Agency or local government contracted with a third party to publish this data, or if a third party was the original record-keeper. |
| supplying_entity | string | If the Agency didn't publish this, who did? |
| agency_originated | boolean | Is the relevant Agency also the original record-keeper? This is usually "yes", unless a third party collected data about a police Agency. |
| originating_entity | string | If the Agency was not the original record-keeper, who was? |

Access & format

| Property | Type | Description |
| --- | --- | --- |
| source_url | string | A URL where these records can be found or are referenced. |
| readme_url | string | A URL where supplementary information about the source is published. |
| access_type | array | Array items can have values such as Web page or API. |
| record_format | array | What format(s) are the records in natively? Array items can have values such as CSV, JSON, XML, RDF, RSS, HTML table, and others. |
| detail_level | array | Is this an individual record, an aggregated set of records, or a summary without underlying data? |
| size | string | The file size on disk of all the data at this source, if downloaded. |
| data_portal_type | string | Some data is published via a standard third-party portal, typically named somewhere on the page. |
| access_notes | string | Is anything special required to access the data? |
| last_cached | date | |

Coverage & retention

| Property | Type | Description |
| --- | --- | --- |
| coverage_start | date | The earliest date covered by this source, if known, in the format YYYY-MM-DD. |
| coverage_end | date | The date at which updates stop, in the format YYYY-MM-DD. |
| source_last_updated | date | The date this source was last updated, in the format YYYY-MM-DD. |
| update_frequency | array | How often is this data source updated? |
| update_method | array | Are records replaced (Overwrite) or added (Insert)? |
| retention_schedule | array | How long are records kept? Are there published guidelines regarding how long important information must remain accessible for future use? |
| number_of_records_available | integer | How many similar pieces of information are available at this source? |
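
A minimal sketch of how the coverage dates might be parsed and sanity-checked. The helper names are hypothetical, and the default format string (year, month, day) is an assumption; pass a different `fmt` if the stored values are ordered differently.

```python
from datetime import datetime

# Assumption: dates are stored year-month-day, e.g. "2021-06-30".
DATE_FORMAT = "%Y-%m-%d"


def parse_coverage_date(value, fmt=DATE_FORMAT):
    """Parse a coverage date string; return None when the value is absent."""
    if not value:
        return None
    return datetime.strptime(value, fmt).date()


def coverage_is_consistent(coverage_start, coverage_end, fmt=DATE_FORMAT):
    """True unless both dates are known and the start falls after the end."""
    start = parse_coverage_date(coverage_start, fmt)
    end = parse_coverage_date(coverage_end, fmt)
    if start is None or end is None:
        return True  # nothing to compare when either date is unknown
    return start <= end
```

Because both properties are optional ("if known"), the check treats a missing date as consistent rather than as an error. A value that does not match the format raises ValueError from `datetime.strptime`.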

Meta & utility

| Property | Type | Description | Default value |
| --- | --- | --- | --- |
| scraper_url | string | The URL of any web scraping efforts associated with this Data Source. | |
| url_status | array | The status of the source_url, including options like ok, none found, broken. | "ok" |
| approval_status | array | Set manually by the PDAP team; statuses include approved, rejected, needs identification. | null |
| data_source_created | datetime | The date this source was first created in our database. Date of data source submission, in the format YYYY-MM-DD. | |
| agency_described_linked_uid | string | The Airtable-generated UID of an associated Agency. | |
| airtable_uid | string | The Airtable-generated UID of this particular data source. | |
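
The default values listed above could be applied to a partial record along these lines. This is a sketch only: `with_defaults` is a hypothetical helper, records are assumed to be plain dicts, and the JSON-style null is represented as Python's None.

```python
# Defaults taken from the "Default value" column above.
DEFAULTS = {
    "url_status": "ok",
    "approval_status": None,  # null until set manually by the PDAP team
}


def with_defaults(record: dict) -> dict:
    """Return a copy of record with documented defaults filled in."""
    # Values already present in the record take precedence over defaults.
    return {**DEFAULTS, **record}


print(with_defaults({"source_url": "https://example.com/data"}))
# prints {'url_status': 'ok', 'approval_status': None, 'source_url': 'https://example.com/data'}
```

The input dict is left unchanged, so the same partial record can be reused safely.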

record_type: What kind of data is accessible from this source? For more info, see the Record Types taxonomy.

last_cached: When was this last archived by our automated archives app? Formatted as YYYY-MM-DD.