Methodology & Data Sources

Data Source

[PortalName] uses data from [Source Agency] ([Dataset Name]). [1-2 sentences about why this source is authoritative — e.g., "This is the official federal dataset mandated by Congress since 1984, covering all 50 states."]

Data Vintage & Coverage

Our current data covers [Year/Range]. [Source Agency] publishes updated data [Update Frequency]. We refresh our database within [timeframe, e.g., "2 weeks"] of each new release.

  • Geographic coverage: [e.g., "All 50 US states + DC + territories"]
  • Entity count: [e.g., "13,138 school districts"]
  • Time span: [e.g., "Fiscal year 2022-2023"]

How We Process the Data

Our data pipeline transforms raw government data into searchable pages:

  1. Download raw data files from [Source Agency] ([format, e.g., "CSV bulk download"])
  2. [Processing step — e.g., "Parse and validate 232,000 employer records"]
  3. [Processing step — e.g., "Compute average salary per district from total salaries ÷ FTE count"]
  4. [Processing step — e.g., "Generate geographic groupings by state and metro area"]
  5. Load into our optimized database with full-text search indexing

No data is fabricated, interpolated, or editorially modified. All values come directly from the source dataset. Where we compute derived metrics (averages, rankings, percentages), the formula is documented on the relevant page.

Limitations

  • [Limitation — e.g., "Data may be up to 12 months behind the current date due to the annual release cycle"]
  • [Limitation — e.g., "Small districts with fewer than 3 staff may have suppressed salary data for privacy"]
  • [Limitation — e.g., "Self-reported data by employers — we cannot independently verify accuracy"]
  • This data is provided for informational purposes only.

Contact

Questions about our methodology or data? Contact us — we welcome feedback and corrections.