feat(pipeline): add Meltano + dbt + Airflow ELT pipeline scaffold
Replaces the hand-rolled integrator with a production-grade ELT pipeline
using Meltano (Singer taps), dbt Core (medallion architecture), and
Apache Airflow (orchestration). Adds Typesense for search and PostGIS
for geospatial queries.
- 6 custom Singer taps (GIAS, EES, Ofsted, Parent View, FBIT, IDACI)
- dbt project: 12 staging, 5 intermediate, 12 mart models
- 3 Airflow DAGs (daily/monthly/annual schedules)
- Typesense sync + batch geocoding scripts
- docker-compose: add Airflow, Typesense; upgrade to PostGIS
- Portainer stack definition matching live deployment topology
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-26 08:37:53 +00:00
|
|
|
project_id: school-compare-pipeline
|
2026-03-26 14:01:31 +00:00
|
|
|
default_environment: production
|
feat(pipeline): add Meltano + dbt + Airflow ELT pipeline scaffold
Replaces the hand-rolled integrator with a production-grade ELT pipeline
using Meltano (Singer taps), dbt Core (medallion architecture), and
Apache Airflow (orchestration). Adds Typesense for search and PostGIS
for geospatial queries.
- 6 custom Singer taps (GIAS, EES, Ofsted, Parent View, FBIT, IDACI)
- dbt project: 12 staging, 5 intermediate, 12 mart models
- 3 Airflow DAGs (daily/monthly/annual schedules)
- Typesense sync + batch geocoding scripts
- docker-compose: add Airflow, Typesense; upgrade to PostGIS
- Portainer stack definition matching live deployment topology
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-26 08:37:53 +00:00
|
|
|
|
|
|
|
|
plugins:
|
|
|
|
|
extractors:
|
|
|
|
|
- name: tap-uk-gias
|
|
|
|
|
namespace: uk_gias
|
|
|
|
|
pip_url: ./plugins/extractors/tap-uk-gias
|
|
|
|
|
executable: tap-uk-gias
|
|
|
|
|
settings:
|
|
|
|
|
- name: download_url
|
|
|
|
|
kind: string
|
|
|
|
|
description: GIAS bulk CSV download URL
|
|
|
|
|
|
|
|
|
|
- name: tap-uk-ees
|
|
|
|
|
namespace: uk_ees
|
|
|
|
|
pip_url: ./plugins/extractors/tap-uk-ees
|
|
|
|
|
executable: tap-uk-ees
|
|
|
|
|
settings:
|
|
|
|
|
- name: base_url
|
|
|
|
|
kind: string
|
|
|
|
|
value: https://content.explore-education-statistics.service.gov.uk/api/v1
|
|
|
|
|
- name: datasets
|
|
|
|
|
kind: array
|
|
|
|
|
description: List of EES dataset configs to extract
|
2026-03-31 22:44:11 +01:00
|
|
|
- name: legacy_ks2_urls
|
|
|
|
|
kind: object
|
|
|
|
|
description: "Year code → URL mapping for legacy KS2 CSVs"
|
|
|
|
|
config:
|
|
|
|
|
legacy_ks2_urls:
|
|
|
|
|
"201516": "http://10.0.1.224:8081/filebrowser/api/public/dl/R9jjXFWa?inline=true"
|
|
|
|
|
"201617": "http://10.0.1.224:8081/filebrowser/api/public/dl/tIwJPVQS?inline=true"
|
|
|
|
|
"201718": "http://10.0.1.224:8081/filebrowser/api/public/dl/GO7SKE0p?inline=true"
|
|
|
|
|
"201819": "http://10.0.1.224:8081/filebrowser/api/public/dl/jchDEHsv?inline=true"
|
feat(pipeline): add Meltano + dbt + Airflow ELT pipeline scaffold
Replaces the hand-rolled integrator with a production-grade ELT pipeline
using Meltano (Singer taps), dbt Core (medallion architecture), and
Apache Airflow (orchestration). Adds Typesense for search and PostGIS
for geospatial queries.
- 6 custom Singer taps (GIAS, EES, Ofsted, Parent View, FBIT, IDACI)
- dbt project: 12 staging, 5 intermediate, 12 mart models
- 3 Airflow DAGs (daily/monthly/annual schedules)
- Typesense sync + batch geocoding scripts
- docker-compose: add Airflow, Typesense; upgrade to PostGIS
- Portainer stack definition matching live deployment topology
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-26 08:37:53 +00:00
|
|
|
|
|
|
|
|
- name: tap-uk-ofsted
|
|
|
|
|
namespace: uk_ofsted
|
|
|
|
|
pip_url: ./plugins/extractors/tap-uk-ofsted
|
|
|
|
|
executable: tap-uk-ofsted
|
|
|
|
|
settings:
|
|
|
|
|
- name: mi_url
|
|
|
|
|
kind: string
|
|
|
|
|
description: Ofsted Management Information download URL
|
|
|
|
|
|
|
|
|
|
- name: tap-uk-parent-view
|
|
|
|
|
namespace: uk_parent_view
|
|
|
|
|
pip_url: ./plugins/extractors/tap-uk-parent-view
|
|
|
|
|
executable: tap-uk-parent-view
|
|
|
|
|
|
|
|
|
|
- name: tap-uk-fbit
|
|
|
|
|
namespace: uk_fbit
|
|
|
|
|
pip_url: ./plugins/extractors/tap-uk-fbit
|
|
|
|
|
executable: tap-uk-fbit
|
|
|
|
|
settings:
|
|
|
|
|
- name: base_url
|
|
|
|
|
kind: string
|
|
|
|
|
value: https://financial-benchmarking-and-insights-tool.education.gov.uk/api
|
|
|
|
|
|
|
|
|
|
- name: tap-uk-idaci
|
|
|
|
|
namespace: uk_idaci
|
|
|
|
|
pip_url: ./plugins/extractors/tap-uk-idaci
|
|
|
|
|
executable: tap-uk-idaci
|
|
|
|
|
|
|
|
|
|
loaders:
|
|
|
|
|
- name: target-postgres
|
2026-03-26 09:18:11 +00:00
|
|
|
variant: meltanolabs
|
|
|
|
|
pip_url: meltanolabs-target-postgres
|
feat(pipeline): add Meltano + dbt + Airflow ELT pipeline scaffold
Replaces the hand-rolled integrator with a production-grade ELT pipeline
using Meltano (Singer taps), dbt Core (medallion architecture), and
Apache Airflow (orchestration). Adds Typesense for search and PostGIS
for geospatial queries.
- 6 custom Singer taps (GIAS, EES, Ofsted, Parent View, FBIT, IDACI)
- dbt project: 12 staging, 5 intermediate, 12 mart models
- 3 Airflow DAGs (daily/monthly/annual schedules)
- Typesense sync + batch geocoding scripts
- docker-compose: add Airflow, Typesense; upgrade to PostGIS
- Portainer stack definition matching live deployment topology
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-26 08:37:53 +00:00
|
|
|
config:
|
|
|
|
|
host: $PG_HOST
|
|
|
|
|
port: $PG_PORT
|
|
|
|
|
user: $PG_USER
|
|
|
|
|
password: $PG_PASSWORD
|
2026-03-26 13:53:49 +00:00
|
|
|
database: $PG_DATABASE
|
feat(pipeline): add Meltano + dbt + Airflow ELT pipeline scaffold
Replaces the hand-rolled integrator with a production-grade ELT pipeline
using Meltano (Singer taps), dbt Core (medallion architecture), and
Apache Airflow (orchestration). Adds Typesense for search and PostGIS
for geospatial queries.
- 6 custom Singer taps (GIAS, EES, Ofsted, Parent View, FBIT, IDACI)
- dbt project: 12 staging, 5 intermediate, 12 mart models
- 3 Airflow DAGs (daily/monthly/annual schedules)
- Typesense sync + batch geocoding scripts
- docker-compose: add Airflow, Typesense; upgrade to PostGIS
- Portainer stack definition matching live deployment topology
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-26 08:37:53 +00:00
|
|
|
default_target_schema: raw
|
|
|
|
|
|
|
|
|
|
utilities:
|
|
|
|
|
- name: dbt-postgres
|
|
|
|
|
variant: dbt-labs
|
2026-03-26 09:18:11 +00:00
|
|
|
pip_url: dbt-postgres~=1.10
|
feat(pipeline): add Meltano + dbt + Airflow ELT pipeline scaffold
Replaces the hand-rolled integrator with a production-grade ELT pipeline
using Meltano (Singer taps), dbt Core (medallion architecture), and
Apache Airflow (orchestration). Adds Typesense for search and PostGIS
for geospatial queries.
- 6 custom Singer taps (GIAS, EES, Ofsted, Parent View, FBIT, IDACI)
- dbt project: 12 staging, 5 intermediate, 12 mart models
- 3 Airflow DAGs (daily/monthly/annual schedules)
- Typesense sync + batch geocoding scripts
- docker-compose: add Airflow, Typesense; upgrade to PostGIS
- Portainer stack definition matching live deployment topology
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-26 08:37:53 +00:00
|
|
|
config:
|
|
|
|
|
project_dir: $MELTANO_PROJECT_ROOT/transform
|
|
|
|
|
profiles_dir: $MELTANO_PROJECT_ROOT/transform
|
|
|
|
|
|
|
|
|
|
environments:
|
|
|
|
|
- name: dev
|
|
|
|
|
config:
|
|
|
|
|
plugins:
|
|
|
|
|
loaders:
|
|
|
|
|
- name: target-postgres
|
|
|
|
|
config:
|
|
|
|
|
host: localhost
|
|
|
|
|
port: 5432
|
|
|
|
|
user: postgres
|
|
|
|
|
password: postgres
|
2026-03-26 13:53:49 +00:00
|
|
|
database: school_compare
|
feat(pipeline): add Meltano + dbt + Airflow ELT pipeline scaffold
Replaces the hand-rolled integrator with a production-grade ELT pipeline
using Meltano (Singer taps), dbt Core (medallion architecture), and
Apache Airflow (orchestration). Adds Typesense for search and PostGIS
for geospatial queries.
- 6 custom Singer taps (GIAS, EES, Ofsted, Parent View, FBIT, IDACI)
- dbt project: 12 staging, 5 intermediate, 12 mart models
- 3 Airflow DAGs (daily/monthly/annual schedules)
- Typesense sync + batch geocoding scripts
- docker-compose: add Airflow, Typesense; upgrade to PostGIS
- Portainer stack definition matching live deployment topology
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-26 08:37:53 +00:00
|
|
|
- name: production
|
|
|
|
|
config:
|
|
|
|
|
plugins:
|
|
|
|
|
loaders:
|
|
|
|
|
- name: target-postgres
|
|
|
|
|
config:
|
|
|
|
|
host: ${PG_HOST}
|
|
|
|
|
port: ${PG_PORT}
|
|
|
|
|
user: ${PG_USER}
|
|
|
|
|
password: ${PG_PASSWORD}
|
2026-03-26 13:53:49 +00:00
|
|
|
database: ${PG_DATABASE}
|