fix(ees-tap): filter out rows with null URN before emitting

The admissions school-level file contains some rows with null school_urn (LA/category aggregates that survive the geographic_level filter). These cause a not-null constraint violation at target-postgres. Drop any row where the URN column is null or empty before yielding records. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-27 10:13:17 +00:00
parent c7357336e3
commit 8e8d1bd8c5
1 changed files with 5 additions and 0 deletions
@@ -95,6 +95,11 @@ class EESDatasetStream(Stream):
        if "geographic_level" in df.columns:
            df = df[df["geographic_level"] == "School"]
        # Drop rows with no URN (LA/category aggregates that slip through the level filter)
        urn_col = self._urn_column
        if urn_col in df.columns:
            df = df[df[urn_col].notna() & (df[urn_col] != "")]
        self.logger.info("Emitting %d school-level rows", len(df))
        for _, row in df.iterrows():