Enhance survey data processing to clean and link participant organization records #2

Closed
opened 2025-05-05 22:51:07 -04:00 by nick · 0 comments
Owner

Description:

We need to improve the processing of the 25 Million Trees Initiative survey data to:

  • Exclude responses with missing organization information
  • Replace entries labeled as "Other" with the provided custom text
  • Standardize organization names by replacing underscores with spaces
  • Link survey records with associated participant organization data via GlobalID

This enhancement will ensure cleaner, more usable data for analysis and reporting.
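
The steps above can be sketched roughly as follows. This is a minimal pandas illustration, not the project's actual implementation: the column names `participant_org` and `participant_org_other`, the function name `clean_and_link`, and the assumption that both tables carry a `GlobalID` column with one row per ID in `participant_organizations` are all hypothetical.

```python
import pandas as pd


def clean_and_link(surveys: pd.DataFrame,
                   participant_organizations: pd.DataFrame) -> pd.DataFrame:
    """Clean organization names and join on GlobalID (hypothetical schema)."""
    df = surveys.copy()
    org = df["participant_org"].astype("string").str.strip()
    other = df["participant_org_other"].astype("string").str.strip()

    # Replace entries labeled "Other" with the provided custom text.
    is_other = org.str.lower().eq("other").fillna(False).astype(bool)
    org = org.mask(is_other, other)

    # Treat empty strings as missing, then replace underscores with spaces.
    is_empty = org.eq("").fillna(False).astype(bool)
    org = org.mask(is_empty)
    df["participant_org"] = org.str.replace("_", " ", regex=False)

    # Exclude responses with no usable organization information.
    df = df.dropna(subset=["participant_org"])

    # Link to participant organization data; assumes GlobalID is unique
    # on the participant_organizations side.
    return df.merge(participant_organizations, on="GlobalID",
                    how="left", validate="many_to_one")
```

A left join keeps every cleaned survey record even when no matching organization row exists; an inner join would be the alternative if unmatched records should be dropped.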

Acceptance Criteria:

  • Only survey records with a valid participant organization (selected explicitly or entered as custom "Other" text) are kept
  • Underscores in organization names are replaced by spaces
  • Data is joined correctly with participant_organizations
  • Resulting dataset is ready for analysis or export
nick added the enhancement label 2025-05-05 22:51:07 -04:00
nick self-assigned this 2025-05-05 22:51:15 -04:00
nick added reference enhancement/clean-survey-participant-orgs 2025-05-05 22:52:23 -04:00
nick closed this issue 2025-05-05 22:53:51 -04:00
Reference: nick/tree-tracker-report#2