Integrate Google Sheets for Dataset Processing Status Tracking #51

Open
opened 2026-01-16 14:27:50 +00:00 by aditya · 0 comments
Member

Description:
Add optional Google Sheets integration to track dataset processing status (NOT STARTED, IN PROGRESS, DONE) and checkpoint data (last completed shard, total rows, skip counts). The system should automatically update status during processing and enable resume from sheet-stored checkpoints.

Acceptance Criteria:

  • Track processing status in Google Sheets
  • Store checkpoint data (shard number, row count, skip count) in sheet columns
  • Auto-reset to NOT STARTED with error message on failures
  • Support resume from sheet checkpoints when local checkpoint file unavailable
**Description:** Add optional Google Sheets integration to track dataset processing status (NOT STARTED, IN PROGRESS, DONE) and checkpoint data (last completed shard, total rows, skip counts). The system should automatically update status during processing and enable resume from sheet-stored checkpoints. **Acceptance Criteria:** - Track processing status in Google Sheets - Store checkpoint data (shard number, row count, skip count) in sheet columns - Auto-reset to NOT STARTED with error message on failures - Support resume from sheet checkpoints when local checkpoint file unavailable
aditya self-assigned this 2026-01-16 14:27:51 +00:00
Sign in to join this conversation.
No milestone
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference
cleverdatasets/dataset-uploader#51
No description provided.