19-rewrite-upload-all-datasets #23

Open
brent.edwards wants to merge 9 commits from 19-rewrite-upload-all-datasets into 16-error-messages-and-logging
Member

This is the current version of what I'm running to translate `wikidata-truthy`.
aditya approved these changes 2025-12-03 14:51:36 +00:00
Dismissed
khird approved these changes 2025-12-04 20:39:50 +00:00
Dismissed
khird left a comment
First-time contributor

No blocking changes, just suggestions
@ -698,0 +671,4 @@
# Update progress
total_triples += len(chunk)
chunk_count += 1
First-time contributor

You've gone to the trouble of `enumerate()`ing the chunks to get the `chunk_idx`, which is never used, but you're tracking it manually here. This looks rather like the worst of both worlds to be honest.
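
To illustrate the point, here is a minimal sketch of the two consistent alternatives. The function names and the `Triple` alias are hypothetical and not taken from this PR; only `total_triples`, `chunk_count`, `chunk_idx` and `chunk` come from the diff above.

```python
from typing import Iterable, List, Tuple

# Hypothetical stand-in for whatever a chunk actually holds in the PR.
Triple = Tuple[str, str, str]


def count_with_enumerate(chunks: Iterable[List[Triple]]) -> Tuple[int, int]:
    """Option A: let enumerate() supply the counter, no manual increment."""
    total_triples = 0
    chunk_count = 0
    # start=1 makes chunk_idx equal to the number of chunks seen so far.
    for chunk_idx, chunk in enumerate(chunks, start=1):
        total_triples += len(chunk)
        chunk_count = chunk_idx
    return chunk_count, total_triples


def count_manually(chunks: Iterable[List[Triple]]) -> Tuple[int, int]:
    """Option B: drop enumerate() and keep the manual counter."""
    total_triples = 0
    chunk_count = 0
    for chunk in chunks:
        total_triples += len(chunk)
        chunk_count += 1
    return chunk_count, total_triples


sample = [[("s", "p", "o")] * 3, [("s", "p", "o")] * 2]
result_a = count_with_enumerate(sample)
result_b = count_manually(sample)
```

Either option is fine; the suggestion is only to pick one rather than mixing both.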
Author
Member

Fixed!
@ -608,0 +847,4 @@
hf_datasets directory.
Returns:
Whether the conversion was successful.
First-time contributor

This is slightly unusual - I'd think it more natural in Python to return `None` but raise an exception in the case of an unsuccessful conversion, which could contain more information than a simple boolean "there was a failure".
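
As a rough sketch of what that could look like: `ConversionError` and `convert_dataset` below are hypothetical names chosen for illustration, not the functions in this PR, and the existence check stands in for the real conversion work.

```python
from pathlib import Path


class ConversionError(Exception):
    """Hypothetical exception that carries details about a failed conversion."""

    def __init__(self, dataset: str, reason: str) -> None:
        super().__init__(f"conversion of {dataset!r} failed: {reason}")
        self.dataset = dataset
        self.reason = reason


def convert_dataset(dataset: str, source: Path) -> None:
    """Return None on success; raise ConversionError on failure."""
    if not source.exists():
        raise ConversionError(dataset, f"source file {source} does not exist")
    # The actual conversion would happen here (omitted in this sketch).


# The caller decides how to react and gets the reason for free.
try:
    convert_dataset("wikidata-truthy", Path("does-not-exist-12345.nt"))
    message = "ok"
except ConversionError as err:
    message = str(err)
```

Compared with a boolean return, the caller cannot silently ignore the failure, and the exception says which dataset failed and why.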
Author
Member

Fixed!
@ -1269,2 +1074,2 @@
capture_output=False,
)
Returns:
Whether the upload was successful.
First-time contributor

Ditto the above
Author
Member

Fixed!
Author
Member

I have updated the changes with ideas from Kyle. Thank you!
khird approved these changes 2025-12-05 01:45:56 +00:00
aditya approved these changes 2025-12-05 06:03:00 +00:00
Reference
cleverdatasets/dataset-uploader!23