Convert code to a proper Python app. #5

Open
opened 2025-11-19 23:00:33 +00:00 by brent.edwards · 0 comments
Member

We cannot estimate how much time this will take until Aditya, Jeff, and I agree on what needs to be done.

Here are some ideas from Brent:

Medium priority:

  • Unify all "progress" scripts into one file.

  • Make adding new data sources easier.

Low priority:

  • Rework as true streaming: instead of uncompressing from streamable compression methods (like bz2) into a complete file, stream directly from the bz2 file.

  • Find out whether converting fb15k237 CAN be parallelized.

  • Find out whether converting nell-995 CAN be parallelized.

  • Find out whether converting conceptnet CAN be parallelized.

  • Integrate geonames with the rest of the software.

We cannot estimate how much time this will take until Aditya, Jeff, and I agree on what needs to be done. Here are some ideas from Brent: Medium priority: - Unify all "progress" scripts into one file. - Make adding new data sources easier. Low priority: - Rework as true streaming: instead of uncompressing from streamable compression methods (like bz2) into a complete file, stream directly from the bz2 file. - Find out whether converting fb15k237 CAN be parallelized. - Find out whether converting nell-995 CAN be parallelized. - Find out whether converting conceptnet CAN be parallelized. - Integrate `geonames` with the rest of the software.
Sign in to join this conversation.
No milestone
No project
No assignees
1 participant
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Depends on
You do not have permission to read 1 dependency
Reference
cleverdatasets/dataset-uploader#5
No description provided.