DevContainer setup for VS Code and PyCharm - zero-configuration dataset upload environment #59

Open
opened 2026-01-29 11:48:23 +00:00 by aditya · 0 comments
Member

Description:

Provide a preconfigured DevContainer that lets developers open the project in VS Code or PyCharm and start uploading datasets immediately, with no manual setup. The container ships Python 3.13, the required runtime dependencies (rdflib, huggingface-hub, pyarrow), and the development tools (ruff, pyright, nox); project dependencies are installed automatically when the container is created. After opening the devcontainer, developers can run the dataset upload scripts directly, without installing Python packages, configuring environment variables, or setting up credentials by hand.
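
As a rough starting point, the configuration described above could be generated with a small script like the one below. This is only a sketch under assumptions: the base image tag, the editable `pip install -e` form, the container name, the extension IDs, and the `PYTHONPATH` value are illustrative choices, not decisions recorded in this issue. The helper names `build_devcontainer_config` and `write_devcontainer` are hypothetical.

```python
import json
from pathlib import Path


def build_devcontainer_config() -> dict:
    """Return a devcontainer.json payload. All values are illustrative assumptions."""
    return {
        "name": "dataset-uploader",
        # Assumed base image; the issue only states that Python 3.13 is required.
        "image": "mcr.microsoft.com/devcontainers/python:3.13",
        # Installs the project with the extras named in the acceptance criteria.
        # The editable install (-e) is an assumption.
        "postCreateCommand": "pip install -e '.[tests,docs]'",
        # Makes the workspace root importable inside the container.
        "containerEnv": {"PYTHONPATH": "${containerWorkspaceFolder}"},
        "customizations": {
            "vscode": {
                # Assumed extension choice for Python and ruff support.
                "extensions": ["ms-python.python", "charliermarsh.ruff"],
            }
        },
    }


def write_devcontainer(root: Path) -> Path:
    """Write .devcontainer/devcontainer.json under root and return its path."""
    target = root / ".devcontainer" / "devcontainer.json"
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text(json.dumps(build_devcontainer_config(), indent=2) + "\n")
    return target
```

Calling `write_devcontainer(Path.cwd())` from the repository root would produce the file that both IDEs look for. Generating it from Python is optional; the same keys can be written by hand.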

Acceptance Criteria:

[ ] DevContainer works in both VS Code (with Dev Containers extension) and PyCharm Professional
[ ] All Python dependencies are installed automatically via postCreateCommand (project dependencies with the [tests,docs] extras)
[ ] Python environment is configured correctly (PYTHONPATH, interpreter paths)
[ ] User can run python scripts/upload_all_datasets.py or python scripts/rdf_to_hf_incremental.py immediately after opening the container
[ ] Documentation is updated with quick-start instructions for both IDEs
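
The environment-related criteria above could be spot-checked from inside the container with a small script along these lines. It is a sketch, not part of the project: the function names are hypothetical, and the import names (`huggingface_hub` for the `huggingface-hub` package) are the usual ones but are assumed here.

```python
import importlib.util
import sys

# Import names assumed for the packages listed in the issue.
REQUIRED_MODULES = ("rdflib", "huggingface_hub", "pyarrow")
MIN_PYTHON = (3, 13)


def missing_modules(names=REQUIRED_MODULES) -> list[str]:
    """Return the top-level module names that cannot be found on sys.path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def check_environment() -> list[str]:
    """Collect human-readable problems; an empty list means the checks passed."""
    problems = []
    if sys.version_info[:2] < MIN_PYTHON:
        found = ".".join(str(part) for part in sys.version_info[:2])
        wanted = ".".join(str(part) for part in MIN_PYTHON)
        problems.append(f"Python {wanted}+ expected, found {found}")
    problems.extend(f"missing module: {name}" for name in missing_modules())
    return problems


def main() -> int:
    """Print any problems and return a process exit code (0 on success)."""
    problems = check_environment()
    for problem in problems:
        print(problem)
    return 1 if problems else 0
```

Running `raise SystemExit(main())` at the end of such a script gives a non-zero exit code when something is missing, which makes it easy to call right after `postCreateCommand` finishes.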

aditya self-assigned this 2026-01-29 11:48:23 +00:00

Reference
cleverdatasets/dataset-uploader#59