DevContainer setup for VS Code and PyCharm - zero-configuration dataset upload environment #59
Labels
No labels
Blocked
Bounty
$100
Bounty
$1000
Bounty
$10000
Bounty
$20
Bounty
$2000
Bounty
$250
Bounty
$50
Bounty
$500
Bounty
$5000
Bounty
$750
MoSCoW
Could have
MoSCoW
Must have
MoSCoW
Should have
Needs feedback
Points
1
Points
13
Points
2
Points
21
Points
3
Points
34
Points
5
Points
55
Points
8
Points
88
Priority
Backlog
Priority
Critical
Priority
High
Priority
Low
Priority
Medium
Signed-off: Owner
Signed-off: Scrum Master
Signed-off: Tech Lead
Spike
State
Completed
State
Duplicate
State
In Progress
State
In Review
State
Paused
State
Unverified
State
Verified
State
Wont Do
Type
Bug
Type
Discussion
Type
Documentation
Type
Epic
Type
Feature
Type
Legendary
Type
Support
Type
Task
Type
Testing
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
cleverdatasets/dataset-uploader#59
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Description:
Provide a preconfigured DevContainer environment that allows developers to open the project in VS Code or PyCharm and immediately start uploading datasets without manual setup. The container includes Python 3.13, all required dependencies (rdflib, huggingface-hub, pyarrow), development tools (ruff, pyright, nox), and automatically installs project dependencies on container creation. Developers can open the devcontainer and run dataset upload scripts directly without installing Python packages, configuring environment variables, or setting up credentials manually.
Acceptance Criteria:
[ ] DevContainer works in both VS Code (with Dev Containers extension) and PyCharm Professional
[ ] All Python dependencies automatically installed via postCreateCommand (project dependencies with [tests,docs] extras)
[ ] Python environment configured correctly (PYTHONPATH, interpreter paths)
[ ] User can immediately run python scripts/upload_all_datasets.py or python scripts/rdf_to_hf_incremental.py after opening container
[ ] Documentation updated with quick-start instructions for both IDEs