Common Errors
Note: In examples below, replace
<project-name>with your actual project name. The$(whoami)part auto-substitutes your username.
File Issues
Can't Find My Files
Symptoms:
- Files missing from container
- Workspace appears empty
Solutions:
-
Check both locations:
# On hostls ~/workspace/<project-name>/# In containerdocker exec <project-name>._.$(whoami) ls /workspace/ -
Remember the mapping:
Host: ~/workspace/<project-name>/Container: /workspace/ -
Verify workspace mount:
docker inspect <project-name>._.$(whoami) | grep -A 5 "Mounts"
Permission Denied on Files
Symptoms:
$ touch /workspace/file.txt
Permission denied
Solutions:
-
Check ownership:
ls -ld ~/workspace/<project-name>/# Should be owned by you -
Fix permissions (on host):
sudo chown -R $(whoami):$(whoami) ~/workspace/<project-name>/ -
Check disk space:
df -h | grep home
Out of Disk Space
Symptoms:
No space left on device
Solutions:
Best to notify DSL admin by raising an issue ticket in ds01-hub repo. Most user permissions are restricted so you will not be able to do a full clean of docker/disk except from those files related to you (limited).
-
Check usage:
# Workspacedu -sh ~/workspace/*# Dockerdocker system df -
Clean up:
# Remove old projectsrm -rf ~/workspace/old-project/# Clean Dockerdocker image prunedocker system prune# Remove old checkpointsfind ~/workspace -name "checkpoint-*.pt" -mtime +30 -delete
Permission Issues
Docker Permission Denied
Symptoms:
$ docker ps
Permission denied while trying to connect to the Docker daemon socket
Cause: Not in docker group
Solution:
# Check groups
groups | grep docker
If not in docker group, ask DSL admin to add you
Commands Not Found
Symptoms:
$ container-deploy my-project
bash: container-deploy: command not found
Solutions:
-
Check PATH:
echo $PATH | grep ds01 -
Use full path:
/opt/ds01-infra/scripts/user/orchestrators/container-deploy my-project -
Fix PATH:
shell-setupsource ~/.bashrc
Network Issues
Can't Access Jupyter
Symptoms:
- Jupyter running but can't access in browser
Solutions:
-
Check Jupyter is running:
docker exec <project-name>._.$(whoami) ps aux | grep jupyter -
Check port:
docker port <project-name>._.$(whoami) -
Set up SSH tunnel:
# On your laptopssh -L 8888:localhost:8888 <user-id>@ds01# Then access: http://localhost:8888 -
Start Jupyter correctly:
jupyter lab --ip=0.0.0.0 --port=8888 --no-browser
Git Issues
Can't Push to GitHub
Symptoms:
$ git push
Permission denied (publickey)
Solutions:
-
Check SSH key:
ls ~/.ssh/cat ~/.ssh/id_ed25519.pub -
Add key to GitHub:
- Copy public key
- GitHub → Settings → SSH Keys → Add
-
Test connection:
ssh -T git@github.com -
Use HTTPS instead:
git remote set-url origin https://github.com/user/repo.git
Resource Limits
Memory Limit Exceeded
Symptoms:
- Container killed
- OOMKilled in logs
Solutions:
-
Check limits:
check-limits -
Reduce memory usage:
- Process data in chunks
- Use data generators
- Clear variables when done
Error Message Reference
| Error | Meaning | Solution |
|---|---|---|
No GPUs available | All GPUs allocated | Wait or retire old containers |
OOMKilled | Out of memory | Reduce memory usage |
Permission denied | Not in docker group or file permissions | Check groups, fix permissions |
Container not found | Container removed or wrong name | Recreate or check name |
Image not found | Image doesn't exist | Build image first |
Network unreachable | Network issue | Check network, retry |
Quota exceeded | Hit disk quota | Clean up old files |