[WIP] Add Docker environment and testing the scripts #4

Open
wants to merge 10 commits into
base: main
from

@Rub21 Rub21 commented Jun 14, 2023

Adding a Docker environment to run the scripts and notebooks with the right versions of the dependencies.

cc. @srmsoumya @batpad @yellowcap

Member

@yellowcap yellowcap left a comment

Nice! 🐳

@Rub21 Rub21 changed the title Add Docker environment [WIP] Add Docker environment and testing the scripts Jun 19, 2023
@@ -92,35 +94,35 @@
np.savez_compressed(wd / "cubesxy" / f"{geojson.stem}.npz", X=cdata.astype("uint16"), attrs=data.imagery.attrs)
continue
Author

@srmsoumya It looks like the code after the continue won't run anymore.
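As a minimal sketch of the concern above (the function and variable names are hypothetical and not taken from the repository), an unconditional continue at the end of the save step skips every statement that follows it in the loop body:

```python
def process(items):
    """Illustrate that statements after an unconditional `continue` never run."""
    saved = []      # stands in for the np.savez_compressed(...) call in the diff
    processed = []  # stands in for whatever code follows the `continue`
    for item in items:
        saved.append(item)
        continue
        # Everything below is unreachable: the loop has already moved on
        # to the next item by the time it would execute.
        processed.append(item)
    return saved, processed


saved, processed = process([1, 2, 3])
print(saved, processed)
```

If the later code is still needed, the continue would have to be removed or placed behind a condition.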

environment.yml Outdated
Member

@Rub21 I think we should clean up the environment.yml file so that it lists only the packages we are using in the repository.

Author

@srmsoumya Yes, that would be good. I set it up that way because I did not want to miss any of the libraries that you used for training the module.

@yellowcap
Member

yellowcap commented Jul 3, 2023

I rebased this PR onto main after changing some file names. I hope I did not break anything. @Rub21 and @srmsoumya, you might have to reset your local branches after this. I hope this does not create too much confusion; otherwise, feel free to undo my rebase.

@Rub21
Author

Rub21 commented Jul 7, 2023

@srmsoumya @yellowcap, I am in the process of reproducing the LULC module training, but I'm having trouble regenerating the same data that you've uploaded to S3.

Firstly, I've created a Docker file that installs all the modules Soumya used for module training. The file can be found here: https://github.com/developmentseed/time-series-for-lulc/blob/env/docker/environment.yml

Secondly, I've executed the data generation script sequentially, following Soumya's guidance.

To verify the consistency of my results with Soumya's, I implemented an MD5 comparison script. However, my results aren't aligning with Soumya's data from S3. In fact, many of the files do not even exist. This discrepancy has led me to question whether you've used all the training geojson files from s3://ds-data-projects/reforestamos/reforestamos_sentinel/geojson/, or if these were specific to a particular region only.
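For reference, an MD5 comparison of this kind could be sketched as follows. This is not the script used in this PR; it assumes both sets of files have already been downloaded to local directories, and the function names and the `*.npz` pattern are illustrative assumptions.

```python
import hashlib
from pathlib import Path


def md5sum(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def compare_dirs(mine, reference, pattern="*.npz"):
    """Compare same-named files in two local directories by MD5 digest."""
    mine_files = {p.name: p for p in Path(mine).glob(pattern)}
    ref_files = {p.name: p for p in Path(reference).glob(pattern)}
    report = {"match": [], "differ": [], "only_mine": [], "only_reference": []}
    for name in sorted(mine_files.keys() | ref_files.keys()):
        if name not in ref_files:
            report["only_mine"].append(name)
        elif name not in mine_files:
            report["only_reference"].append(name)
        elif md5sum(mine_files[name]) == md5sum(ref_files[name]):
            report["match"].append(name)
        else:
            report["differ"].append(name)
    return report
```

Listing the files that exist on only one side separately from the files whose digests differ helps distinguish a missing input (for example, a geojson that was never processed) from a genuinely different output.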

I'd appreciate any insights you could provide to help resolve these issues.

In case it is needed, I am uploading the files to these folders:

s3://ds-labs-lulc/rub21/stacks/
s3://ds-labs-lulc/rub21/cubesxy/

The instructions are also here: https://github.com/developmentseed/time-series-for-lulc/blob/env/docker/start.sh

3 participants