This is the technical documentation of our project with RSIM of TU BERLIN.

Setup¶

First install the conda environment at cluster_instructions/conda_env.yml and activate it.

Data Set (Deep Globe Patches)¶

Then download the Deepglobe dataset from KAGGLE. Place it in the project with the path data/deepglobe. Then run the patch-sampling pipeline: data_pipeline/deepglobe/patch_sampling.py. This creates LMDB files for the train, test and valid set at data/deepglobe_patches/[train/test/valid]/.

Word Embeddings¶

The folder data/glove contains the embeddings for the deepglobe-labels for embeddings spaces of 50 and 300. If other embeddings are necessary, or a different dataset is used. These can be created by modifying the src/wordembedding/glove.py and running it. To use this downlaod the glove txt files from Stanford. Currently d = [50,100,200,300] are available there. For different embedding size, retrain the glove model.

Run Example¶

The most important parameters are contained in this exemplary run. For all parameters check out src/config_args.py, for our parameterized runs check out cluster_instructions/<model>_<loss>.sh

python main.py -model CbMLC -loss weighted_bce -optim sgd -d_model 50-lr 0.0001 -add_noise 0.1 -sub_noise 0.1