* explain that the corpus is stored in GCS * explain that google-account should be added to the corpus owners * can we do it automatically? * explain how to install gsutil * explain how to download files.