|
36 | 36 | "id": "HtysPAVSvcMg"
|
37 | 37 | },
|
38 | 38 | "source": [
|
39 |     | - "# 🌦️ Weather forecasting\n",
    | 39 | + "# 🌦️ Weather forecasting -- _Dataset_\n",
40 | 40 | "\n",
|
41 | 41 | "[![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/GoogleCloudPlatform/python-docs-samples/blob/main/people-and-planet-ai/weather-forecasting/notebooks/2-dataset.ipynb)\n",
|
42 | 42 | "\n",
|
|
137 | 137 | },
|
138 | 138 | {
|
139 | 139 | "cell_type": "code",
|
140 |     | - "execution_count": 3,
    | 140 | + "execution_count": null,
141 | 141 | "metadata": {
|
142 | 142 | "id": "xGXRHJ9TFs24"
|
143 | 143 | },
|
|
282 | 282 | "\n",
|
283 | 283 | "Once we have bins for both precipitation and elevation, we combine them into a single \"unique\" bin value to make sure we get all the possible precipitation values for each elevation.\n",
|
284 | 284 | "\n",
|
285 |     | - "In [`create_dataset.py`](create_dataset.py) we defined a function called `sample_points` that gives us a balanced selection of `(longitude, latitude)` coordinates for a given date."
    | 285 | + "In [`create_dataset.py`](../create_dataset.py) we defined a function called `sample_points` that gives us a balanced selection of `(longitude, latitude)` coordinates for a given date."
286 | 286 | ],
|
287 | 287 | "id": "hWq2BMYMcAEj"
|
288 | 288 | },
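To make the combined-bin idea concrete, here is a minimal sketch (not the actual code in `create_dataset.py`) of how a precipitation bin and an elevation bin can be folded into one "unique" bin and then sampled evenly; the bin count, array shapes, and helper names are assumptions for illustration.

```python
import numpy as np

NUM_BINS = 10  # assumed; matches the --num-bins default mentioned later in the notebook

def combined_bins(precipitation: np.ndarray, elevation: np.ndarray) -> np.ndarray:
    """Fold per-pixel precipitation and elevation bins into a single bin value."""
    precip_edges = np.linspace(precipitation.min(), precipitation.max(), NUM_BINS)
    elev_edges = np.linspace(elevation.min(), elevation.max(), NUM_BINS)
    precip_bin = np.digitize(precipitation, precip_edges)  # values fall in [0, NUM_BINS]
    elev_bin = np.digitize(elevation, elev_edges)
    # Every (elevation bin, precipitation bin) pair gets its own combined value.
    return elev_bin * (NUM_BINS + 1) + precip_bin

def sample_balanced(points: np.ndarray, bins: np.ndarray, rng=np.random.default_rng(0)):
    """Pick one random (longitude, latitude) point from each combined bin."""
    return np.stack([
        points[rng.choice(np.flatnonzero(bins == b))]
        for b in np.unique(bins)
    ])
```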
|
289 | 289 | {
|
290 | 290 | "cell_type": "code",
|
291 |     | - "execution_count": 4,
    | 291 | + "execution_count": null,
292 | 292 | "metadata": {
|
293 | 293 | "colab": {
|
294 | 294 | "base_uri": "https://localhost:8080/"
|
|
369 | 369 | "We predefined that all our training examples would be 5 pixels wide by 5 pixels tall, but we could choose any size as long as the model accepts it.\n",
|
370 | 370 | "We also want all the training examples to be the same size so we can batch them.\n",
|
371 | 371 | "\n",
|
372 |     | - "In [`create_dataset.py`](create_dataset.py) we defined `get_training_example`, which fetches an `(inputs, labels)` pair for the given date and (longitude, latitude) coordinate.\n",
    | 372 | + "In [`create_dataset.py`](../create_dataset.py) we defined `get_training_example`, which fetches an `(inputs, labels)` pair for the given date and (longitude, latitude) coordinate.\n",
373 | 373 | "Let's see what a 64x64 patch looks like, since a 5x5 patch will only look like a bunch of random pixels to us."
|
374 | 374 | ],
|
375 | 375 | "id": "W5mr765Ahsd5"
|
376 | 376 | },
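As a side note on why the fixed patch size matters: equally sized `(inputs, labels)` patches can be stacked straight into a batch. The band counts and batch size below are made-up numbers, not values taken from `get_training_example`.

```python
import numpy as np

# Pretend we fetched 8 examples, each a 5x5 patch (band counts here are invented).
inputs = [np.random.rand(5, 5, 52) for _ in range(8)]
labels = [np.random.rand(5, 5, 2) for _ in range(8)]

# Same shape everywhere, so the examples stack cleanly into one batch.
inputs_batch = np.stack(inputs)  # (8, 5, 5, 52)
labels_batch = np.stack(labels)  # (8, 5, 5, 2)
print(inputs_batch.shape, labels_batch.shape)
```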
|
377 | 377 | {
|
378 | 378 | "cell_type": "code",
|
379 |     | - "execution_count": 5,
    | 379 | + "execution_count": null,
380 | 380 | "metadata": {
|
381 | 381 | "colab": {
|
382 | 382 | "base_uri": "https://localhost:8080/"
|
|
419 | 419 | },
|
420 | 420 | {
|
421 | 421 | "cell_type": "code",
|
422 |     | - "execution_count": 6,
    | 422 | + "execution_count": null,
423 | 423 | "metadata": {
|
424 | 424 | "colab": {
|
425 | 425 | "base_uri": "https://localhost:8080/",
|
|
488 | 488 | },
|
489 | 489 | {
|
490 | 490 | "cell_type": "code",
|
491 |     | - "execution_count": 7,
    | 491 | + "execution_count": null,
492 | 492 | "metadata": {
|
493 | 493 | "colab": {
|
494 | 494 | "base_uri": "https://localhost:8080/",
|
|
578 | 578 | },
|
579 | 579 | {
|
580 | 580 | "cell_type": "code",
|
581 |     | - "execution_count": 24,
    | 581 | + "execution_count": null,
582 | 582 | "metadata": {
|
583 | 583 | "colab": {
|
584 | 584 | "base_uri": "https://localhost:8080/",
|
|
624 | 624 | "outputId": "2a818de7-128e-4200-f196-f629e698d985"
|
625 | 625 | },
|
626 | 626 | "id": "tcD44qxkSSya",
|
627 |     | - "execution_count": 25,
    | 627 | + "execution_count": null,
628 | 628 | "outputs": [
|
629 | 629 | {
|
630 | 630 | "output_type": "stream",
|
|
713 | 713 | "Local testing works great for creating small datasets and making sure everything works, but to run on a large dataset at scale it's best to use a distributed runner like\n",
|
714 | 714 | "[Dataflow](https://cloud.google.com/dataflow).\n",
|
715 | 715 | "\n",
|
716 |     | - "We can run [`create_dataset.py`](create_dataset.py) as a script and launch it in [Dataflow](https://cloud.google.com/dataflow).\n",
    | 716 | + "We can run [`create_dataset.py`](../create_dataset.py) as a script and launch it in [Dataflow](https://cloud.google.com/dataflow).\n",
717 | 717 | "You can control the number of dates to sample with `--num-dates` _(default=100)_, and the number of bins to use for the stratified sampling with `--num-bins` _(default=10)_.\n",
|
718 | 718 | "\n",
|
719 | 719 | "We are using the same data extraction functions for both training and prediction.\n",
|
720 |     | - "This means our Dataflow pipeline needs access to the [`serving/weather-data`](serving/weather-data) module.\n",
    | 720 | + "This means our Dataflow pipeline needs access to the [`serving/weather-data`](../serving/weather-data) module.\n",
721 | 721 | "Since it's a local module that is not published on [PyPI](https://pypi.org), we first have to build it with [`build`](https://pypa-build.readthedocs.io/en/latest) and then include the built package in the Dataflow job."
|
722 | 722 | ],
|
723 | 723 | "id": "YWAI6AetcxRH"
|
|
748 | 748 | "outputId": "516fb9b4-328a-4d41-af2a-028448559882"
|
749 | 749 | },
|
750 | 750 | "id": "1NtAJBl0TKyE",
|
751 |     | - "execution_count": 17,
    | 751 | + "execution_count": null,
752 | 752 | "outputs": [
|
753 | 753 | {
|
754 | 754 | "output_type": "stream",
|
|
769 | 769 | },
|
770 | 770 | "outputs": [],
|
771 | 771 | "source": [
|
| 772 | + "data_path = f\"gs://{bucket}/weather/data\"\n", |
| 773 | + "\n", |
772 | 774 | "!python create_dataset.py \\\n",
|
773 |     | - " --data-path=\"gs://{bucket}/weather/data\" \\\n",
    | 775 | + " --data-path=\"{data_path}\" \\\n",
774 | 776 | " --runner=\"DataflowRunner\" \\\n",
|
775 | 777 | " --project=\"{project}\" \\\n",
|
776 | 778 | " --region=\"{location}\" \\\n",
|
|
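For a quick sanity check before launching at scale, the same script can be run locally on a handful of dates. The `--num-dates` and `--num-bins` flags come from the text above; using `DirectRunner` and a local `--data-path` are assumptions about how the script wires up its Beam pipeline options.

```python
# Small local run: few dates, default bins, Beam's in-process runner (assumed to be accepted by the script).
!python create_dataset.py \
    --data-path="/tmp/weather-data-sample" \
    --runner="DirectRunner" \
    --num-dates=2 \
    --num-bins=10
```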