Add tutorial notebook for generating embeddings from pretrained models #2959

calebrob6 · 2025-08-28T16:36:19Z

Notebook link: https://github.com/torchgeo/torchgeo/blob/embedding_tutorial/docs/tutorials/pretrained_embeddings.ipynb

Copilot

Pull Request Overview

This PR adds a comprehensive tutorial notebook demonstrating how to extract fixed-length embeddings from pretrained models in TorchGeo. The tutorial covers using DOFA and ResNet-18 pretrained models to generate embeddings from EuroSAT imagery and evaluating them with k-NN classifiers.

- Adds a complete Jupyter notebook tutorial for pretrained embedding extraction
- Demonstrates usage of two different pretrained models (DOFA and ResNet-18) with specific preprocessing requirements
- Includes visualization and evaluation of embeddings using PCA and k-NN classification

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| docs/tutorials/pretrained_embeddings.ipynb | New tutorial notebook demonstrating embedding extraction from pretrained models |
| docs/tutorials/basic_usage.rst | Updated documentation index to include the new embeddings tutorial |

docs/tutorials/pretrained_embeddings.ipynb

isaaccorley · 2025-08-28T22:10:59Z

This LGTM. Not sure if @adamjstewart has any other nits

adamjstewart

I always have nits 😈

But seriously though, this is mostly good, glad we finally have this!

adamjstewart · 2025-09-03T13:24:20Z

docs/tutorials/pretrained_embeddings.ipynb

+   "source": [
+    "# On Colab, this ensures the latest TorchGeo is available.\n",
+    "\n",
+    "%pip install torchgeo"


Suggested change

-    "%pip install torchgeo"
+    "%pip install torchgeo scikit-learn tqdm"

tqdm is pulled in by torch, but scikit-learn isn't guaranteed to be installed, so we definitely want to list both explicitly.
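
For context, a minimal sketch of how a notebook could check which of its dependencies are importable before relying on them. The helper name `missing_modules` is hypothetical and not part of the PR. Note that import names can differ from pip names (scikit-learn is imported as `sklearn`).

```python
import importlib.util


def missing_modules(module_names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in module_names if importlib.util.find_spec(name) is None]


# Import names, not pip names: scikit-learn is imported as "sklearn".
required = ["torchgeo", "sklearn", "tqdm"]
missing = missing_modules(required)
if missing:
    print("Missing modules:", ", ".join(missing))
```

Listing every package in the `%pip install` line, as suggested above, avoids the need for a check like this in the notebook itself.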

adamjstewart · 2025-09-03T13:25:58Z

docs/tutorials/pretrained_embeddings.ipynb

+    "\n",
+    "root = os.path.join(tempfile.gettempdir(), 'eurosat100')\n",
+    "datamodule = EuroSAT100DataModule(\n",
+    "    root=root, batch_size=10, num_workers=2, download=True, bands=('B02', 'B03', 'B04')\n",


Why RGB-only?

adamjstewart · 2025-09-03T13:28:57Z

docs/tutorials/pretrained_embeddings.ipynb

+    "# Fit a k-NN classifier on DOFA train embeddings and evaluate on validation embeddings.\n",
+    "# This gives a quick, label-efficient baseline without fine-tuning.\n",


Some of these would be better suited as markdown cells. I don't love having multiple code cells in a row without explanations.
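
For readers following the thread, here is a rough sketch of the k-NN evaluation those comments describe: fit on train embeddings, then score on validation embeddings. It is written in plain NumPy on synthetic data rather than with the scikit-learn classifier the notebook uses, and the function name `knn_predict` and all array values are assumptions made for illustration.

```python
import numpy as np


def knn_predict(train_x, train_y, query_x, k=5):
    """Label each query row by majority vote among its k nearest training rows."""
    # Pairwise squared Euclidean distances, shape (n_query, n_train).
    d2 = ((query_x[:, None, :] - train_x[None, :, :]) ** 2).sum(axis=-1)
    # Indices of the k closest training embeddings for each query.
    nearest = np.argsort(d2, axis=1)[:, :k]
    votes = train_y[nearest]
    # Majority vote per query; ties resolve to the smallest label.
    return np.array([np.bincount(row).argmax() for row in votes])


rng = np.random.default_rng(0)
# Synthetic stand-ins for embeddings: two well-separated clusters of 8-d vectors.
train_x = np.concatenate([rng.normal(0, 0.1, (20, 8)), rng.normal(5, 0.1, (20, 8))])
train_y = np.array([0] * 20 + [1] * 20)
val_x = np.concatenate([rng.normal(0, 0.1, (5, 8)), rng.normal(5, 0.1, (5, 8))])
val_y = np.array([0] * 5 + [1] * 5)

val_acc = (knn_predict(train_x, train_y, val_x, k=3) == val_y).mean()
print(f"validation accuracy: {val_acc:.2f}")
```

No model weights are updated here, which is why this kind of evaluation works as a quick baseline without fine-tuning.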

adamjstewart · 2025-09-03T13:29:56Z

docs/tutorials/pretrained_embeddings.ipynb

+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Now let's do the same thing with a ResNet18 model pretrained on Sentinel-2 RGB imagery (from the SSL4EO paper).\n",


Out of curiosity (and without running the notebook myself), which did better?

adamjstewart · 2025-09-03T13:30:22Z

docs/tutorials/pretrained_embeddings.ipynb

+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "pca = PCA(n_components=2, whiten=True)\n",


t-SNE might be cool as well, but more work.
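
As background for the PCA step, a small NumPy sketch of a whitened two-component projection, which is the idea behind `PCA(n_components=2, whiten=True)`. This is an illustrative reimplementation on synthetic data, not the notebook's code; `pca_whiten` is a hypothetical helper, and the sign of each component may differ from what scikit-learn returns.

```python
import numpy as np


def pca_whiten(x, n_components=2):
    """Project rows of x onto the top principal components, scaled to unit variance."""
    centered = x - x.mean(axis=0)
    # Rows of vt are principal directions; s holds singular values in descending order.
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    projected = centered @ vt[:n_components].T
    # Whitening divides each component by its sample standard deviation.
    std = s[:n_components] / np.sqrt(x.shape[0] - 1)
    return projected / std


rng = np.random.default_rng(0)
embeddings = rng.normal(size=(100, 16))  # synthetic stand-in for model embeddings
coords = pca_whiten(embeddings)
print(coords.shape)
```

The resulting 2-D coordinates can be passed directly to a scatter plot, colored by class label.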

adamjstewart · 2025-09-03T13:31:08Z

docs/tutorials/pretrained_embeddings.ipynb

+    "            embeddings = model.forward_features(x)\n",
+    "            embeddings = torch.mean(\n",
+    "                embeddings, dim=(-2, -1)\n",
+    "            )  # global average pooling over the spatial dims\n",


Some of these comments can be moved from in-line to their own line to reduce formatting changes
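
To illustrate both the pooling step and the comment placement suggested here, a minimal NumPy sketch. It assumes the feature map has shape (batch, channels, height, width), and uses a synthetic array in place of the model output; `features.mean(axis=(-2, -1))` plays the role of `torch.mean(embeddings, dim=(-2, -1))`.

```python
import numpy as np

# Synthetic feature map standing in for the backbone output:
# (batch, channels, height, width).
features = np.arange(2 * 3 * 4 * 4, dtype=np.float32).reshape(2, 3, 4, 4)

# Global average pooling over the spatial dims
embeddings = features.mean(axis=(-2, -1))

print(embeddings.shape)
```

With the comment on its own line, the call fits on a single line and the formatter has no reason to wrap it.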

adamjstewart · 2025-09-03T13:32:22Z

docs/tutorials/pretrained_embeddings.ipynb

+    "\n",
+    "train_dl = datamodule.train_dataloader()\n",
+    "val_dl = datamodule.val_dataloader()\n",
+    "test_dl = datamodule.test_dataloader()"


The test set isn't being used at the moment. Would it be better to only use val or test since we aren't doing fine-tuning?

Add tutorial

648c987

github-actions bot added the documentation label Aug 28, 2025

Ruff

945b9a0

adamjstewart added this to the 0.7.2 milestone Aug 28, 2025

calebrob6 added 4 commits August 28, 2025 19:54

Sorting

b796661

remove fast dev

a633069

Formatting again?

90c2a49

Good grief

1e8bfdc

isaaccorley requested review from Copilot and adamjstewart August 28, 2025 21:04

Copilot AI reviewed Aug 28, 2025

isaaccorley approved these changes Aug 31, 2025

adamjstewart requested changes Sep 3, 2025
