add 2025 assignment 3

yunfanjiang · yunfanjiang · commit 82033758c072 · 2025-05-15T23:19:40.000-07:00
diff --git a/assignments/2025/assignment3.md b/assignments/2025/assignment3.md
@@ -5,68 +5,81 @@ mathjax: true
 permalink: /assignments2025/assignment3/
 ---
 
-<span style="color:red">This assignment is due on **Tuesday, May 28 2024** at 11:59pm PST.</span>
+<span style="color:red">This assignment is due on **Friday, May 30 2025** at 11:59pm PST.</span>
 
-Starter code containing Colab notebooks can be [downloaded here]({{site.hw_3_colab}}).
+Starter code containing Colab notebooks can
+be [downloaded here](https://drive.google.com/file/d/1m4eU68YJOqsX842otWS0z8hEaBB8c3EH/view?usp=sharing).
 
 - [Setup](#setup)
 - [Goals](#goals)
-- [Q1: Image Captioning with Vanilla RNNs](#q1-image-captioning-with-vanilla-rnns)
-- [Q2: Image Captioning with Transformers](#q2-image-captioning-with-transformers)
-- [Q3: Generative Adversarial Networks](#q3-generative-adversarial-networks)
-- [Q4: Self-Supervised Learning for Image Classification](#q4-self-supervised-learning-for-image-classification)
-- [Extra Credit: Image Captioning with LSTMs](#extra-credit-image-captioning-with-lstms-5-points)
+- [Q1: Image Captioning with Transformers](#q1-image-captioning-with-transformers)
+- [Q2: Self-Supervised Learning for Image Classification](#q2-self-supervised-learning-for-image-classification)
+- [Q3: Denoising Diffusion Probabilistic Models](#q3-denoising-diffusion-probabilistic-models)
+- [Q4: CLIP and Dino](#q4-clip-and-dino)
 - [Submitting your work](#submitting-your-work)
 
 ### Setup
 
-Please familiarize yourself with the [recommended workflow]({{site.baseurl}}/setup-instructions/#working-remotely-on-google-colaboratory) before starting the assignment. You should also watch the Colab walkthrough tutorial below.
+Please familiarize yourself with
+the [recommended workflow]({{site.baseurl}}/setup-instructions/#working-remotely-on-google-colaboratory) before starting
+the assignment. You should also watch the Colab walkthrough tutorial below.
 
 <iframe style="display: block; margin: auto;" width="560" height="315" src="https://www.youtube.com/embed/DsGd2e9JNH4" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen></iframe>
 
-**Note**. Ensure you are periodically saving your notebook (`File -> Save`) so that you don't lose your progress if you step away from the assignment and the Colab VM disconnects.
+**Note**. Ensure you are periodically saving your notebook (`File -> Save`) so that you don't lose your progress if you
+step away from the assignment and the Colab VM disconnects.
 
-While we don't officially support local development, we've added a <b>requirements.txt</b> file that you can use to setup a virtual env.
+While we don't officially support local development, we've added a <b>requirements.txt</b> file that you can use to
+setup a virtual env.
 
-Once you have completed all Colab notebooks **except `collect_submission.ipynb`**, proceed to the [submission instructions](#submitting-your-work).
+Once you have completed all Colab notebooks **except `collect_submission.ipynb`**, proceed to
+the [submission instructions](#submitting-your-work).
 
 ### Goals
 
-In this assignment, you will implement language networks and apply them to image captioning on the COCO dataset. Then you will train a Generative Adversarial Network to generate images that look like a training dataset. Finally, you will be introduced to self-supervised learning to automatically learn the visual representations of an unlabeled dataset.
+In this assignment, you will implement language networks and apply them to image captioning on the COCO dataset. Then
+you will be introduced to self-supervised learning to automatically learn the visual representations of an unlabeled
+dataset. Next, you will implement diffusion models (DDPMs) and apply them to image generation. Finally, you will explore
+CLIP and DINO, two self-supervised learning methods that leverage large amounts of unlabeled data to learn visual
+representations.
 
 The goals of this assignment are as follows:
 
-- Understand and implement RNN and Transformer networks. Combine them with CNN networks for image captioning.
-- Understand how to train and implement a Generative Adversarial Network (GAN) to produce images that resemble samples from a dataset.
+- Understand and implement Transformer networks. Combine them with CNN networks for image captioning.
 - Understand how to leverage self-supervised learning techniques to help with image classification tasks.
+- Implement and understand diffusion models (DDPMs) and apply them to image generation.
+- Implement and understand CLIP and DINO, two self-supervised learning methods that leverage large amounts of unlabeled
+  data to learn visual representations.
 
 **You will use PyTorch for the majority of this homework.**
 
-### Q1: Image Captioning with Vanilla RNNs
+### Q1: Image Captioning with Transformers
 
-The notebook `RNN_Captioning.ipynb` will walk you through the implementation of vanilla recurrent neural networks and apply them to image captioning on COCO.
+The notebook `Transformer_Captioning.ipynb` will walk you through the implementation of a Transformer model and apply it
+to image captioning on COCO.
 
-### Q2: Image Captioning with Transformers
+### Q2: Self-Supervised Learning for Image Classification
 
-The notebook `Transformer_Captioning.ipynb` will walk you through the implementation of a Transformer model and apply it to image captioning on COCO.
+In the notebook `Self_Supervised_Learning.ipynb`, you will learn how to leverage self-supervised pretraining to obtain
+better performance on image classification tasks. **When first opening the notebook, go
+to `Runtime > Change runtime type` and set `Hardware accelerator` to `GPU`.**
 
-### Q3: Generative Adversarial Networks 
+### Q3: Denoising Diffusion Probabilistic Models
 
-In the notebook `Generative_Adversarial_Networks.ipynb` you will learn how to generate images that match a training dataset and use these models to improve classifier performance when training on a large amount of unlabeled data and a small amount of labeled data. **When first opening the notebook, go to `Runtime > Change runtime type` and set `Hardware accelerator` to `GPU`.**
+In the notebook `DDPM.ipynb`, you will implement a Denoising Diffusion Probabilistic Model
+(DDPM) and apply it to image generation.
 
-### Q4: Self-Supervised Learning for Image Classification 
+### Q4: CLIP and Dino
 
-In the notebook `Self_Supervised_Learning.ipynb`, you will learn how to leverage self-supervised pretraining to obtain better performance on image classification tasks. **When first opening the notebook, go to `Runtime > Change runtime type` and set `Hardware accelerator` to `GPU`.**
-
-### Extra Credit: Image Captioning with LSTMs
-
-The notebook `LSTM_Captioning.ipynb` will walk you through the implementation of Long-Short Term Memory (LSTM) RNNs and apply them to image captioning on COCO.
+In the notebook `CLIP_DINO.ipynb`, you will implement CLIP and DINO, two self-supervised learning methods that leverage
+large amounts of unlabeled data to learn visual representations.
 
 ### Submitting your work
 
 **Important**. Please make sure that the submitted notebooks have been run and the cell outputs are visible.
 
-Once you have completed all notebooks and filled out the necessary code, you need to follow the below instructions to submit your work:
+Once you have completed all notebooks and filled out the necessary code, you need to follow the below instructions to
+submit your work:
 
 **1.** Open `collect_submission.ipynb` in Colab and execute the notebook cells.