CS 5043: HW3: Convolutional Neural Networks

Objectives

Assignment Notes

Data Set

The Core50 data set is a large database of videos of objects as they are being moved/rotated under a variety of different lighting and background conditions. Our general task is to classify the object being shown in a single frame of one of these videos.

Data Organization


Provided Code

We are providing the following code posted on the main course web page:


Prediction Problem

We will focus on the distinction between mugs, scissors, and glasses, for which we only have five distinct example objects (though, for each, we have many different perspectives and conditions). Our goal is to construct a model that will be generally applicable: ideally, it will be able to distinguish between any mug, any pair of scissors, and any glasses. However, given the small number of objects, this is a challenge. For the purposes of this assignment, we will use three objects from each class for training and one distinct object from each class for each of validation and testing. For rotation 0:


Architectures

You will create two convolutional neural networks to distinguish the mug, scissors, and glasses: one will be a shallow network and the other will be a deep network. Each will nominally have the following structure:

Since the data set is relatively small (in terms of the number of distinct objects), it is important to take steps to address the over-fitting problem. Here are the key tools that you have:


Experiments


Hints / Notes


What to Hand In

A single zip file that contains:

Grading


andrewhfagg -- gmail.com

Last modified: Tue Mar 8 13:09:44 2022