CPSC150A
Scientific Computing

Project

You have learned a lot over this semester, so it is time for you to show everyone how much. For the project you will write a program to analyze, visualize, and help answer a question about a dataset. The dataset and question can be anything you want!

Creating interesting questions about a dataset requires a certain level of familiarity with your dataset. You will need to know what each of the variables is and how they are related to each other. Pick a dataset that is interesting to you and large enough to create interesting questions.

Simple questions ask about the correlation between two variables in the dataset. The two variables can be two columns of your dataset, but more often they will need to be modified to answer the question. An interesting question's answer will help people make more informed decisions to help themselves or society.
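
As a rough illustration of the paragraph above, here is a minimal Python sketch of deriving two variables from raw columns and measuring their correlation. The handout does not prescribe a language, so Python is an assumption, and the rows, column meanings, and the helper name pearson are all made up for the example. Your own dataset and question will look different.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical rows: (town population, bike accidents, miles of bike lanes).
# These numbers are invented purely to show the technique.
rows = [
    (10_000, 12, 5),
    (20_000, 20, 15),
    (40_000, 36, 40),
    (80_000, 56, 100),
]

# The raw columns mostly track town size, so derive per-capita versions
# before comparing them.
accidents_per_1000 = [acc / pop * 1000 for pop, acc, _ in rows]
lane_miles_per_1000 = [miles / pop * 1000 for pop, _, miles in rows]

r = pearson(lane_miles_per_1000, accidents_per_1000)
print(f"Pearson r = {r:.2f}")
```

A value of r near +1 or -1 suggests a strong linear relationship, and a value near 0 suggests little or none. Keep in mind that correlation alone does not show that one variable causes the other.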


Details

The presentation should include the following elements:

  • Dataset - To understand the question you are answering about your dataset, your audience will need to know certain details about the dataset. Describe the dataset, including details such as where the data comes from and how it was collected. Also, describe each of the variables used in your analysis and how they are related to each other.
  • Question - In addition to the question itself, describe why the question is interesting and how its answer could be used for personal or societal benefit. Also include your hypothesized answer and the reasoning behind it. It is okay if the actual answer differs from your hypothesis. It is valuable both when the data support logical reasoning and when the data subvert expectations so that assumptions are reevaluated.
  • Plot - Run a program you wrote that displays a bar chart, with error bars, illustrating the relationship between two variables.
  • Analysis - Explain how the plot can be used to answer your question. This is your opportunity to tell a story, to make the data come alive for your audience, to convince them of something interesting, or surprising, or actionable. Describe why the plot looks the way it does and relate this to your hypothesis. Be sure to also include why the question's answer matters.
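
For the Plot element, one possible shape for such a program is sketched below. It assumes Python with matplotlib, which the handout does not require, and it uses the standard error of the mean for the error bars, which is only one reasonable choice. The data and the names mean_and_sem, plot_group_means, and sleep_hours_by_year are invented for illustration.

```python
import math
import statistics

import matplotlib.pyplot as plt


def mean_and_sem(values):
    """Return the mean and the standard error of the mean for a list of numbers."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


def plot_group_means(groups, xlabel, ylabel):
    """Draw one bar per group, using the standard error as the error bar."""
    labels = list(groups)
    stats = [mean_and_sem(groups[label]) for label in labels]
    means = [m for m, _ in stats]
    errors = [e for _, e in stats]

    fig, ax = plt.subplots()
    ax.bar(labels, means, yerr=errors, capsize=5)
    ax.set_xlabel(xlabel)
    ax.set_ylabel(ylabel)
    return fig, ax


# Hypothetical data: hours of sleep reported by students in each class year.
sleep_hours_by_year = {
    "First-year": [6.5, 7.0, 5.5, 8.0, 6.0],
    "Sophomore": [7.0, 7.5, 6.5, 8.0, 7.0],
    "Junior": [6.0, 6.5, 7.0, 5.5, 6.0],
}

if __name__ == "__main__":
    plot_group_means(sleep_hours_by_year, "Class year", "Mean hours of sleep")
    plt.show()
```

If the error bars of two groups overlap heavily, the difference between their means may not be meaningful, which is worth addressing in your analysis.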

In the real world, programming is a collaborative effort. For this project, you are encouraged, but not required, to work with a partner. When working with a partner, both members should be present and actively contributing to every line of code that is written. So, be sure to choose a partner whom you get along with, who is available to work at the same time as you, and who wants to devote the same amount of time to the project as you do.


Presentation

On Friday, December 6th, we will present projects during our class period. During this time, you will be asked to present an analysis of your dataset, including code and plots, to the rest of the class. You should be prepared to answer questions about how your code works.

Submit the URL for your project to the course Inquire site before class on the day of the showcase. Late submissions will not be accepted. Only one submission per group is required.


Grade

The project will be graded according to the following criteria and percentages:

  • Dataset - 20%
  • Question - 20%
  • Plot - 30%
  • Analysis - 30%

Extra Credit

Bonus points will be awarded to projects that are viewed favorably by fellow students.