Part 1
Please watch the videos in Videos: introduction to JMP Pro to learn the very basics of JMP Pro. This takes about 25 minutes.
Part 2
Recall the titanic.csv dataset: each row corresponds to one passenger, and the columns are described below:
- survival: Survival (0 = No, 1 = Yes)
- pclass: Ticket class (1 = 1st, 2 = 2nd, 3 = 3rd). This is a proxy for socio-economic status (SES): 1st = Upper, 2nd = Middle, 3rd = Lower.
- sex: gender
- Age: Age in years (Age is fractional if less than 1. If the age is estimated, it is in the form xx.5)
- sibsp: # of siblings / spouses aboard the Titanic (The dataset defines family relations in this way: Sibling = brother, sister, stepbrother, stepsister; Spouse = husband, wife)
- parch: # of parents / children aboard the Titanic (Some children traveled only with a nanny, therefore parch=0 for them)
- ticket: Ticket number
- fare: Passenger fare
- cabin: Cabin number
- embarked: Port of Embarkation (C = Cherbourg, Q = Queenstown, S = Southampton)
Files to submit:
- A JMP data file: .jmp
- Do not forget to save a script for each of the steps below (as shown in the last video of Videos: introduction to JMP Pro). Submit the resulting JMP file to Canvas.
Questions:
- Import the Titanic dataset into a JMP data table.
- Change the modeling type of the following variables:
  - PassengerId to nominal
  - Pclass to ordinal
- Click the ‘Distribution’ button above the table, and create a histogram and a boxplot for each of the variables ‘Fare’ and ‘Age’.
- Using the Graph Builder, create a bar chart that shows the number of survivors per gender:
  - put the ‘Sex’ variable on the x-axis and the ‘Survived’ variable on the y-axis;
  - choose bar chart;
  - use SUM as the summary statistic (because ‘Survived’ is coded 0/1, the sum equals the number of survivors);
  - label the bars with the percentage of the total.
- Create a pie chart of the variable ‘Pclass’. Show percentage of each ticket class.
- Tabulate* the number of passengers who survived the Titanic by their gender.
- Tabulate* the average fare for each Ticket class.
* By tabulate, I mean creating a pivot table.
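If you want to sanity-check what the pivot tables and the bar chart should report, the same summaries can be sketched outside JMP. The Python snippet below is only an illustration, not part of the required JMP workflow: the six rows are made-up sample data (not taken from titanic.csv), and the column names are assumed to match the ones used in the questions above (Survived, Sex, Pclass, Fare).

```python
from collections import defaultdict

# Hypothetical sample rows for illustration only; NOT real Titanic records.
rows = [
    {"Survived": 0, "Sex": "male",   "Pclass": 3, "Fare": 8.0},
    {"Survived": 1, "Sex": "female", "Pclass": 1, "Fare": 70.0},
    {"Survived": 1, "Sex": "female", "Pclass": 3, "Fare": 8.0},
    {"Survived": 0, "Sex": "male",   "Pclass": 2, "Fare": 12.0},
    {"Survived": 1, "Sex": "male",   "Pclass": 1, "Fare": 50.0},
    {"Survived": 0, "Sex": "female", "Pclass": 3, "Fare": 10.0},
]

# Survivors per gender: Survived is coded 0/1, so summing it counts survivors.
survivors_by_sex = defaultdict(int)
for row in rows:
    survivors_by_sex[row["Sex"]] += row["Survived"]

# Each group's share of all survivors, as a percentage of the total.
total_survivors = sum(survivors_by_sex.values())
pct_of_total = {
    sex: 100 * count / total_survivors
    for sex, count in survivors_by_sex.items()
}

# Average fare per ticket class: collect the fares per class, then take the mean.
fares_by_class = defaultdict(list)
for row in rows:
    fares_by_class[row["Pclass"]].append(row["Fare"])
mean_fare_by_class = {
    pclass: sum(fares) / len(fares)
    for pclass, fares in sorted(fares_by_class.items())
}

print(dict(survivors_by_sex))
print(pct_of_total)
print(mean_fare_by_class)
```

The grouping variable (Sex or Pclass) plays the role of the row category in a pivot table, and the sum or mean is the statistic shown in each cell.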