Passengers on the Titanic
Titanicp.Rd
Data on passengers on the RMS Titanic, excluding the Crew and some individual identifier variables.
Usage
data(Titanicp)
Format
A data frame with 1309 observations on the following 6 variables.
pclass
a factor with levels
1st
2nd
3rd
survived
a factor with levels
died
survived
sex
a factor with levels
female
male
age
passenger age in years (or fractions of a year, for children), a numeric vector; age is missing for 263 of the passengers
sibsp
number of siblings or spouses aboard, integer:
0:8
parch
number of parents or children aboard, integer:
0:6
Details
There are a number of related versions of the Titanic data, in various formats.
This version was derived from ptitanic
in the rpart.plot package,
modifying it to remove the Class 'labelled'
attributes for some variables
(inherited from Frank Harrell's titanic3
version)
which caused problems with some applications, notably ggplot2
.
Other versions:
Titanic
is the 4-way frequency table of all 2201 people
aboard the Titanic, including passengers and crew.
Source
The original R source for this dataset was compiled by Frank Harrell and Robert Dawson: https://biostat.app.vumc.org/wiki/pub/Main/DataSets/titanic.html, described in more detail in https://biostat.app.vumc.org/wiki/pub/Main/DataSets/titanic3info.txt
For this version of the Titanic data, passenger details were deleted,
survived was cast as a factor, and the name changed to Titanicp
to minimize confusion
with other versions.
Examples
data(Titanicp)
## maybe str(Titanicp) ; plot(Titanicp) ...