Data on passengers on the RMS Titanic, excluding the Crew and some individual identifier variables.
Format
A data frame with 1309 observations on the following 6 variables.
pclassa factor with levels
1st2nd3rdsurviveda factor with levels
diedsurvivedsexa factor with levels
femalemaleagepassenger age in years (or fractions of a year, for children), a numeric vector; age is missing for 263 of the passengers
sibspnumber of siblings or spouses aboard, integer:
0:8parchnumber of parents or children aboard, integer:
0:6
Source
The original R source for this dataset was compiled by Frank Harrell and Robert Dawson: https://hbiostat.org/data/repo/titanic.txt, described in more detail in https://hbiostat.org/data/repo/titanic
For this version of the Titanic data, passenger details were deleted,
survived was cast as a factor, and the name changed to Titanicp to
minimize confusion with other versions.
Details
There are a number of related versions of the Titanic data, in various
formats. This version was derived from ptitanic in the
rpart.plot package, modifying it to remove the Class 'labelled'
attributes for some variables (inherited from Frank Harrell's
titanic3 version) which caused problems with some applications,
notably ggplot2.
Other versions:
Titanic is the 4-way frequency table of all 2201
people aboard the Titanic, including passengers and crew.
Examples
data(Titanicp)
## maybe str(Titanicp) ; plot(Titanicp) ...