tidytitanic: Dataframes Based on Titanic Passengers and Crew

A version of the Titanic survival data tailored for people analytics demonstrations and practice. While another package, 'titanic', reproduces the Kaggle competition files with minimal preprocessing, 'tidytitanic' combines the train and test datasets into the single dataset, 'passengers', for exploration and summary across all passengers. It also extracts personal identifiers—such as first names, last names, and titles from the raw 'name' field, enabling demographic analysis. The 'passengers' data does not cover the crew, but this package also provides the more bare-bones, crew-containing datasets 'tidy_titanic' and 'flat_titanic' based on the 'Titanic' data set from 'datasets' for further exploration. This human-centered data package is designed to support exploratory data analysis, feature engineering, and pedagogical use cases.

Version: 0.0.1
Depends: R (≥ 2.10)
Published: 2025-07-11
DOI: 10.32614/CRAN.package.tidytitanic
Author: Evangeline Reynolds [aut, cre]
Maintainer: Evangeline Reynolds <evangeline.mae at gmail.com>
License: MIT + file LICENSE
NeedsCompilation: no
CRAN checks: tidytitanic results

Documentation:

Reference manual: tidytitanic.pdf

Downloads:

Package source: tidytitanic_0.0.1.tar.gz
Windows binaries: r-devel: not available, r-release: tidytitanic_0.0.1.zip, r-oldrel: not available
macOS binaries: r-release (arm64): not available, r-oldrel (arm64): not available, r-release (x86_64): not available, r-oldrel (x86_64): not available

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tidytitanic to link to this page.