| Title: | Publication-Ready PCA, t-SNE, and UMAP Plots | 
| Version: | 0.1.3 | 
| Description: | Provides tools for creating publication-ready dimensionality reduction plots, including Principal Component Analysis (PCA), t-Distributed Stochastic Neighbor Embedding (t-SNE), and Uniform Manifold Approximation and Projection (UMAP). This package helps visualize high-dimensional data with options for custom labels, density plots, and faceting, using the 'ggplot2' framework Wickham (2016) <doi:10.1007/978-3-319-24277-4>. | 
| License: | GPL-3 | 
| Imports: | config (≥ 0.3.2), golem (≥ 0.4.1), shiny (≥ 1.8.1.1), rlang, Rtsne, cowplot, dplyr, ggplot2, umap | 
| Encoding: | UTF-8 | 
| RoxygenNote: | 7.3.2 | 
| Suggests: | knitr, tibble, rmarkdown | 
| VignetteBuilder: | knitr | 
| NeedsCompilation: | no | 
| Packaged: | 2025-02-04 01:38:37 UTC; Bach | 
| Author: | Yaoxiang Li [cre, aut] | 
| Maintainer: | Yaoxiang Li <liyaoxiang@outlook.com> | 
| Repository: | CRAN | 
| Date/Publication: | 2025-02-04 03:20:02 UTC | 
Create publication-ready PCA, t-SNE, or UMAP plots
Description
This function generates dimensionality reduction plots (PCA, t-SNE, UMAP) with options for custom labels, titles, density plots, and faceting. It allows users to visualize high-dimensional data using various dimensionality reduction techniques.
Usage
ggpca(
  data,
  metadata_cols,
  mode = c("pca", "tsne", "umap"),
  scale = TRUE,
  x_pc = "PC1",
  y_pc = "PC2",
  color_var = NULL,
  ellipse = TRUE,
  ellipse_level = 0.9,
  ellipse_type = "norm",
  ellipse_alpha = 0.9,
  point_size = 3,
  point_alpha = 0.6,
  facet_var = NULL,
  tsne_perplexity = 30,
  umap_n_neighbors = 15,
  density_plot = "none",
  color_palette = "Set1",
  xlab = NULL,
  ylab = NULL,
  title = NULL,
  subtitle = NULL,
  caption = NULL
)
Arguments
| data | A data frame containing the data to be plotted. Must include both feature columns (numeric) and metadata columns (categorical). | 
| metadata_cols | A character vector of column names or a numeric vector of column indices for the metadata columns. These columns are used for grouping and faceting. | 
| mode | The dimensionality reduction method to use. One of  | 
| scale | Logical indicating whether to scale features (default:  | 
| x_pc | Name of the principal component or dimension to plot on the x-axis (default:  | 
| y_pc | Name of the principal component or dimension to plot on the y-axis (default:  | 
| color_var | (Optional) Name of the column used to color points in the plot. If  | 
| ellipse | Logical indicating whether to add confidence ellipses for groups (only supported for PCA and only if  | 
| ellipse_level | Confidence level for ellipses (default:  | 
| ellipse_type | Type of ellipse to plot, e.g., "norm" for normal distribution (default:  | 
| ellipse_alpha | Transparency level for ellipses, where 0 is fully transparent and 1 is fully opaque (default:  | 
| point_size | Size of the points in the plot (default:  | 
| point_alpha | Transparency level for the points, where 0 is fully transparent and 1 is fully opaque (default:  | 
| facet_var | Formula for faceting the plot (e.g.,  | 
| tsne_perplexity | Perplexity parameter for t-SNE, which balances local and global aspects of the data (default:  | 
| umap_n_neighbors | Number of neighbors for UMAP, which determines the local structure (default:  | 
| density_plot | Controls whether to add density plots for the x, y, or both axes. Accepts one of  | 
| color_palette | Name of the color palette (used for discrete variables) to use for the plot. Supports  | 
| xlab | Custom x-axis label (default:  | 
| ylab | Custom y-axis label (default:  | 
| title | Plot title (default:  | 
| subtitle | Plot subtitle (default:  | 
| caption | Plot caption (default:  | 
Value
A ggplot2 object representing the dimensionality reduction plot, including scatter plots, optional density plots, and faceting options. The plot can be further customized using ggplot2 functions.
Author(s)
Yaoxiang Li
Examples
# Load dataset
pca_data <- read.csv(system.file("extdata", "example.csv", package = "ggpca"))
# PCA example
p_pca_y_group <- ggpca(
  pca_data,
  metadata_cols = c(1:6),
  mode = "pca",
  color_var = "group",
  ellipse = TRUE,
  density_plot = "y",
  title = "PCA with Y-axis Density Plot",
  subtitle = "Example dataset, colored by group",
  caption = "Data source: Example dataset"
)
print(p_pca_y_group)
# t-SNE example
p_tsne_time <- ggpca(
  pca_data,
  metadata_cols = c(1:6),
  mode = "tsne",
  color_var = "time",
  tsne_perplexity = 30,
  title = "t-SNE Plot of Example Dataset",
  subtitle = "Colored by time",
  caption = "Data source: Example dataset"
)
print(p_tsne_time)
Process Missing Values in a Data Frame
Description
This function filters columns in a data frame based on a specified threshold for missing values and performs imputation on remaining non-metadata columns using half of the minimum value found in each column. Metadata columns are specified by the user and are exempt from filtering and imputation.
Usage
process_missing_value(data, missing_threshold = 25, metadata_cols = NULL)
Arguments
| data | A data frame containing the data to be processed. | 
| missing_threshold | A numeric value representing the percentage threshold of missing values which should lead to the removal of a column. Default is 25. | 
| metadata_cols | A vector of either column names or indices that should be treated as metadata and thus exempt from missing value filtering and imputation. If NULL, no columns are treated as metadata. | 
Value
A data frame with filtered and imputed columns as necessary.
Examples
data <- data.frame(
  A = c(1, 2, NA, 4),
  B = c(NA, NA, NA, 4),
  C = c(1, 2, 3, 4)
)
# Process missing values while ignoring column 'C' as metadata
processed_data <- process_missing_value(data, missing_threshold = 50, metadata_cols = "C")
Run the Shiny Application
Description
This function launches the Shiny application with the specified user interface and server function. The function does not return a value but starts the Shiny app, allowing users to interact with it.
Usage
run_app(
  onStart = NULL,
  options = list(),
  enableBookmarking = NULL,
  uiPattern = "/",
  ...
)
Arguments
| onStart | A function that will be called before the app is actually run.
This is only needed for  | 
| options | Named options that should be passed to the  | 
| enableBookmarking | Can be one of  | 
| uiPattern | A regular expression that will be applied to each  | 
| ... | Arguments to pass to 'golem_opts'. See '?golem::get_golem_options' for more details. | 
Value
No return value, called for side effects.