
Adds chromosome (CHR) and genomic position (POS) columns to a dataset that contains a column of rsIDs (SNP IDs). Positions are retrieved from the SNPlocs.Hsapiens.dbSNP155.GRCh37 database, or from its GRCh38 counterpart when ref = "GRCh38". The function returns the dataframe with the 'CHR' and 'POS' columns appended.

Usage

add_chrpos(dat, snp_col = "SNP", ref = "GRCh37")

Arguments

dat

A dataframe containing at least a column with SNP IDs (rsIDs).

snp_col

A string indicating the column name containing SNP IDs (default is "SNP").

ref

A string indicating the reference genome version: "GRCh37" (default) or "GRCh38".

Value

A dataframe with additional 'CHR' and 'POS' columns.

Examples

if (FALSE) { # \dontrun{
pacman::p_load(data.table, dplyr, BSgenome, SNPlocs.Hsapiens.dbSNP155.GRCh37)
zuo_ref <- fread("/path/to/1KG-EAS-EAF.txt.gz") # Input dataset with rsIDs in the SNP column
result <- add_chrpos(zuo_ref, snp_col = "SNP", ref = "GRCh37")
} # }
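
The lookup described above can be sketched roughly as follows. This is a minimal illustration, not the package's actual implementation: the helper name lookup_chrpos_sketch and its snp_db argument are hypothetical, and it assumes the Bioconductor packages BSgenome, GenomeInfoDb, BiocGenerics and S4Vectors are installed. It uses BSgenome::snpsById() to fetch positions, then matches them back to the input rows, so rsIDs that are not found end up with NA in both new columns.

```r
# Hypothetical helper illustrating one way to attach CHR/POS to rsIDs.
# Assumption: `snp_db` is a SNPlocs object such as
# SNPlocs.Hsapiens.dbSNP155.GRCh37 (passed in by the caller).
lookup_chrpos_sketch <- function(dat, snp_db, snp_col = "SNP") {
  # Query each distinct rsID once; silently drop IDs absent from the database.
  ids <- unique(dat[[snp_col]])
  hits <- BSgenome::snpsById(snp_db, ids, ifnotfound = "drop")

  # Flatten the returned positions into a plain lookup table.
  pos_tbl <- data.frame(
    rsid = S4Vectors::mcols(hits)$RefSNP_id,
    CHR  = as.character(GenomeInfoDb::seqnames(hits)),
    POS  = BiocGenerics::start(hits),
    stringsAsFactors = FALSE
  )

  # Align positions to the original row order; unmatched rsIDs become NA.
  idx <- match(dat[[snp_col]], pos_tbl$rsid)
  dat$CHR <- pos_tbl$CHR[idx]
  dat$POS <- pos_tbl$POS[idx]
  dat
}

# Usage (not run; requires the SNPlocs package to be installed):
# library(SNPlocs.Hsapiens.dbSNP155.GRCh37)
# res <- lookup_chrpos_sketch(zuo_ref, SNPlocs.Hsapiens.dbSNP155.GRCh37)
```

Passing the SNPlocs object as an argument keeps the sketch independent of the genome build; a caller could supply the GRCh38 object instead to mirror ref = "GRCh38".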