This tutorial is based off the Integration and Label Transfer Vignette published and developed by the Satija Lab.
git cloneourSeurat.IntegrateDatarepository onto your local machine- Open up your computer's Terminal and
cdinto the same directory as yourgit clonefolder's location. You can also use a graphical interface to navigate to the correct working directory. - Once in the correct directory,
cdinto the/srcfolder and run the following command:Rscript batch_correction.R --input_files demo.txt --use_filenames_for_plots FALSE --ncomps 50 --nCount_RNA TRUE --nFeature_RNA TRUE --output_file_name hnscc_and_mel_bcdemo.txtcontains the file paths for 2 test datasets within the/datadirectory --HNSCC_noribo_small.txtandMEL_noribo_small.txt. These are 500x500 single-cell cancer datasets that will be batch-corrected- We utilize several command line flags to provide our users with the freedom to customize their batch correction output
- To read further about each Command Line Flag / Parameter, take a look at the technical documentation for our module here -> Link
- You will now have 3 output files:
batch_correction_log.txt,hnscc_and_mel_bc.rds, andhnscc_and_mel_bc.pdf- The
.txtfile contains a log of each process carried out during the batch correction script's execution - The
.rdsfile can be used on another one of GenePattern's Seurat suite modules, such as theSeurat.Clusteringmodule - The
.pdffile contains UMAP plots and Violin plots of the integrated HNSCC and MEL datasets
- The