Skip to content

Introduction


G00x - Generalizable Germline-Targeting Clinical Trial Pipeline

About

This is the code for the G00x pipelines including G002 and G003. It is an all-in-one pipeline and analysis that parses, validates, calculates frequencies, runs 10X and combines all analysis into a plottable dataframe.


Source Code: https://github.com/schieflab/G002-and-G003


Quick start installation

While not necessary, we highly recommend using the conda open-source package and environment manager. This allows you to make an environment without destroying your system installed python environment.

Miniconda installers

Mac command line installer

Mac GUI installer

Linux command line installer

To install G00x package, use the following

clone the repository
$ git clone https://github.com/schieflab/G002-and-G003

change into G002-and-G003
$ cd G002-and-G003

create an environment
$ conda create -n g00x python==3.10.6 poetry==1.3.1 -y

activate the environment
$ conda activate g00x


use poetry to install
$ poetry install --with dev

$ g00x --help
Usage: g00x [OPTIONS] COMMAND [ARGS]...

Options:
  --logging-level TEXT  Set logging level
  --help                Show this message and exit.

Commands:
  g002  Run the G002 commands of G00x
  g003  Run the G003 commands of G00x

run g002 part of g00x
$ g00x g002 --help

Usage: g00x g002 [OPTIONS] COMMAND [ARGS]...

  Run the G002 commands of G00x

Options:
  --help  Show this message and exit.

Commands:
  analysis  Commands that will analyze the complete pipeline
  box       Commands that will interact with NIHBox
  globus    Commands that will interact with Globus
  pipeline  Run the 10x pipeline including the SADIE AIRR output
  validate  Commands that will validate the file have all appropriate...

Installation Guide: Cell Ranger 7.0 and bcl2fastq

G00x requires cellranger 7.0.1 to be installed from the 10x website. Please check that your system meets the system requirements. The installation instructions for Cell Ranger are on the 10x website, then add the cellranger binary to your PATH.


1. Install Cell Ranger 7.0.1

  1. Download and Extract
cd ~/apps  # Or any directory in your PATH
wget link to cellranger-7.0.1.tar.gz on 10x website
tar -xzvf cellranger-7.0.1.tar.gz
  1. Add to PATH

Edit your shell config file (~/.bashrc, ~/.zshrc, etc.):

export PATH=/apps/cellranger:$PATH

Then reload:

source ~/.bashrc
# or
source ~/.zshrc
  1. Confirm Installation
cellranger --version
# Should return something like: cellranger 7.0.1

2. Install bcl2fastq

  1. Download from Illumina (login required):
    Illumina bcl2fastq Download

  2. Install:

unzip bcl2fastq2-2.20.0.422-Linux-x86_64.zip
cd bcl2fastq2-2.20.0.422
./install
  1. Add to PATH:
export PATH=/usr/local/bcl2fastq/bin:$PATH
  1. Confirm Installation
bcl2fastq --version
# Should return version info like: bcl2fastq v2.20.0.422

After installation, both tools can be used in your G00x pipeline. The Pipeline will detect them in the path, but you can also specify the path to the tools when running the pipeline


G002 where to go

📊 G002 Data Transfer and use the raw data ➡️ Take me!


G002 Validation Validate your data structures ➡️ Take me!


🧪 G002 Pipeline Use the G002 flow and sequencing pipeline ➡️ Take me!


🔍 G002 Analysis Analyze the dataframes and make figures ➡️ Take me!

G003 where to go

📊 G003 Data Transfer and use the raw data ➡️ Take me!


G003 Validation Validate your data structures ➡️ Take me!


🧪 G003 Pipeline Use the G003 flow and sequencing pipeline ➡️ Take me!


🔍 G003 Analysis Analyze the dataframes and make figures ➡️ Take me!