Introduction

G00x - Generalizable Germline-Targeting Clinical Trial Pipeline
About¶
This is the code for the G00x pipelines including G002 and G003. It is an all-in-one pipeline and analysis that parses, validates, calculates frequencies, runs 10X and combines all analysis into a plottable dataframe.
Source Code: https://github.com/schieflab/G002-and-G003
Quick start installation¶
While not necessary, we highly recommend using the conda open-source package and environment manager. This allows you to make an environment without destroying your system installed python environment.
Miniconda installers
To install G00x package, use the following
clone the repository
$ git clone https://github.com/schieflab/G002-and-G003
change into G002-and-G003
$ cd G002-and-G003
create an environment
$ conda create -n g00x python==3.10.6 poetry==1.3.1 -y
activate the environment
$ conda activate g00x
use poetry to install
$ poetry install --with dev
$ g00x --help
Usage: g00x [OPTIONS] COMMAND [ARGS]...
Options:
--logging-level TEXT Set logging level
--help Show this message and exit.
Commands:
g002 Run the G002 commands of G00x
g003 Run the G003 commands of G00x
run g002 part of g00x
$ g00x g002 --help
Usage: g00x g002 [OPTIONS] COMMAND [ARGS]...
Run the G002 commands of G00x
Options:
--help Show this message and exit.
Commands:
analysis Commands that will analyze the complete pipeline
box Commands that will interact with NIHBox
globus Commands that will interact with Globus
pipeline Run the 10x pipeline including the SADIE AIRR output
validate Commands that will validate the file have all appropriate...
Installation Guide: Cell Ranger 7.0 and bcl2fastq¶
G00x requires cellranger 7.0.1
to be installed from the 10x website. Please check that your system meets the system requirements. The installation instructions for Cell Ranger are on the 10x website, then add the cellranger binary to your PATH.
1. Install Cell Ranger 7.0.1¶
- Download and Extract
cd ~/apps # Or any directory in your PATH
wget link to cellranger-7.0.1.tar.gz on 10x website
tar -xzvf cellranger-7.0.1.tar.gz
- Add to PATH
Edit your shell config file (~/.bashrc
, ~/.zshrc
, etc.):
export PATH=/apps/cellranger:$PATH
Then reload:
source ~/.bashrc
# or
source ~/.zshrc
- Confirm Installation
cellranger --version
# Should return something like: cellranger 7.0.1
2. Install bcl2fastq¶
-
Download from Illumina (login required):
Illumina bcl2fastq Download -
Install:
unzip bcl2fastq2-2.20.0.422-Linux-x86_64.zip
cd bcl2fastq2-2.20.0.422
./install
- Add to PATH:
export PATH=/usr/local/bcl2fastq/bin:$PATH
- Confirm Installation
bcl2fastq --version
# Should return version info like: bcl2fastq v2.20.0.422
After installation, both tools can be used in your G00x pipeline. The Pipeline will detect them in the path, but you can also specify the path to the tools when running the pipeline
G002 where to go¶
📊 G002 Data Transfer and use the raw data ➡️ Take me!
✅ G002 Validation Validate your data structures ➡️ Take me!
🧪 G002 Pipeline Use the G002 flow and sequencing pipeline ➡️ Take me!
🔍 G002 Analysis Analyze the dataframes and make figures ➡️ Take me!
G003 where to go¶
📊 G003 Data Transfer and use the raw data ➡️ Take me!
✅ G003 Validation Validate your data structures ➡️ Take me!
🧪 G003 Pipeline Use the G003 flow and sequencing pipeline ➡️ Take me!
🔍 G003 Analysis Analyze the dataframes and make figures ➡️ Take me!