10x pipeline¶
After you have a validated merged dataframe from validation, you can begin the 10X pipeline consisting of demultiplexing, vdj, and feature counting.
CellRanger 6.1
The CellRanger version in this pipeline is 6.1
Flow¶
The following will analyze the flow path and output a dataframe that will be used in analysis. It computes each count of each gate that are useful in frequency analysis.
$ g00x g002 pipeline flow -o g002/G002/output/flow /path/to/flow
import pandas as pd
from g00x.data import Data
from g00x.flow.flow import parse_flow_data
from pathlib import Path
data = ctx.obj["data"]
folder = 'path/to/flow'
flow_data = parse_flow_data(data, folder)
out = Path(out)
output_feather = Path(out.parent / (out.stem + ".feather"))
output_csv = Path(out.parent / (out.stem + ".csv"))
Here is what the flow dataframe will look like.
run_purpose | run_date | sort_id | ptid | group | weeks | visit_id | probe_set | sample_type | sort_software_dv | sort_file_type | sample_tube | gate | phenotype | value_type | extention | file_path | file_subset | value | branch | easy_name | notes | sort_pool | hashtag | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
119 | PreS | 2022-08-25 | S6C | G002831 | 2 | -5 | V091 | eODGT8 | PBMC | DV | Summary | T1 | P11 | IgD+/Antigen++ | count | .csv | ['g002/G002/sorting/G002/Prescreens/Prescreen_RunDate220825_UploadDate221021/PopulationSummaryFilesFromDV/PreS_220825_S6C_G002831_V091_eODGT8_PBMC_DV_Summary_T1_a.csv'] | ['a'] | 50 | IgD+ | antigen_pos_igd_pos_b_cells | nan | ||
2461 | PreS | 2022-11-04 | S6C | G002136 | 2 | 8 | V200 | eODGT8 | PBMC | DV | Summary | T1 | P13 | IgD+/KO- | count | .csv | ['g002/G002/sorting/G002/Prescreens/Prescreen_RunDate221104_UploadDate221129/PopulationSummaryFilesFromDV/PreS_221104_S6C_G002136_V200_eODGT8_PBMC_DV_Summary_T1_a.csv'] | ['a'] | 190682 | IgD+ | nan | |||
2754 | PreS | 2022-11-04 | S6C | G002947 | 2 | 8 | V200 | Cg28v2 | PBMC | DV | Summary | T1 | P31 | IgG-IgM-/IgA+/KO- | count | .csv | ['g002/G002/sorting/G002/Prescreens/Prescreen_RunDate221104_UploadDate221129/PopulationSummaryFilesFromDV/PreS_221104_S6C_G002947_V200_Cg28v2_PBMC_DV_Summary_T1_a.csv'] | ['a'] | 10498 | IgA+ | nan | |||
4485 | Sort | 2022-10-25 | S6C | G002831 | 2 | 4 | V160 | eODGT8 | PBMC | DV | Summary | T1 | P27 | IgG-IgM-IgD- | count | .csv | ['g002/G002/sorting/G002/Sorts/Sort_RunDate221025_UploadDate221101/ClinicalSamples/PopulationSummaryFilesFromDV/Sort_221025_S6C_G002831_V160_eODGT8_PBMC_HT02_DV_Summary_T1_P03_a.csv'] | ['a'] | 100696 | nan | P03 | HT02 | ||
2343 | PreS | 2022-11-04 | S6C | G002136 | 2 | -5 | V091 | eODGT8 | PBMC | DV | Summary | T1 | P12 | IgD+/Antigen++/KO- | count | .csv | ['g002/G002/sorting/G002/Prescreens/Prescreen_RunDate221104_UploadDate221129/PopulationSummaryFilesFromDV/PreS_221104_S6C_G002136_V091_eODGT8_PBMC_DV_Summary_T1_a.csv'] | ['a'] | 47 | IgD+ | epitope_pos_igd_pos_b_cells_rev | nan |
Demultiplexing¶
The input to the demultiplexing part of the pipeline will be the sequencing and flow file paths
$ g00x g002 pipeline demultiplex -o g002/G002/output/demultiplex -f /path/to/flow -s /path/to/sequencing
import pandas as pd
from g00x.data import Data
from g00x.sequencing.tenX import run_demultiplex
flow_path = "path/to/flow"
sequencing_path = "path/to/sequencing"
data = Data()
merged_dataframe: pd.DataFrame = merge_flow_and_sequencing(data, flow_path, sequencing_path) # type: ignore
demultiplex_df = run_demultiplex(data, merged_dataframe, out, overwrite)
demultiplex_df.to_csv("demultiplex.csv")
demultiplex_df.to_csv("demultiplex.feather")
The demulitplex algorithm will add the following fields in demultiplex output
Column | Definition |
---|---|
vdj_run_dir | The full path to the vdj run directory, e.g. the Illumina directory |
cso_run_dir | The full path to the cso run directory, e.g. the Illumina directory |
vdj_sample_name | The unique vdj sample name given to each row |
cso_sample_name | The unique cso sample name given to each row |
vdj_fastq_dir | The full path to the vdj fastq directory |
cso_fastq_dir | The full path to the cso fastq directory |
An example of a demultiplexing output dataframe is found below.
ptid | group | weeks | visit_id | probe_set | sample_type | run_date | sort_pool | hashtag | run_dir_path | pool_number | sorted_date | vdj_sequencing_replicate | cso_sequencing_replicate | vdj_lirary_replicate | cso_library_replicate | bio_replicate | vdj_index | feature_index | vdj_run_id | cso_run_id | vdj_run_dir_path | cso_run_dir_path | vdj_fastq_dir | vdj_sample_name | cso_fastq_dir | cso_sample_name | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | G002516 | 1 | -5 | V091 | eODGT8 | PBMC | 2022-09-27 | P01 | HT01 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002 | P01 | 2022-09-27 | 0 | 0 | 0 | 0 | 0 | SI-TT-D6 | SI-TN-D6 | 221006_VH00497_31_AAAVKCLHV | 221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path | vdj-SI-TT-D6 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path | cso-SI-TN-D6 |
1 | G002516 | 1 | 4 | V160 | eODGT8 | PBMC | 2022-09-27 | P01 | HT02 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002 | P01 | 2022-09-27 | 0 | 0 | 0 | 0 | 0 | SI-TT-D6 | SI-TN-D6 | 221006_VH00497_31_AAAVKCLHV | 221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path | vdj-SI-TT-D6 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path | cso-SI-TN-D6 |
VDJ¶
The output of demultiplexing pipeline will be used as input, see the pipeline
The following will run the VDJ pipeline from the demultiplex dataframe and output the vdj.feather inside the output folder.
Run it from demultiplexed dataframe
$ g00x g002 pipeline vdj -o g002/G002/output/vdj -d output/demultiplexed.feather
You can run the same with the following Python code.
from g00x.sequencing.tenX import run_vdj
from g00x.data import Data
data = Data()
demultiplex_dataframe = pd.read_feather(demultiplex_dataframe_path)
run_vdj(data, demultiplex_dataframe, out)
The output vdj dataframe will only contain one additional field
Column | Definition |
---|---|
vdj_output | The full path to the vdj output folder |
CSO¶
This CSO pipeline will run the cellranger count part and output a feature matrix. It also uses the demultiplex.feather as input.
The following will run the CSO pipeline from the demultiplex dataframe and output cso.feather inside the output folder.
Run it from demultiplexed dataframe
$ g00x g002 pipeline cso -o g002/G002/output/cso -d output/demultiplexed.feather
You can run the same with the following Python code.
from g00x.sequencing.tenX import run_cso
from g00x.data import Data
data = Data()
demultiplex_dataframe = pd.read_feather(demultiplex_dataframe_path)
run_cso(data, demultiplex_dataframe, out)
The output CSO dataframe will only contain one additional field
Column | Definition |
---|---|
cso_output | The full path to the cso output folder |
AIRR¶
The output of the VDJ and CSO can now be combined to get a final sequencing dataframe. This is the final sequencing dataframe that will be used for the analysis.
The AIRR protocol does the following.
It will...
- Run SADIE AIRR on the VDJ contigs to get a formalized AIRR dataframe.
- Analyze the feature barcodes and assign cellids to the correct participant.
- Add the PubIDs to the AIRR dataframe.
- Run mutational analysis
- Run iGL assignment to all sequences
- Determine if sequence is VRC01 class
- Add mutational sets (find what mutations are VRC01-like)
- Cluster sequences
- Determine isotype
The following will run the AIRR pipeline from the VDJ and CSO dataframes and output airr.feather inside of the output folder.
Run it from VDJ and CSO dataframes
$ g00x g002 pipeline airr -o g002/G002/output/airr -v output/vdj.feather -c output/cso.feather
You can run the same with the following Python code.
from g00x.data import Data
from g00x.sequencing.airr import run_airr
vdj_out = 'output/vdj.feather'
cso_out = 'output/cso.feather'
data = Data()
vdj_dataframe = pd.read_feather(vdj_out)
cso_dataframe = pd.read_feather(cso_out)
output_df = run_airr(data, vdj_dataframe, cso_dataframe, out, overwrite)
An output dataframe will take the following:
cellid | pubID | ptid | group | weeks | visit_id | probe_set | sample_type | run_date | sort_pool | hashtag | run_dir_path | pool_number | sorted_date | vdj_sequencing_replicate | cso_sequencing_replicate | vdj_lirary_replicate | cso_library_replicate | bio_replicate | vdj_index | feature_index | vdj_run_id | cso_run_id | vdj_run_dir_path | cso_run_dir_path | vdj_fastq_dir | vdj_sample_name | cso_fastq_dir | cso_sample_name | vdj_output | cso_output | sadie_airr_path | paired_sadie_airr_path | cellhash | sequence_id_heavy | sequence_heavy | reference_name_heavy | locus_heavy | stop_codon_heavy | vj_in_frame_heavy | v_frameshift_heavy | productive_heavy | rev_comp_heavy | complete_vdj_heavy | v_call_top_heavy | v_call_heavy | d_call_top_heavy | d_call_heavy | j_call_top_heavy | j_call_heavy | c_call_heavy | sequence_alignment_heavy | germline_alignment_heavy | sequence_alignment_aa_heavy | germline_alignment_aa_heavy | v_alignment_start_heavy | v_alignment_end_heavy | d_alignment_start_heavy | d_alignment_end_heavy | j_alignment_start_heavy | j_alignment_end_heavy | c_alignment_start_heavy | c_alignment_end_heavy | v_sequence_alignment_heavy | v_sequence_alignment_aa_heavy | v_germline_alignment_heavy | v_germline_alignment_aa_heavy | d_sequence_alignment_heavy | d_sequence_alignment_aa_heavy | d_germline_alignment_heavy | d_germline_alignment_aa_heavy | j_sequence_alignment_heavy | j_sequence_alignment_aa_heavy | j_germline_alignment_heavy | j_germline_alignment_aa_heavy | c_sequence_alignment_heavy | c_sequence_alignment_aa_heavy | c_germline_alignment_heavy | c_germline_alignment_aa_heavy | fwr1_heavy | fwr1_aa_heavy | cdr1_heavy | cdr1_aa_heavy | fwr2_heavy | fwr2_aa_heavy | cdr2_heavy | cdr2_aa_heavy | fwr3_heavy | fwr3_aa_heavy | fwr4_heavy | fwr4_aa_heavy | cdr3_heavy | cdr3_aa_heavy | junction_heavy | junction_length_heavy | junction_aa_heavy | junction_aa_length_heavy | v_score_heavy | d_score_heavy | j_score_heavy | c_score_heavy | v_cigar_heavy | d_cigar_heavy | j_cigar_heavy | c_cigar_heavy | v_support_heavy | d_support_heavy | j_support_heavy | c_support_heavy | v_identity_heavy | d_identity_heavy | j_identity_heavy | c_identity_heavy | v_sequence_start_heavy | v_sequence_end_heavy | v_germline_start_heavy | v_germline_end_heavy | d_sequence_start_heavy | d_sequence_end_heavy | d_germline_start_heavy | d_germline_end_heavy | j_sequence_start_heavy | j_sequence_end_heavy | j_germline_start_heavy | j_germline_end_heavy | c_sequence_start_heavy | c_sequence_end_heavy | c_germline_start_heavy | c_germline_end_heavy | fwr1_start_heavy | fwr1_end_heavy | cdr1_start_heavy | cdr1_end_heavy | fwr2_start_heavy | fwr2_end_heavy | cdr2_start_heavy | cdr2_end_heavy | fwr3_start_heavy | fwr3_end_heavy | fwr4_start_heavy | fwr4_end_heavy | cdr3_start_heavy | cdr3_end_heavy | np1_heavy | np1_length_heavy | np2_heavy | np2_length_heavy | liable_heavy | vdj_nt_heavy | vdj_aa_heavy | v_mutation_heavy | v_mutation_aa_heavy | d_mutation_heavy | d_mutation_aa_heavy | j_mutation_heavy | j_mutation_aa_heavy | v_penalty_heavy | d_penalty_heavy | j_penalty_heavy | germline_alignment_aa_corrected_heavy | v_germline_alignment_aa_corrected_heavy | sequence_id_light | sequence_light | reference_name_light | locus_light | stop_codon_light | vj_in_frame_light | v_frameshift_light | productive_light | rev_comp_light | complete_vdj_light | v_call_top_light | v_call_light | d_call_top_light | d_call_light | j_call_top_light | j_call_light | c_call_light | sequence_alignment_light | germline_alignment_light | sequence_alignment_aa_light | germline_alignment_aa_light | v_alignment_start_light | v_alignment_end_light | d_alignment_start_light | d_alignment_end_light | j_alignment_start_light | j_alignment_end_light | c_alignment_start_light | c_alignment_end_light | v_sequence_alignment_light | v_sequence_alignment_aa_light | v_germline_alignment_light | v_germline_alignment_aa_light | d_sequence_alignment_light | d_sequence_alignment_aa_light | d_germline_alignment_light | d_germline_alignment_aa_light | j_sequence_alignment_light | j_sequence_alignment_aa_light | j_germline_alignment_light | j_germline_alignment_aa_light | c_sequence_alignment_light | c_sequence_alignment_aa_light | c_germline_alignment_light | c_germline_alignment_aa_light | fwr1_light | fwr1_aa_light | cdr1_light | cdr1_aa_light | fwr2_light | fwr2_aa_light | cdr2_light | cdr2_aa_light | fwr3_light | fwr3_aa_light | fwr4_light | fwr4_aa_light | cdr3_light | cdr3_aa_light | junction_light | junction_length_light | junction_aa_light | junction_aa_length_light | v_score_light | d_score_light | j_score_light | c_score_light | v_cigar_light | d_cigar_light | j_cigar_light | c_cigar_light | v_support_light | d_support_light | j_support_light | c_support_light | v_identity_light | d_identity_light | j_identity_light | c_identity_light | v_sequence_start_light | v_sequence_end_light | v_germline_start_light | v_germline_end_light | d_sequence_start_light | d_sequence_end_light | d_germline_start_light | d_germline_end_light | j_sequence_start_light | j_sequence_end_light | j_germline_start_light | j_germline_end_light | c_sequence_start_light | c_sequence_end_light | c_germline_start_light | c_germline_end_light | fwr1_start_light | fwr1_end_light | cdr1_start_light | cdr1_end_light | fwr2_start_light | fwr2_end_light | cdr2_start_light | cdr2_end_light | fwr3_start_light | fwr3_end_light | fwr4_start_light | fwr4_end_light | cdr3_start_light | cdr3_end_light | np1_light | np1_length_light | np2_light | np2_length_light | liable_light | vdj_nt_light | vdj_aa_light | v_mutation_light | v_mutation_aa_light | d_mutation_light | d_mutation_aa_light | j_mutation_light | j_mutation_aa_light | v_penalty_light | d_penalty_light | j_penalty_light | germline_alignment_aa_corrected_light | v_germline_alignment_aa_corrected_light | HTO | iGL_aa_heavy | iGL_aa_light | mutations_heavy | mutations_light | 100bW | is_vrc01_class | cottrell_focused_v_common_heavy_positive | cottrell_focused_v_common_heavy_negative | cottrell_focused_v_common_score | hcdr3_len | lcdr3_len | top_c_call | cluster | is_centroid | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
7268 | G002-630_2_8_eODGT8_P02_GTCACAAGTTGATTGC-1 | G002-630 | G002630 | 2 | 8 | V200 | eODGT8 | PBMC | 2022-09-30 | P02 | HT08 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002 | P02 | 2022-09-30 | 0 | 0 | 0 | 0 | 0 | SI-TT-H6 | SI-TN-H6 | 221006_VH00497_31_AAAVKCLHV | 221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path | vdj-SI-TT-H6 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path | cso-SI-TN-H6 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/vdj/vdj_output_0004 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/cso/cso_output_0004 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/vdj/vdj_output_0004/outs/sadie_airr.feather | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/vdj/vdj_output_0004/outs/paired_sadie_airr.feather | GTCACAAGTTGATTGC-1 | GTCACAAGTTGATTGC-1_contig_2 | GAGAGCATCACCCAGCAACCACATCTGTCCTCTAGAGAATCCCCTGAGAGCTCCGTTCCTCACCATGGACTGGACCTGGAGGATTCTCTTCTTGGTGGCAGCAGCCACAGGAGCCCACTCCCAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGCATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGTGCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGC | human | IGH | False | True | False | True | False | True | IGHV1-2*02 | IGHV1-2*02 | IGHD1-26*01 | IGHD1-26*01 | IGHJ4*02 | IGHJ4*02 | IGHG1*01 | CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGCATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGTGCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG | CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGGATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTGTATTACTGTGCGAGAGANNNGTATAGTGGGAGCTACTNNNTTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG | QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGCINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCARDLYGGSYSVDYWGQGTLVTVSS | QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCARXXYSGSYXXDYWGQGTLVTVSS | 1 | 296 | 300 | 316 | 320 | 361 | 361 | 428 | CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGCATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGTGCGAGAGA | QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGCINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCAR | CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGGATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTGTATTACTGTGCGAGAGA | QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCAR | GTATGGTGGGAGCTACT | YGGSY | GTATAGTGGGAGCTACT | YSGSY | TTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG | DYWGQGTLVTVSS | TTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG | DYWGQGTLVTVSS | GCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGC | ASTKGPSVFPLAPSSKSTSGGTA | GCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGC | ASTKGPSVFPLAPSSKSTSGGTA | CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCT | QVQLVQSGAEVKKPGASVKVSCKAS | GGATACACCTTCACCGGCTACTAT | GYTFTGYY | ATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGC | MHWVRQAPGQGLEWMGC | ATCAACCCTAACAGTGGTGGCACA | INPNSGGT | AACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGT | NYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYC | TGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA | WGQGTLVTVSS | GCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTAC | ARDLYGGSYSVDY | TGTGCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTACTGG | 45 | CARDLYGGSYSVDYW | 15 | 453.689 | 25.359 | 68.153 | 135.293 | 121S296M132S | 420S1N17M112S2N | 440S6N42M67S | 481S68M226N | 1.482e-129 | 0.00309 | 1.321e-15 | 4.201e-35 | 0.98986 | 0.94118 | 1 | 100 | 122 | 417 | 1 | 296 | 421 | 437 | 2 | 18 | 441 | 482 | 7 | 48 | 482 | 549 | 1 | 68 | 122 | 196 | 197 | 220 | 221 | 271 | 272 | 295 | 296 | 409 | 449 | 481 | 410 | 448 | TCT | 3 | CGG | 3 | False | CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGCATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGTGCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA | QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGCINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCARDLYGGSYSVDYWGQGTLVTVSS | 0.01014 | 0.0102041 | 0.05882 | 0.2 | 0 | 0 | -1 | -1 | -1 | False | False | GTCACAAGTTGATTGC-1_contig_1 | AGGAGTCAGACCCAGTCAGGACACAGCATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTGCTCTGGTTCCCAGGTTCCAGATGCGACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCCCCTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGC | human | IGK | False | True | False | True | False | True | IGKV1-12*01 | IGKV1-1201,IGKV1-1202 | IGKJ4*01 | IGKJ4*01 | IGKC*01 | GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCCCCTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAAC | GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAGCTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCCNCTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAAC | DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSASGTDFTLTISSLQPEDFATYYCQQANSFPLTFGGGTKVEIK | DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQANSFPLTFGGGTKVEIK | 1 | 284 | 286 | 322 | 322 | 458 | GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCC | DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSASGTDFTLTISSLQPEDFATYYCQQANSFP | GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAGCTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCC | DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQANSFP | CTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAAC | LTFGGGTKVEIK | CTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAAC | LTFGGGTKVEIK | CGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGC | RTVAAPSVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNA | CGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGC | RTVAAPSVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNA | GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGT | DIQMTQSPSSVSASVGDRVTITCRAS | CAGGGTATTAGCAGCTGG | QGISSW | TTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTAT | LAWYQQKPGKAPKLLIY | GCTGCATCC | AAS | AGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGT | SLQSGVPSRFSGSASGTDFTLTISSLQPEDFATYYC | TTCGGCGGAGGGACCAAGGTGGAGATCAAA | FGGGTKVEIK | CAACAGGCTAACAGTTTCCCCCTCACT | QQANSFPLT | TGTCAACAGGCTAACAGTTTCCCCCTCACTTTC | 33 | CQQANSFPLTF | 11 | 438.107 | nan | 60.229 | 272.075 | 93S284M174S3N | 378S1N37M136S | 414S137M184N | 7.292e-125 | nan | 3.221e-13 | 2.814e-76 | 0.99296 | nan | 1 | 100 | 94 | 377 | 1 | 284 | 379 | 415 | 2 | 38 | 415 | 551 | 1 | 137 | 94 | 171 | 172 | 189 | 190 | 240 | 241 | 249 | 250 | 357 | 385 | 414 | 358 | 384 | C | 1 | False | GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCCCCTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAA | DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSASGTDFTLTISSLQPEDFATYYCQQANSFPLTFGGGTKVEIK | 0.00704002 | 0.0105263 | nan | nan | 0 | 0 | -1 | -1 | -1 | False | False | HT08 | QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCARDLYSGSYSVDYWGQGTLVTVSS | DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQANSFPLTFGGGTKVEIK | ['W50C' 'S98G'] | ['G66A'] | False | False | [] | ['50'] | -1 | 13 | 9 | IGHG | G002630_False_IGHG_320 | True | |||||||||||||||
10901 | G002-341_2_4_eODGT8_P02_CCTAAAGGTCAAACTC-1 | G002-341 | G002341 | 2 | 4 | V160 | eODGT8 | PBMC | 2022-10-07 | P02 | HT07 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003 | P02 | 2022-10-07 | 0 | 0 | 0 | 0 | 0 | SI-TT-H5 | SI-TN-C7 | 221019_VH00497_32_AAANGGVM5 | 221019_VH00497_32_AAANGGVM5 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/221019_VH00497_32_AAANGGVM5 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/221019_VH00497_32_AAANGGVM5 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/demultiplexed/d1191380460aa54876be7325a32a84c7/outs/fastq_path | vdj-SI-TT-H5 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/demultiplexed/d1191380460aa54876be7325a32a84c7/outs/fastq_path | cso-SI-TN-C7 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/vdj/vdj_output_0007 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/cso/cso_output_0007 | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/vdj/vdj_output_0007/outs/sadie_airr.feather | /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/vdj/vdj_output_0007/outs/paired_sadie_airr.feather | CCTAAAGGTCAAACTC-1 | CCTAAAGGTCAAACTC-1_contig_1 | AGATCTCAGAGAGGAGCCTTAGCCCTGGACTCCAAGGCCTTTCCACTTGGTGATCAGCACTGAGCACAGAGGACTCACCATGGAGTTGGGGCTGAGCTGGGTTTTCCTTGTTGCTATTTTAGAAGGTGTCCAGTGTGAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGA | human | IGH | False | True | False | True | False | True | IGHV3-7*04 | IGHV3-7*04 | IGHD2-15*01 | IGHD2-15*01 | IGHJ5*02 | IGHJ5*02 | IGHG1*01 | GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG | GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGANNNGGTGGTAGNNNNCTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG | EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARDWVEGPWFDPWGQGTLVTVSS | EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARXXVVXXWFDPWGQGTLVTVSS | 1 | 296 | 300 | 307 | 312 | 358 | 358 | 540 | GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGA | EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCAR | GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGA | EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCAR | GGTGGAAG | VE | GGTGGTAG | VV | CTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG | WFDPWGQGTLVTVSS | CTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG | WFDPWGQGTLVTVSS | GCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGA | ASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSG | GCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGA | ASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSG | GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCT | EVQLVESGGGLVQPGGSLRLSCAAS | GGATTCACCTTTAGTAGCTATTGG | GFTFSSYW | ATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAAC | MSWVRQAPGKGLEWVAN | ATAAAGCAAGATGGAAGTGAGAAA | IKQDGSEK | TACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGT | YYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYC | TGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA | WGQGTLVTVSS | GCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCC | ARDWVEGPWFDP | TGTGCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCCTGG | 42 | CARDWVEGPWFDPW | 14 | 463.037 | 11.095 | 76.078 | 363.264 | 136S296M244S | 435S13N8M233S10N | 447S4N47M182S | 493S183M111N | 2.812e-132 | 75.33 | 6.737e-18 | 1.222e-103 | 1 | 0.875 | 1 | 100 | 137 | 432 | 1 | 296 | 436 | 443 | 14 | 21 | 448 | 494 | 5 | 51 | 494 | 676 | 1 | 183 | 137 | 211 | 212 | 235 | 236 | 286 | 287 | 310 | 311 | 424 | 461 | 493 | 425 | 460 | TTG | 3 | GGCC | 4 | False | GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA | EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARDWVEGPWFDPWGQGTLVTVSS | 0 | 0 | 0.125 | 0.5 | 0 | 0 | -1 | -1 | -1 | False | False | CCTAAAGGTCAAACTC-1_contig_2 | AGCTTCAGCTGTGGTAGAGAAGACAGGATTCAGGACAATCTCCAGCATGGCCGGCTTCCCTCTCCTCCTCACCCTCCTCACTCACTGTGCAGGGTCCTGGGCCCAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGTCGGTCTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCGGTCAGCCCAAGGCTGCCCCATCGGTCACTCTGTTCCCACCCTCCTCTGAGGAGCTTCAAGCCAACAAGGCCACACTGGTGTGTCTCGTAAGTGACTTCTACCCGGGAGCCGTGACAGTGGCCTGGAAGGCAGATGGCAGCCCCGTCAAGGTGGGAGTGGAGACCACCAAACCCTCCAAACAAAGCAAC | human | IGL | False | True | False | True | False | True | IGLV1-47*01 | IGLV1-47*01 | IGLJ7*01 | IGLJ7*01 | IGLC7*01 | CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGTCGGTCTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCG | CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGTNNNNCTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCG | QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSRSVFGGGTQLTVL | QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSXXVFGGGTQLTVL | 1 | 291 | 296 | 331 | 331 | 519 | CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGT | QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLS | CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGT | QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLS | CTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCG | VFGGGTQLTVL | CTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCG | VFGGGTQLTVL | GGTCAGCCCAAGGCTGCCCCATCGGTCACTCTGTTCCCACCCTCCTCTGAGGAGCTTCAAGCCAACAAGGCCACACTGGTGTGTCTCGTAAGTGACTTCTACCCGGGAGCCGTGACAGTGGCCTGGAAGGCAGATGGCAGCCCCGTCAAGGTGGGAGTGGAGACCACCAAACCCTCCAAACAAAGCAAC | GQPKAAPSVTLFPPSSEELQANKATLVCLVSDFYPGAVTVAWKADGSPVKVGVETTKPSKQSN | GGTCAGCCCAAGGCTGCCCCCTCGGTCACTCTGTTCCCACCCTCCTCTGAGGAGCTTCAAGCCAACAAGGCCACACTGGTGTGTCTCGTAAGTGACTTCTACCCGGGAGCCGTGACAGTGGCCTGGAAGGCAGATGGCAGCCCCGTCAAGGTGGGAGTGGAGACCACCAAACCCTCCAAACAAAGCAAC | GQPKAAPSVTLFPPSSEELQANKATLVCLVSDFYPGAVTVAWKADGSPVKVGVETTKPSKQSN | CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGC | QSVLTQPPSASGTPGQRVTISCSGS | AGCTCCAACATCGGAAGTAATTAT | SSNIGSNY | GTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTAT | VYWYQQLPGTAPKLLIY | AGGAATAAT | RNN | CAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGT | QRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYC | TTCGGAGGAGGCACCCAGCTGACCGTCCTC | FGGGTQLTVL | GCAGCATGGGATGACAGCCTGAGTCGGTCTGTG | AAWDDSLSRSV | TGTGCAGCATGGGATGACAGCCTGAGTCGGTCTGTGTTC | 39 | CAAWDDSLSRSVF | 13 | 455.247 | nan | 58.644 | 367.228 | 103S291M228S5N | 398S2N36M188S | 433S189M129N | 5.737e-130 | nan | 1.095e-12 | 7.191e-105 | 1 | nan | 1 | 99.471 | 104 | 394 | 1 | 291 | 399 | 434 | 3 | 38 | 434 | 622 | 1 | 189 | 104 | 178 | 179 | 202 | 203 | 253 | 254 | 262 | 263 | 370 | 404 | 433 | 371 | 403 | CGGT | 4 | False | CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGTCGGTCTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTC | QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSRSVFGGGTQLTVL | 0 | 0 | nan | nan | 0 | 0 | -1 | -1 | -1 | False | False | HT07 | EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARDWVVGPWFDPWGQGTLVTVSS | QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSRSVFGGGTQLTVL | ['V98E'] | [] | False | False | [] | [] | 0 | 12 | 11 | IGHG | G002341_False_IGHG_402 | True |