10x pipeline

Once validation has produced a validated, merged dataframe, you can run the 10x pipeline, which consists of demultiplexing, VDJ, and feature counting.

full pipeline

CellRanger 6.1

The CellRanger version in this pipeline is 6.1
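Because the pipeline is tied to CellRanger 6.1, it can help to check the installed version before starting a run. The following is a minimal sketch, not part of g00x: `parse_version`, `cellranger_matches`, and `installed_cellranger_version` are hypothetical helpers, and the parsing assumes only that the version string contains a `major.minor` number such as `6.1`.

```python
import re
import subprocess

REQUIRED = (6, 1)  # major, minor version stated in this document


def parse_version(text):
    """Return (major, minor) from the first 'X.Y' found in text, or None."""
    match = re.search(r"(\d+)\.(\d+)", text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def cellranger_matches(text, required=REQUIRED):
    """True when the version string reports the required major.minor."""
    return parse_version(text) == required


def installed_cellranger_version():
    """Ask the cellranger executable on PATH for its version string."""
    result = subprocess.run(
        ["cellranger", "--version"], capture_output=True, text=True, check=True
    )
    return result.stdout.strip()
```

A typical use would be `cellranger_matches(installed_cellranger_version())`, which raises if `cellranger` is missing from `PATH` and returns `False` for any version other than 6.1.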

Flow

The following command parses the files under the flow path and outputs a dataframe for downstream analysis. The dataframe holds the count for each gate, which is what frequency analysis needs.

$ g00x g002 pipeline flow -o g002/G002/output/flow /path/to/flow

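import pandas as pd
from g00x.data import Data
from g00x.flow.flow import parse_flow_data
from pathlib import Path

data = Data()  # inside the CLI this object is taken from ctx.obj["data"]
folder = "path/to/flow"
out = Path("g002/G002/output/flow")  # same value as the -o option above

# Parse the flow files under the folder into a single dataframe
flow_data = parse_flow_data(data, folder)

# Write the dataframe next to `out`, once as feather and once as CSV
output_feather = out.parent / (out.stem + ".feather")
output_csv = out.parent / (out.stem + ".csv")
flow_data.to_feather(output_feather)
flow_data.to_csv(output_csv)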
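The snippet above derives two file names from `out` rather than treating it as a directory. The short sketch below isolates that path logic with the standard library only, so you can see where the files land for a given `-o` value. `output_paths` is a hypothetical helper written for this illustration, and it assumes the CLI applies the same rule as the snippet.

```python
from pathlib import Path


def output_paths(out):
    """Return the (feather, csv) paths derived from `out`.

    Mirrors the rule used above: take the last path component as a stem
    and place both files in its parent directory.
    """
    out_path = Path(out)
    feather = out_path.parent / (out_path.stem + ".feather")
    csv = out_path.parent / (out_path.stem + ".csv")
    return feather, csv


feather, csv = output_paths("g002/G002/output/flow")
print(feather)
print(csv)
```

Under this rule, `-o g002/G002/output/flow` gives `flow.feather` and `flow.csv` side by side in `g002/G002/output`, not files inside a `flow` folder.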

Here is what the flow dataframe will look like.

run_purpose run_date sort_id ptid group weeks visit_id probe_set sample_type sort_software_dv sort_file_type sample_tube gate phenotype value_type extention file_path file_subset value branch easy_name notes sort_pool hashtag
119 PreS 2022-08-25 S6C G002831 2 -5 V091 eODGT8 PBMC DV Summary T1 P11 IgD+/Antigen++ count .csv ['g002/G002/sorting/G002/Prescreens/Prescreen_RunDate220825_UploadDate221021/PopulationSummaryFilesFromDV/PreS_220825_S6C_G002831_V091_eODGT8_PBMC_DV_Summary_T1_a.csv'] ['a'] 50 IgD+ antigen_pos_igd_pos_b_cells nan
2461 PreS 2022-11-04 S6C G002136 2 8 V200 eODGT8 PBMC DV Summary T1 P13 IgD+/KO- count .csv ['g002/G002/sorting/G002/Prescreens/Prescreen_RunDate221104_UploadDate221129/PopulationSummaryFilesFromDV/PreS_221104_S6C_G002136_V200_eODGT8_PBMC_DV_Summary_T1_a.csv'] ['a'] 190682 IgD+ nan
2754 PreS 2022-11-04 S6C G002947 2 8 V200 Cg28v2 PBMC DV Summary T1 P31 IgG-IgM-/IgA+/KO- count .csv ['g002/G002/sorting/G002/Prescreens/Prescreen_RunDate221104_UploadDate221129/PopulationSummaryFilesFromDV/PreS_221104_S6C_G002947_V200_Cg28v2_PBMC_DV_Summary_T1_a.csv'] ['a'] 10498 IgA+ nan
4485 Sort 2022-10-25 S6C G002831 2 4 V160 eODGT8 PBMC DV Summary T1 P27 IgG-IgM-IgD- count .csv ['g002/G002/sorting/G002/Sorts/Sort_RunDate221025_UploadDate221101/ClinicalSamples/PopulationSummaryFilesFromDV/Sort_221025_S6C_G002831_V160_eODGT8_PBMC_HT02_DV_Summary_T1_P03_a.csv'] ['a'] 100696 nan P03 HT02
2343 PreS 2022-11-04 S6C G002136 2 -5 V091 eODGT8 PBMC DV Summary T1 P12 IgD+/Antigen++/KO- count .csv ['g002/G002/sorting/G002/Prescreens/Prescreen_RunDate221104_UploadDate221129/PopulationSummaryFilesFromDV/PreS_221104_S6C_G002136_V091_eODGT8_PBMC_DV_Summary_T1_a.csv'] ['a'] 47 IgD+ epitope_pos_igd_pos_b_cells_rev nan
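Since each row carries the count for one gate, a frequency can be obtained by dividing the count of one gate by the count of another gate for the same sample. The sketch below shows this with pandas on toy data. It is only an illustration under stated assumptions: `gate_frequency` is a hypothetical helper, the participant IDs and counts are invented, and which gate is the right denominator for a given gate depends on the gating scheme, which is not described here.

```python
import pandas as pd

# Toy rows using a few column names from the flow dataframe.
# The participant IDs and counts are invented for illustration only.
flow = pd.DataFrame(
    {
        "ptid": ["PT1", "PT1", "PT2", "PT2"],
        "visit_id": ["V091", "V091", "V091", "V091"],
        "gate": ["P11", "P13", "P11", "P13"],
        "value_type": ["count", "count", "count", "count"],
        "value": [50, 1000, 20, 800],
    }
)


def gate_frequency(df, numerator_gate, denominator_gate):
    """Per-sample ratio of two gate counts (hypothetical helper)."""
    # One row per sample, one column per gate
    counts = df[df["value_type"] == "count"].pivot_table(
        index=["ptid", "visit_id"], columns="gate", values="value", aggfunc="sum"
    )
    return counts[numerator_gate] / counts[denominator_gate]


freq = gate_frequency(flow, "P11", "P13")
print(freq)
```

The result is a Series indexed by `(ptid, visit_id)`, so it can be joined back onto sample metadata by those keys.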

Demultiplexing

The demultiplexing step takes the sequencing path and the flow path as input.

$ g00x g002 pipeline demultiplex -o g002/G002/output/demultiplex -f /path/to/flow -s /path/to/sequencing

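import pandas as pd
from g00x.data import Data
from g00x.sequencing.tenX import run_demultiplex

# NOTE: merge_flow_and_sequencing must also be imported from g00x;
# its module is not shown in this document.

flow_path = "path/to/flow"
sequencing_path = "path/to/sequencing"
out = "g002/G002/output/demultiplex"  # same value as the -o option above
overwrite = False  # placeholder value for the overwrite flag
data = Data()

# Merge the flow and sequencing metadata, then demultiplex
merged_dataframe: pd.DataFrame = merge_flow_and_sequencing(data, flow_path, sequencing_path)  # type: ignore
demultiplex_df = run_demultiplex(data, merged_dataframe, out, overwrite)

# Save the result in both formats
demultiplex_df.to_csv("demultiplex.csv")
demultiplex_df.to_feather("demultiplex.feather")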

The demultiplex algorithm adds the following fields to the demultiplex output:

Column Definition
vdj_run_dir The full path to the vdj run directory, e.g. the Illumina directory
cso_run_dir The full path to the cso run directory, e.g. the Illumina directory
vdj_sample_name The unique vdj sample name given to each row
cso_sample_name The unique cso sample name given to each row
vdj_fastq_dir The full path to the vdj fastq directory
cso_fastq_dir The full path to the cso fastq directory

An example of a demultiplexing output dataframe is found below.

ptid group weeks visit_id probe_set sample_type run_date sort_pool hashtag run_dir_path pool_number sorted_date vdj_sequencing_replicate cso_sequencing_replicate vdj_lirary_replicate cso_library_replicate bio_replicate vdj_index feature_index vdj_run_id cso_run_id vdj_run_dir_path cso_run_dir_path vdj_fastq_dir vdj_sample_name cso_fastq_dir cso_sample_name
0 G002516 1 -5 V091 eODGT8 PBMC 2022-09-27 P01 HT01 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002 P01 2022-09-27 0 0 0 0 0 SI-TT-D6 SI-TN-D6 221006_VH00497_31_AAAVKCLHV 221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path vdj-SI-TT-D6 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path cso-SI-TN-D6
1 G002516 1 4 V160 eODGT8 PBMC 2022-09-27 P01 HT02 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002 P01 2022-09-27 0 0 0 0 0 SI-TT-D6 SI-TN-D6 221006_VH00497_31_AAAVKCLHV 221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path vdj-SI-TT-D6 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path cso-SI-TN-D6
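Before passing a demultiplex dataframe to the later steps, it can be useful to confirm that the added fields are present. The sketch below is a plain-Python check and not part of g00x. `missing_columns` is a hypothetical helper, and the expected names are copied from the example header above, which spells the run-directory columns with a `_path` suffix.

```python
# Column names taken from the example demultiplex header above
EXPECTED_COLUMNS = (
    "vdj_run_dir_path",
    "cso_run_dir_path",
    "vdj_sample_name",
    "cso_sample_name",
    "vdj_fastq_dir",
    "cso_fastq_dir",
)


def missing_columns(columns, expected=EXPECTED_COLUMNS):
    """Return the expected column names that are absent, in a stable order."""
    present = set(columns)
    return [name for name in expected if name not in present]


# Example with an incomplete header
print(missing_columns(["ptid", "vdj_fastq_dir", "cso_fastq_dir"]))
```

With a pandas dataframe you would call `missing_columns(demultiplex_df.columns)` and stop early if the returned list is not empty.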

VDJ

The VDJ step takes the output of the demultiplexing step as input; see the full pipeline above.

The following command runs the VDJ pipeline on the demultiplex dataframe and writes vdj.feather to the output folder.

Run it from demultiplexed dataframe
$ g00x g002 pipeline vdj -o g002/G002/output/vdj -d output/demultiplexed.feather

You can run the same with the following Python code.

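import pandas as pd
from g00x.sequencing.tenX import run_vdj
from g00x.data import Data

demultiplex_dataframe_path = "output/demultiplexed.feather"  # same as the -d option above
out = "g002/G002/output/vdj"  # same as the -o option above

data = Data()
demultiplex_dataframe = pd.read_feather(demultiplex_dataframe_path)
run_vdj(data, demultiplex_dataframe, out)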

The output VDJ dataframe contains one additional field:

Column Definition
vdj_output The full path to the vdj output folder

CSO

The CSO pipeline runs cellranger count and outputs a feature matrix. Like the VDJ pipeline, it takes the demultiplex dataframe as input.

The following command runs the CSO pipeline on the demultiplex dataframe and writes cso.feather to the output folder.

Run it from demultiplexed dataframe
$ g00x g002 pipeline cso -o g002/G002/output/cso -d output/demultiplexed.feather

You can run the same with the following Python code.

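import pandas as pd
from g00x.sequencing.tenX import run_cso
from g00x.data import Data

demultiplex_dataframe_path = "output/demultiplexed.feather"  # same as the -d option above
out = "g002/G002/output/cso"  # same as the -o option above

data = Data()
demultiplex_dataframe = pd.read_feather(demultiplex_dataframe_path)
run_cso(data, demultiplex_dataframe, out)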

The output CSO dataframe contains one additional field:

Column Definition
cso_output The full path to the cso output folder
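A feature matrix of this kind presumably holds, for each cell barcode, a count per feature barcode, including hashtags such as HT07 or HT08. To give a feel for how such counts can be turned into one hashtag per cell, here is a deliberately simple sketch. It is not the pipeline's algorithm: `top_hashtag`, the `min_ratio` threshold, and the toy counts are all assumptions made for illustration.

```python
def top_hashtag(counts, min_ratio=2.0):
    """Return the dominant hashtag for one cell, or None if it is ambiguous.

    counts: mapping of hashtag name to read/UMI count for a single cell.
    min_ratio: hypothetical threshold; the best hashtag must have at least
    this many times the count of the runner-up to be accepted.
    """
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    if not ranked or ranked[0][1] == 0:
        return None  # no hashtag signal at all
    if len(ranked) > 1 and ranked[0][1] < min_ratio * ranked[1][1]:
        return None  # two hashtags are too close to call
    return ranked[0][0]


# Toy counts, invented for illustration
print(top_hashtag({"HT07": 120, "HT08": 3}))
print(top_hashtag({"HT07": 40, "HT08": 35}))
```

Cells that return `None` would be left unassigned under this rule, which errs toward dropping a cell over assigning it to the wrong participant.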

AIRR

The VDJ and CSO outputs can now be combined into the final sequencing dataframe, which is the one used for analysis.
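The dependencies between the steps described in this document can be written down explicitly: VDJ and CSO both read the demultiplex output, and AIRR reads both of theirs. The sketch below uses the standard-library `graphlib` module (Python 3.9+) to derive a valid run order from that description. The `DEPENDS_ON` mapping is an illustration built from this page, not something g00x exposes.

```python
from graphlib import TopologicalSorter

# Each step maps to the set of steps whose output it needs
DEPENDS_ON = {
    "demultiplex": set(),
    "vdj": {"demultiplex"},
    "cso": {"demultiplex"},
    "airr": {"vdj", "cso"},
}

# static_order() yields every step after all of its dependencies
order = list(TopologicalSorter(DEPENDS_ON).static_order())
print(order)
```

VDJ and CSO do not depend on each other, so their relative order is free; only demultiplex first and AIRR last are fixed.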

The AIRR step does the following:

  1. Run SADIE AIRR on the VDJ contigs to get a formalized AIRR dataframe.
  2. Analyze the feature barcodes and assign cell IDs to the correct participant.
  3. Add the PubIDs to the AIRR dataframe.
  4. Run mutational analysis.
  5. Run iGL assignment on all sequences.
  6. Determine whether each sequence is VRC01 class.
  7. Add mutational sets (find which mutations are VRC01-like).
  8. Cluster sequences.
  9. Determine isotype.
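To make the mutational analysis step more concrete, the sketch below lists the amino acid differences between a germline sequence and a mature sequence of the same length, written as `<germline residue><position><mature residue>`. This is a simplified illustration, not the pipeline's implementation: `point_mutations` is a hypothetical helper, the sequences are toy strings, and positions are plain 1-based indices, whereas the pipeline's labels may follow an antibody numbering scheme.

```python
def point_mutations(germline, mature):
    """List substitutions between two pre-aligned, equal-length sequences.

    Positions are plain 1-based indices into the aligned strings.
    Gap characters ('-') are skipped because they are insertions or
    deletions, not substitutions.
    """
    if len(germline) != len(mature):
        raise ValueError("sequences must be aligned to the same length")
    mutations = []
    for position, (g, m) in enumerate(zip(germline, mature), start=1):
        if g != m and g != "-" and m != "-":
            mutations.append(f"{g}{position}{m}")
    return mutations


# Toy sequences, invented for illustration
print(point_mutations("QVQLW", "QVQLC"))
```

The same comparison applied to an inferred germline (iGL) sequence and the observed sequence gives a per-sequence mutation list that later steps can compare against a reference set.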

The following command runs the AIRR pipeline on the VDJ and CSO dataframes and writes airr.feather to the output folder.

Run it from VDJ and CSO dataframes
$ g00x g002 pipeline airr -o g002/G002/output/airr -v output/vdj.feather -c output/cso.feather

You can run the same with the following Python code.

import pandas as pd
from g00x.data import Data
from g00x.sequencing.airr import run_airr

vdj_out = "output/vdj.feather"
cso_out = "output/cso.feather"
out = "g002/G002/output/airr"  # same as the -o option above
overwrite = False  # placeholder value for the overwrite flag

data = Data()
vdj_dataframe = pd.read_feather(vdj_out)
cso_dataframe = pd.read_feather(cso_out)
output_df = run_airr(data, vdj_dataframe, cso_dataframe, out, overwrite)

An example of the output dataframe is shown below.

cellid pubID ptid group weeks visit_id probe_set sample_type run_date sort_pool hashtag run_dir_path pool_number sorted_date vdj_sequencing_replicate cso_sequencing_replicate vdj_lirary_replicate cso_library_replicate bio_replicate vdj_index feature_index vdj_run_id cso_run_id vdj_run_dir_path cso_run_dir_path vdj_fastq_dir vdj_sample_name cso_fastq_dir cso_sample_name vdj_output cso_output sadie_airr_path paired_sadie_airr_path cellhash sequence_id_heavy sequence_heavy reference_name_heavy locus_heavy stop_codon_heavy vj_in_frame_heavy v_frameshift_heavy productive_heavy rev_comp_heavy complete_vdj_heavy v_call_top_heavy v_call_heavy d_call_top_heavy d_call_heavy j_call_top_heavy j_call_heavy c_call_heavy sequence_alignment_heavy germline_alignment_heavy sequence_alignment_aa_heavy germline_alignment_aa_heavy v_alignment_start_heavy v_alignment_end_heavy d_alignment_start_heavy d_alignment_end_heavy j_alignment_start_heavy j_alignment_end_heavy c_alignment_start_heavy c_alignment_end_heavy v_sequence_alignment_heavy v_sequence_alignment_aa_heavy v_germline_alignment_heavy v_germline_alignment_aa_heavy d_sequence_alignment_heavy d_sequence_alignment_aa_heavy d_germline_alignment_heavy d_germline_alignment_aa_heavy j_sequence_alignment_heavy j_sequence_alignment_aa_heavy j_germline_alignment_heavy j_germline_alignment_aa_heavy c_sequence_alignment_heavy c_sequence_alignment_aa_heavy c_germline_alignment_heavy c_germline_alignment_aa_heavy fwr1_heavy fwr1_aa_heavy cdr1_heavy cdr1_aa_heavy fwr2_heavy fwr2_aa_heavy cdr2_heavy cdr2_aa_heavy fwr3_heavy fwr3_aa_heavy fwr4_heavy fwr4_aa_heavy cdr3_heavy cdr3_aa_heavy junction_heavy junction_length_heavy junction_aa_heavy junction_aa_length_heavy v_score_heavy d_score_heavy j_score_heavy c_score_heavy v_cigar_heavy d_cigar_heavy j_cigar_heavy c_cigar_heavy v_support_heavy d_support_heavy j_support_heavy c_support_heavy v_identity_heavy d_identity_heavy j_identity_heavy c_identity_heavy v_sequence_start_heavy 
v_sequence_end_heavy v_germline_start_heavy v_germline_end_heavy d_sequence_start_heavy d_sequence_end_heavy d_germline_start_heavy d_germline_end_heavy j_sequence_start_heavy j_sequence_end_heavy j_germline_start_heavy j_germline_end_heavy c_sequence_start_heavy c_sequence_end_heavy c_germline_start_heavy c_germline_end_heavy fwr1_start_heavy fwr1_end_heavy cdr1_start_heavy cdr1_end_heavy fwr2_start_heavy fwr2_end_heavy cdr2_start_heavy cdr2_end_heavy fwr3_start_heavy fwr3_end_heavy fwr4_start_heavy fwr4_end_heavy cdr3_start_heavy cdr3_end_heavy np1_heavy np1_length_heavy np2_heavy np2_length_heavy liable_heavy vdj_nt_heavy vdj_aa_heavy v_mutation_heavy v_mutation_aa_heavy d_mutation_heavy d_mutation_aa_heavy j_mutation_heavy j_mutation_aa_heavy v_penalty_heavy d_penalty_heavy j_penalty_heavy germline_alignment_aa_corrected_heavy v_germline_alignment_aa_corrected_heavy sequence_id_light sequence_light reference_name_light locus_light stop_codon_light vj_in_frame_light v_frameshift_light productive_light rev_comp_light complete_vdj_light v_call_top_light v_call_light d_call_top_light d_call_light j_call_top_light j_call_light c_call_light sequence_alignment_light germline_alignment_light sequence_alignment_aa_light germline_alignment_aa_light v_alignment_start_light v_alignment_end_light d_alignment_start_light d_alignment_end_light j_alignment_start_light j_alignment_end_light c_alignment_start_light c_alignment_end_light v_sequence_alignment_light v_sequence_alignment_aa_light v_germline_alignment_light v_germline_alignment_aa_light d_sequence_alignment_light d_sequence_alignment_aa_light d_germline_alignment_light d_germline_alignment_aa_light j_sequence_alignment_light j_sequence_alignment_aa_light j_germline_alignment_light j_germline_alignment_aa_light c_sequence_alignment_light c_sequence_alignment_aa_light c_germline_alignment_light c_germline_alignment_aa_light fwr1_light fwr1_aa_light cdr1_light cdr1_aa_light fwr2_light fwr2_aa_light cdr2_light 
cdr2_aa_light fwr3_light fwr3_aa_light fwr4_light fwr4_aa_light cdr3_light cdr3_aa_light junction_light junction_length_light junction_aa_light junction_aa_length_light v_score_light d_score_light j_score_light c_score_light v_cigar_light d_cigar_light j_cigar_light c_cigar_light v_support_light d_support_light j_support_light c_support_light v_identity_light d_identity_light j_identity_light c_identity_light v_sequence_start_light v_sequence_end_light v_germline_start_light v_germline_end_light d_sequence_start_light d_sequence_end_light d_germline_start_light d_germline_end_light j_sequence_start_light j_sequence_end_light j_germline_start_light j_germline_end_light c_sequence_start_light c_sequence_end_light c_germline_start_light c_germline_end_light fwr1_start_light fwr1_end_light cdr1_start_light cdr1_end_light fwr2_start_light fwr2_end_light cdr2_start_light cdr2_end_light fwr3_start_light fwr3_end_light fwr4_start_light fwr4_end_light cdr3_start_light cdr3_end_light np1_light np1_length_light np2_light np2_length_light liable_light vdj_nt_light vdj_aa_light v_mutation_light v_mutation_aa_light d_mutation_light d_mutation_aa_light j_mutation_light j_mutation_aa_light v_penalty_light d_penalty_light j_penalty_light germline_alignment_aa_corrected_light v_germline_alignment_aa_corrected_light HTO iGL_aa_heavy iGL_aa_light mutations_heavy mutations_light 100bW is_vrc01_class cottrell_focused_v_common_heavy_positive cottrell_focused_v_common_heavy_negative cottrell_focused_v_common_score hcdr3_len lcdr3_len top_c_call cluster is_centroid
7268 G002-630_2_8_eODGT8_P02_GTCACAAGTTGATTGC-1 G002-630 G002630 2 8 V200 eODGT8 PBMC 2022-09-30 P02 HT08 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002 P02 2022-09-30 0 0 0 0 0 SI-TT-H6 SI-TN-H6 221006_VH00497_31_AAAVKCLHV 221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/221006_VH00497_31_AAAVKCLHV /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path vdj-SI-TT-H6 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/demultiplexed/29cc0e71cb9200226957921707138c5c/outs/fastq_path cso-SI-TN-H6 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/vdj/vdj_output_0004 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/cso/cso_output_0004 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/vdj/vdj_output_0004/outs/sadie_airr.feather /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0002/working_directory/vdj/vdj_output_0004/outs/paired_sadie_airr.feather GTCACAAGTTGATTGC-1 GTCACAAGTTGATTGC-1_contig_2 GAGAGCATCACCCAGCAACCACATCTGTCCTCTAGAGAATCCCCTGAGAGCTCCGTTCCTCACCATGGACTGGACCTGGAGGATTCTCTTCTTGGTGGCAGCAGCCACAGGAGCCCACTCCCAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGCATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGTGCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGC human IGH False True False True False True IGHV1-2*02 IGHV1-2*02 IGHD1-26*01 IGHD1-26*01 IGHJ4*02 
IGHJ4*02 IGHG1*01 CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGCATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGTGCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGGATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTGTATTACTGTGCGAGAGANNNGTATAGTGGGAGCTACTNNNTTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGCINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCARDLYGGSYSVDYWGQGTLVTVSS QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCARXXYSGSYXXDYWGQGTLVTVSS 1 296 300 316 320 361 361 428 CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGCATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGTGCGAGAGA QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGCINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCAR CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGGATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTGTATTACTGTGCGAGAGA QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCAR GTATGGTGGGAGCTACT YGGSY GTATAGTGGGAGCTACT YSGSY TTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG DYWGQGTLVTVSS TTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG DYWGQGTLVTVSS 
GCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGC ASTKGPSVFPLAPSSKSTSGGTA GCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGC ASTKGPSVFPLAPSSKSTSGGTA CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCT QVQLVQSGAEVKKPGASVKVSCKAS GGATACACCTTCACCGGCTACTAT GYTFTGYY ATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGC MHWVRQAPGQGLEWMGC ATCAACCCTAACAGTGGTGGCACA INPNSGGT AACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGT NYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYC TGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA WGQGTLVTVSS GCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTAC ARDLYGGSYSVDY TGTGCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTACTGG 45 CARDLYGGSYSVDYW 15 453.689 25.359 68.153 135.293 121S296M132S 420S1N17M112S2N 440S6N42M67S 481S68M226N 1.482e-129 0.00309 1.321e-15 4.201e-35 0.98986 0.94118 1 100 122 417 1 296 421 437 2 18 441 482 7 48 482 549 1 68 122 196 197 220 221 271 272 295 296 409 449 481 410 448 TCT 3 CGG 3 False CAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGGCCTCAGTGAAGGTCTCCTGCAAGGCTTCTGGATACACCTTCACCGGCTACTATATGCACTGGGTGCGACAGGCCCCTGGACAAGGGCTTGAGTGGATGGGATGCATCAACCCTAACAGTGGTGGCACAAACTATGCACAGAAGTTTCAGGGCAGGGTCACCATGACCAGGGACACGTCCATCAGCACAGCCTACATGGAGCTGAGCAGGCTGAGATCTGACGACACGGCCGTATATTATTGTGCGAGAGATCTGTATGGTGGGAGCTACTCGGTTGACTACTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGCINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCARDLYGGSYSVDYWGQGTLVTVSS 0.01014 0.0102041 0.05882 0.2 0 0 -1 -1 -1 False False GTCACAAGTTGATTGC-1_contig_1 
AGGAGTCAGACCCAGTCAGGACACAGCATGGACATGAGGGTCCCCGCTCAGCTCCTGGGGCTCCTGCTGCTCTGGTTCCCAGGTTCCAGATGCGACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCCCCTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAACGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGC human IGK False True False True False True IGKV1-12*01 IGKV1-1201,IGKV1-1202 IGKJ4*01 IGKJ4*01 IGKC*01 GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCCCCTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAAC GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAGCTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCCNCTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAAC DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSASGTDFTLTISSLQPEDFATYYCQQANSFPLTFGGGTKVEIK DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQANSFPLTFGGGTKVEIK 1 284 286 322 322 458 GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCC DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSASGTDFTLTISSLQPEDFATYYCQQANSFP 
GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAGCTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGGATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCC DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQANSFP CTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAAC LTFGGGTKVEIK CTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAAC LTFGGGTKVEIK CGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGC RTVAAPSVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNA CGAACTGTGGCTGCACCATCTGTCTTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTATCCCAGAGAGGCCAAAGTACAGTGGAAGGTGGATAACGC RTVAAPSVFIFPPSDEQLKSGTASVVCLLNNFYPREAKVQWKVDNA GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGT DIQMTQSPSSVSASVGDRVTITCRAS CAGGGTATTAGCAGCTGG QGISSW TTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTAT LAWYQQKPGKAPKLLIY GCTGCATCC AAS AGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGT SLQSGVPSRFSGSASGTDFTLTISSLQPEDFATYYC TTCGGCGGAGGGACCAAGGTGGAGATCAAA FGGGTKVEIK CAACAGGCTAACAGTTTCCCCCTCACT QQANSFPLT TGTCAACAGGCTAACAGTTTCCCCCTCACTTTC 33 CQQANSFPLTF 11 438.107 nan 60.229 272.075 93S284M174S3N 378S1N37M136S 414S137M184N 7.292e-125 nan 3.221e-13 2.814e-76 0.99296 nan 1 100 94 377 1 284 379 415 2 38 415 551 1 137 94 171 172 189 190 240 241 249 250 357 385 414 358 384 C 1 False GACATCCAGATGACCCAGTCTCCATCTTCCGTGTCTGCATCTGTAGGAGACAGAGTCACCATCACTTGTCGGGCGAGTCAGGGTATTAGCAGCTGGTTAGCCTGGTATCAGCAGAAACCAGGGAAAGCCCCTAAACTCCTGATCTATGCTGCATCCAGTTTGCAAAGTGGGGTCCCATCAAGGTTCAGCGGCAGTGCATCTGGGACAGATTTCACTCTCACCATCAGCAGCCTGCAGCCTGAAGATTTTGCAACTTACTATTGTCAACAGGCTAACAGTTTCCCCCTCACTTTCGGCGGAGGGACCAAGGTGGAGATCAAA 
DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSASGTDFTLTISSLQPEDFATYYCQQANSFPLTFGGGTKVEIK 0.00704002 0.0105263 nan nan 0 0 -1 -1 -1 False False HT08 QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYYCARDLYSGSYSVDYWGQGTLVTVSS DIQMTQSPSSVSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYAASSLQSGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQANSFPLTFGGGTKVEIK ['W50C' 'S98G'] ['G66A'] False False [] ['50'] -1 13 9 IGHG G002630_False_IGHG_320 True
10901 G002-341_2_4_eODGT8_P02_CCTAAAGGTCAAACTC-1 G002-341 G002341 2 4 V160 eODGT8 PBMC 2022-10-07 P02 HT07 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003 P02 2022-10-07 0 0 0 0 0 SI-TT-H5 SI-TN-C7 221019_VH00497_32_AAANGGVM5 221019_VH00497_32_AAANGGVM5 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/221019_VH00497_32_AAANGGVM5 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/221019_VH00497_32_AAANGGVM5 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/demultiplexed/d1191380460aa54876be7325a32a84c7/outs/fastq_path vdj-SI-TT-H5 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/demultiplexed/d1191380460aa54876be7325a32a84c7/outs/fastq_path cso-SI-TN-C7 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/vdj/vdj_output_0007 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/cso/cso_output_0007 /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/vdj/vdj_output_0007/outs/sadie_airr.feather /mnt/fsx/workspace/jwillis/repos/G00x/g002/G002/sequencing/G002/run0003/working_directory/vdj/vdj_output_0007/outs/paired_sadie_airr.feather CCTAAAGGTCAAACTC-1 CCTAAAGGTCAAACTC-1_contig_1 
AGATCTCAGAGAGGAGCCTTAGCCCTGGACTCCAAGGCCTTTCCACTTGGTGATCAGCACTGAGCACAGAGGACTCACCATGGAGTTGGGGCTGAGCTGGGTTTTCCTTGTTGCTATTTTAGAAGGTGTCCAGTGTGAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAGCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGA human IGH False True False True False True IGHV3-7*04 IGHV3-7*04 IGHD2-15*01 IGHD2-15*01 IGHJ5*02 IGHJ5*02 IGHG1*01 GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGANNNGGTGGTAGNNNNCTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARDWVEGPWFDPWGQGTLVTVSS EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARXXVVXXWFDPWGQGTLVTVSS 1 296 300 307 312 358 358 540 
GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGA EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCAR GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGA EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCAR GGTGGAAG VE GGTGGTAG VV CTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG WFDPWGQGTLVTVSS CTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCAG WFDPWGQGTLVTVSS GCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGA ASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSG GCCTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGA ASTKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVSWNSGALTSGVHTFPAVLQSSG GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCT EVQLVESGGGLVQPGGSLRLSCAAS GGATTCACCTTTAGTAGCTATTGG GFTFSSYW ATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAAC MSWVRQAPGKGLEWVAN ATAAAGCAAGATGGAAGTGAGAAA IKQDGSEK TACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGT YYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYC TGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA WGQGTLVTVSS GCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCC ARDWVEGPWFDP TGTGCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCCTGG 42 CARDWVEGPWFDPW 14 463.037 
11.095 76.078 363.264 136S296M244S 435S13N8M233S10N 447S4N47M182S 493S183M111N 2.812e-132 75.33 6.737e-18 1.222e-103 1 0.875 1 100 137 432 1 296 436 443 14 21 448 494 5 51 494 676 1 183 137 211 212 235 236 286 287 310 311 424 461 493 425 460 TTG 3 GGCC 4 False GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTCCAGCCTGGGGGGTCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTTAGTAGCTATTGGATGAGCTGGGTCCGCCAGGCTCCAGGGAAAGGGCTGGAGTGGGTGGCCAACATAAAGCAAGATGGAAGTGAGAAATACTATGTGGACTCTGTGAAGGGCCGATTCACCATCTCCAGAGACAACGCCAAGAACTCACTGTATCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTGTGTATTACTGTGCGAGGGATTGGGTGGAAGGGCCCTGGTTCGACCCCTGGGGCCAGGGAACCCTGGTCACCGTCTCCTCA EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARDWVEGPWFDPWGQGTLVTVSS 0 0 0.125 0.5 0 0 -1 -1 -1 False False CCTAAAGGTCAAACTC-1_contig_2 AGCTTCAGCTGTGGTAGAGAAGACAGGATTCAGGACAATCTCCAGCATGGCCGGCTTCCCTCTCCTCCTCACCCTCCTCACTCACTGTGCAGGGTCCTGGGCCCAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGTCGGTCTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCGGTCAGCCCAAGGCTGCCCCATCGGTCACTCTGTTCCCACCCTCCTCTGAGGAGCTTCAAGCCAACAAGGCCACACTGGTGTGTCTCGTAAGTGACTTCTACCCGGGAGCCGTGACAGTGGCCTGGAAGGCAGATGGCAGCCCCGTCAAGGTGGGAGTGGAGACCACCAAACCCTCCAAACAAAGCAAC human IGL False True False True False True IGLV1-47*01 IGLV1-47*01 IGLJ7*01 IGLJ7*01 IGLC7*01 CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGTCGGTCTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCG 
CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGTNNNNCTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCG QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSRSVFGGGTQLTVL QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSXXVFGGGTQLTVL 1 291 296 331 331 519 CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGT QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLS CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGT QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLS CTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCG VFGGGTQLTVL CTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTCG VFGGGTQLTVL GGTCAGCCCAAGGCTGCCCCATCGGTCACTCTGTTCCCACCCTCCTCTGAGGAGCTTCAAGCCAACAAGGCCACACTGGTGTGTCTCGTAAGTGACTTCTACCCGGGAGCCGTGACAGTGGCCTGGAAGGCAGATGGCAGCCCCGTCAAGGTGGGAGTGGAGACCACCAAACCCTCCAAACAAAGCAAC GQPKAAPSVTLFPPSSEELQANKATLVCLVSDFYPGAVTVAWKADGSPVKVGVETTKPSKQSN GGTCAGCCCAAGGCTGCCCCCTCGGTCACTCTGTTCCCACCCTCCTCTGAGGAGCTTCAAGCCAACAAGGCCACACTGGTGTGTCTCGTAAGTGACTTCTACCCGGGAGCCGTGACAGTGGCCTGGAAGGCAGATGGCAGCCCCGTCAAGGTGGGAGTGGAGACCACCAAACCCTCCAAACAAAGCAAC GQPKAAPSVTLFPPSSEELQANKATLVCLVSDFYPGAVTVAWKADGSPVKVGVETTKPSKQSN 
CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGC QSVLTQPPSASGTPGQRVTISCSGS AGCTCCAACATCGGAAGTAATTAT SSNIGSNY GTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTAT VYWYQQLPGTAPKLLIY AGGAATAAT RNN CAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGT QRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYC TTCGGAGGAGGCACCCAGCTGACCGTCCTC FGGGTQLTVL GCAGCATGGGATGACAGCCTGAGTCGGTCTGTG AAWDDSLSRSV TGTGCAGCATGGGATGACAGCCTGAGTCGGTCTGTGTTC 39 CAAWDDSLSRSVF 13 455.247 nan 58.644 367.228 103S291M228S5N 398S2N36M188S 433S189M129N 5.737e-130 nan 1.095e-12 7.191e-105 1 nan 1 99.471 104 394 1 291 399 434 3 38 434 622 1 189 104 178 179 202 203 253 254 262 263 370 404 433 371 403 CGGT 4 False CAGTCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGGCAGAGGGTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTATGTATACTGGTACCAGCAGCTCCCAGGAACGGCCCCCAAACTCCTCATCTATAGGAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTGGCACCTCAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGCCTGAGTCGGTCTGTGTTCGGAGGAGGCACCCAGCTGACCGTCCTC QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSRSVFGGGTQLTVL 0 0 nan nan 0 0 -1 -1 -1 False False HT07 EVQLVESGGGLVQPGGSLRLSCAASGFTFSSYWMSWVRQAPGKGLEWVANIKQDGSEKYYVDSVKGRFTISRDNAKNSLYLQMNSLRAEDTAVYYCARDWVVGPWFDPWGQGTLVTVSS QSVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVYWYQQLPGTAPKLLIYRNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAWDDSLSRSVFGGGTQLTVL ['V98E'] [] False False [] [] 0 12 11 IGHG G002341_False_IGHG_402 True
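The example rows above include `v_call_heavy`, `lcdr3_len`, and `is_vrc01_class`. VRC01-class antibodies are commonly described as pairing an IGHV1-2 heavy chain with a five-residue light-chain CDR3. Assuming that working definition (the rule the pipeline actually applies is not given here), a check could be sketched as follows. `looks_vrc01_class` is a hypothetical helper, not a g00x function.

```python
def looks_vrc01_class(v_call_heavy, lcdr3_len):
    """Apply the common working definition of VRC01 class.

    v_call_heavy: heavy-chain V call, e.g. "IGHV1-2*02"; if several calls
    are comma-separated, only the first is considered.
    lcdr3_len: light-chain CDR3 length in amino acids.
    """
    first_call = v_call_heavy.split(",")[0].strip()
    gene = first_call.split("*")[0]  # drop the allele suffix
    return gene == "IGHV1-2" and lcdr3_len == 5


# Values taken from the two example rows above
print(looks_vrc01_class("IGHV1-2*02", 9))
print(looks_vrc01_class("IGHV3-7*04", 11))
```

Both example rows evaluate to `False` under this rule, in line with the `is_vrc01_class` values shown for them: the first has the right V gene but a nine-residue light-chain CDR3, and the second uses a different V gene.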