High frequency of shared clonotypes in human T cell receptor repertoires

We provide the Python scripts used to analyze the AIRR sequencing data from the study High frequency of shared clonotypes in human T cell receptor repertoires. Each directory contains a README file that describes the file format for the data. We also provide example data that can be used with each script. The example data is meant for illustration purposes only. These scripts, with the exception of the MongoDB subsampling script, were run on a machine which has 48 cores and 64 GB of RAM.

Datasets

HIP 1

AbHelix (mRNA)

TCRα

TCRβ

Adaptive Biotechnologies (gDNA)

Bulk Sequencing

HIP 2

AbHelix (mRNA)

TCRα

TCRβ

Adaptive Biotechnologies (gDNA)

HIP 3

AbHelix (mRNA)

TCRα

TCRβ

Adaptive Biotechnologies (gDNA)

HIP 4

Adaptive Biotechnologies (gDNA)

HIP 5

Adaptive Biotechnologies (gDNA)

Synthetic repertoires

We used the program IGoR to create synthetic repertoires for TCRβ chains. The synthetic repertoires can be downloaded here. Note that the simHIP1 data corresponds to all files with the prefix tcr_beta_synrep_set1, simHIP2 data corresponds to all files with the prefix tcr_beta_synrep_set2 and simHIP3 data corresponds to all the files with prefix tcr_beta_synrep_set3.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
box-and-whisker		box-and-whisker
compute-overlaps		compute-overlaps
heatmaps		heatmaps
morisita-horn		morisita-horn
sampling_synthetic_repertoires_with_MongoDB		sampling_synthetic_repertoires_with_MongoDB
subsampling		subsampling
README.md		README.md
graphical abstract combine_ec-square.tiff		graphical abstract combine_ec-square.tiff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

High frequency of shared clonotypes in human T cell receptor repertoires

Datasets

HIP 1

AbHelix (mRNA)

Adaptive Biotechnologies (gDNA)

HIP 2

AbHelix (mRNA)

Adaptive Biotechnologies (gDNA)

HIP 3

AbHelix (mRNA)

Adaptive Biotechnologies (gDNA)

HIP 4

Adaptive Biotechnologies (gDNA)

HIP 5

Adaptive Biotechnologies (gDNA)

Synthetic repertoires

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

crowelab/TCRBmanuscript

Folders and files

Latest commit

History

Repository files navigation

High frequency of shared clonotypes in human T cell receptor repertoires

Datasets

HIP 1

AbHelix (mRNA)

Adaptive Biotechnologies (gDNA)

HIP 2

AbHelix (mRNA)

Adaptive Biotechnologies (gDNA)

HIP 3

AbHelix (mRNA)

Adaptive Biotechnologies (gDNA)

HIP 4

Adaptive Biotechnologies (gDNA)

HIP 5

Adaptive Biotechnologies (gDNA)

Synthetic repertoires

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages