Natera Dataset

WES and RNAseq data were obtained from FFPE patient samples. Raw data, including WES (paired BAM) and RNAseq (FASTQ) files, were transferred in batches on a quarterly basis.

Data Processing

  • WES Data: Processed using either the CGL pipeline (Jianhua Zhang Lab, MD Anderson) or the Consensus Calling pipeline (Wenyi Wang Lab).
  • RNAseq Data: Processed by Ganiraju Manyam.

Data Storage

  • WES Data:
    • Batch1–Batch5: /rsrch5/home/gi_med_onc/GIMO_CRC_CentralizedDataStorage/Natera/
    • CGL Results (Batch1–Batch5): /rsrch5/home/gi_med_onc/GIMO_CRC_CentralizedDataStorage/ngs_runs/
    • Batch6 and beyond: /rsrch9/home/gi_med_onc/GIMO_CRC_CentralizedDataStorage/Natera_DNA/
    • CGL Results (Batch6 and beyond): /rsrch9/home/gi_med_onc/GIMO_CRC_CentralizedDataStorage/ngs_runs/
  • RNAseq Data:
    • /rsrch9/home/gi_med_onc/GIMO_CRC_CentralizedDataStorage/Natera_RNA/
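
The WES storage split above can be sketched as a small lookup. This is a minimal illustration, not an existing tool: the function name wes_locations and the dictionary keys are hypothetical, and pairing each CGL results directory with the batch range listed just before it is an assumption based on the order of the list.

```python
from pathlib import PurePosixPath

# Storage roots copied from the Data Storage list.
RSRCH5_ROOT = PurePosixPath("/rsrch5/home/gi_med_onc/GIMO_CRC_CentralizedDataStorage")
RSRCH9_ROOT = PurePosixPath("/rsrch9/home/gi_med_onc/GIMO_CRC_CentralizedDataStorage")


def wes_locations(batch_number: int) -> dict:
    """Return the WES data and CGL results directories for a batch.

    Hypothetical helper: Batch1-Batch5 are on rsrch5, Batch6 and beyond
    on rsrch9. The CGL results pairing is assumed from the list order.
    """
    if batch_number < 1:
        raise ValueError("batch_number must be 1 or greater")
    if batch_number <= 5:
        return {
            "data": RSRCH5_ROOT / "Natera",
            "cgl_results": RSRCH5_ROOT / "ngs_runs",
        }
    return {
        "data": RSRCH9_ROOT / "Natera_DNA",
        "cgl_results": RSRCH9_ROOT / "ngs_runs",
    }


# Example: look up where Batch6 lives.
batch6 = wes_locations(6)
print(batch6["data"])
print(batch6["cgl_results"])
```

PurePosixPath is used so the sketch runs on any machine without touching the file system; it only builds the path strings.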

Data Access

Results, along with clinical mapping, are available on Foundry under:
/MDACC/Colorectal Cancer: LAB09-0373/09-Projects_with_Multiple_Tumor_Locations/Signatera/WES_RNA

For access requests, please contact Kangyu Lin and cc Scott Kopetz.

Statistics

WES:

Batch             Patient_number  MRN_Number (mapped)  CGL pipeline status  Consensus Calling pipeline status  CGL results on Foundry  Consensus results on Foundry
Batch1_11Nov2022  809             800                  Finish               Finish                             YES                     NO
Batch2_02Jun2023  475             473                  Finish               Finish                             YES                     NO
Batch3_19Oct23    290             289                  Finish                                                  YES
Batch4_02Feb2024  251             251                  Finish                                                  YES
Batch5_24May2024  225             225                  Finish                                                  YES
Batch6_15OCT2024  209             209                  Finish                                                  YES
Total             2259            2250

RNAseq:

Batch             Patient_number  MRN_Number (mapped)  Expression calling status  Results on Foundry
Batch1_07OCT2024  998             991                  YES                        YES
Batch2_28OCT2024  476             476                  YES                        YES
Total             1474            1467
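
As a quick consistency check, the Total row of the RNAseq table can be recomputed from the per-batch rows. The sketch below hard-codes the figures from the table; the variable and function names are illustrative only and do not refer to any existing script.

```python
# Per-batch counts from the RNAseq table: (Patient_number, MRN_Number mapped).
rnaseq_batches = {
    "Batch1_07OCT2024": (998, 991),
    "Batch2_28OCT2024": (476, 476),
}

# Total row as stated in the table.
stated_total = (1474, 1467)


def column_totals(batches: dict) -> tuple:
    """Sum the patient and mapped-MRN columns across all batches."""
    patients = sum(p for p, _ in batches.values())
    mapped = sum(m for _, m in batches.values())
    return patients, mapped


computed_total = column_totals(rnaseq_batches)
totals_match = computed_total == stated_total
print(computed_total, totals_match)
```

The same check can be rerun whenever a new batch row is added, before the Total row is updated.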

Total number of samples with both WES and RNAseq:

Patient_number  MRN_Number (mapped)
1227            1223

Last Updated: 3/14/2025