← К разделам
Genome Sequencing Center
🧬 Геномный секвенирующий центр (NGS)
Модуль #313. National Center of Biotechnology Astana — план «Kazakhstan Genome Initiative» secвенирования 100K-1M казахов для precision medicine. Reference: Beijing Genome Institute BGI Shenzhen 100M+ genomes/year, Sanger Cambridge UK, Broad MIT Cambridge USA. Technology — Illumina NovaSeq 6000 + X 1.5 TB output per run + PacBio Sequel IIe long-read 25 kb avg. Bioinformatics HPC cluster для variant calling + annotation. CLIA + GA4GH Standards + ISO 17025 + СН РК 4.04-09.
1. Состав genomics center 100 000 samples/year
- Sample receiving + accession — barcode scanner Mettler + Aliquot Robotic Hamilton STAR 16-channel × 4;
- DNA extraction lab — QIAGEN QIAcube + Promega Maxwell automated 96-well, output 5-10 μg DNA/sample;
- Library prep — Illumina TruSeq Nano + KAPA HyperPlus automated с Beckman Coulter Biomek FX;
- Sequencing lab — Illumina NovaSeq 6000 × 4 instruments + NovaSeq X 8 instruments (1.5-3 TB/run);
- Long-read PacBio Sequel IIe × 2 + Oxford Nanopore PromethION для structural variants;
- HPC cluster — Dell PowerEdge R750 server 64-core EPYC × 100 nodes + GPU NVIDIA A100 × 50 для variant calling GATK4 + alignment BWA-MEM2;
- Storage — 10 PB Dell EMC Isilon scale-out NAS + 50 PB tape archive long-term;
- Bioinformatics workflow — Nextflow + Snakemake pipelines + AWS Genomics CLI для cloud burst;
- QC lab — Bioanalyzer Agilent 2100 + qPCR Roche LightCycler 480;
- Cold storage — -80 °C ULT freezer × 50 для DNA + RNA + bio-samples;
- Genetics counselling room + clinical reporting team для returning results to patients;
- Variant database GenomicsKZ + integration с ClinVar / dbSNP / gnomAD + protected access GA4GH Beacon;
- Cybersecurity HIPAA compliance + encrypted at-rest + at-transit AES-256;
- Power 1 МВт + UPS 200 kVA + DG 500 kVA backup для 24/7 sequencing runs;
- Adjacent biobank Air Liquide LD-5K LN2 storage 1M sample plasma/tissue.
Упражнение 1 — Sequencing technology
Упражнение 2 — Samples/year
Упражнение 3 — Капекс genomics center
- Illumina NovaSeq 6000 × 4 + X × 8 = 7 млрд
- PacBio Sequel IIe × 2 + Nanopore PromethION × 4 = 2 млрд
- HPC cluster Dell PowerEdge 100 nodes + GPU A100 × 50 = 4 млрд
- Storage Dell EMC Isilon 10 PB + tape 50 PB = 1.5 млрд
- Sample prep robotics Hamilton STAR + Beckman + QIAcube = 0.8 млрд
- QC + biobank LN2 + cold storage = 0.7 млрд
- Building 3000 м² + cleanroom + проектирование 4% + insurance = 2 млрд
Упражнение 4 — Data security
Нормативная база
- CLIA Clinical Laboratory Improvement Amendments
- GA4GH Global Alliance for Genomics and Health Standards
- HIPAA Health Insurance Portability + Accountability
- EU GDPR General Data Protection Regulation
- ISO 15189 — Medical Laboratory Quality
- ISO 17025 — Testing + Calibration Lab
- ISO 27001 — Information Security
- FIPS 140-2 — Cryptography
- Закон РК О персональных данных