NGS for natural scientist

Gene annotation

When the time comes that you give up on human-readable gene names


Last updated 2 years ago

Essentially, gene annotation is what puts a readable gene name in place of the gene_id on the differential expression Excel list (TIMP1 instead of ENSG00000102265). I prefer Ensembl, where the annotation is the GTF/GFF file housed on the same FTP server as the genome sequence. Readable names are useful for labeling a heatmap in a study that deals with individual genes, but they are becoming less and less indispensable when everything happens from server to server.
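To make the GTF part less abstract, here is a minimal sketch of pulling a gene_id to gene name lookup out of GTF-style lines. It is written in Python with only the standard library, purely as an illustration. The function name `gene_names_from_gtf`, the sample lines, their placeholder coordinates, and the ID `ENSG_HYPOTHETICAL` are all my assumptions and not real Ensembl records, apart from the TIMP1 / ENSG00000102265 pair mentioned above.

```python
import re

# Matches key "value" pairs in the 9th (attribute) column of a GTF line
ATTR_RE = re.compile(r'(\S+) "([^"]*)"')

def gene_names_from_gtf(lines):
    """Return {gene_id: gene_name} built from the 'gene' records of a GTF."""
    mapping = {}
    for line in lines:
        if line.startswith("#"):          # header/comment lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 9 or fields[2] != "gene":
            continue                      # keep only gene-level records
        attrs = dict(ATTR_RE.findall(fields[8]))
        gene_id = attrs.get("gene_id")
        if gene_id:
            # Some genes carry no gene_name; fall back to the ID itself
            mapping[gene_id] = attrs.get("gene_name", gene_id)
    return mapping

# Illustrative lines only: coordinates are placeholders, second gene is hypothetical
sample_gtf = [
    "#!illustrative header\n",
    'X\tensembl_havana\tgene\t100\t200\t.\t+\t.\t'
    'gene_id "ENSG00000102265"; gene_name "TIMP1"; gene_biotype "protein_coding";\n',
    'X\thavana\tgene\t300\t400\t.\t-\t.\t'
    'gene_id "ENSG_HYPOTHETICAL"; gene_biotype "lncRNA";\n',
]

id_to_name = gene_names_from_gtf(sample_gtf)
print(id_to_name)
```

With a real annotation file, the same function could be fed an open file handle instead of the sample list, assuming the file is an uncompressed GTF.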

To annotate genes with a key of one's choice at the data visualization stage, it is simply a matter of matching the gene_id with the designated gene name in the annotation data frame, which means it comes down to data manipulation in R. I will go into details in a later section.
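The matching step itself can be sketched in a few lines. I do this in R in practice, so the plain-Python version below is only a stand-in to show the idea of a left join on gene_id. The lookup table, the rows, the fold-change values, and the helper `add_gene_names` are all made up for illustration.

```python
# Hypothetical lookup, e.g. derived from the annotation GTF
id_to_name = {"ENSG00000102265": "TIMP1"}

# Made-up differential expression rows, for illustration only
de_results = [
    {"gene_id": "ENSG00000102265", "log2FoldChange": 1.0},
    {"gene_id": "ENSG_NOT_IN_ANNOTATION", "log2FoldChange": -1.0},
]

def add_gene_names(rows, lookup):
    """Left join on gene_id: every row is kept, and an ID without a
    match in the lookup falls back to the ID itself as its label."""
    return [
        {**row, "gene_name": lookup.get(row["gene_id"], row["gene_id"])}
        for row in rows
    ]

annotated = add_gene_names(de_results, id_to_name)

# These labels could then be used as row names on a heatmap
labels = [row["gene_name"] for row in annotated]
print(labels)
```

Falling back to the gene_id rather than dropping the row is a design choice here: it keeps the result table the same length as the input, so nothing silently disappears from the plot.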

https://asia.ensembl.org/info/data/ftp/index.html