A hybrid course: an intro to Next Generation Sequencing technologies, followed by the EdX course “Case Studies in Functional Genomics”

The coming months will be dedicated to learning the nature of the data produced by Next Generation Sequencing (NGS) technologies and their applications, with practical sessions on performing functional genomics studies. The first part will be based on selected papers in functional genomics; we will then continue with the EdX course “Case Studies in Functional Genomics”. We will audit this course on our own before the meetings and discuss the lectures and exercises when we meet. Please review the materials beforehand so that the discussion is effective.

The classes in January will be held at ISTC.

Next session: 18 January, Thursday, at 18:30-20:00. 

Part 4. EdX course: RNA-Seq (Week 1)

Previous session (30 November, Thursday):  Variant calling: theory and practice


Part 1. Intro to NGS technologies: data generation and quality control


Date: 26 October, Thursday, 17:30-19:30

History and classification of sequencing technologies: 

Paper 1: The sequence of sequencers: The history of sequencing DNA

Paper 2: Ten years of next-generation sequencing technology

Slides Oct26: Intro to NGS technologies


Date: 02 November, Thursday, 17:30-19:30

Quality control of data from Illumina sequencers 

The programs that read the image files from sequencers and convert them to sequences, reporting a quality score for each base, are called base callers. The quality of base calling depends on multiple factors, which we will discuss briefly.

Additionally, we will discuss the following paper about various quality score formats:

Paper 3: The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants
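As a warm-up for the paper, here is a minimal Python sketch of how a FASTQ quality string maps to Phred scores and error probabilities. It assumes the two common ASCII offsets (33 for Sanger and recent Illumina files, 64 for older Illumina 1.3+ files) and does not handle the Solexa score scale. The function names are made up for illustration.

```python
from typing import List


def decode_phred(quality: str, offset: int = 33) -> List[int]:
    """Convert a FASTQ quality string to a list of Phred scores.

    offset=33 is the Sanger encoding; offset=64 is the older
    Illumina 1.3+ encoding (assumption: the file really uses Phred scores).
    """
    return [ord(ch) - offset for ch in quality]


def error_probability(q: int) -> float:
    """Phred definition: Q = -10 * log10(p), so p = 10 ** (-Q / 10)."""
    return 10 ** (-q / 10)


if __name__ == "__main__":
    scores = decode_phred("I5!")  # Sanger-encoded example string
    for q in scores:
        print(q, error_probability(q))
```

A score of 20 therefore corresponds to a 1 in 100 chance that the base call is wrong, and a score of 40 to 1 in 10,000.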

Next, we will discuss FastQC, a program for analyzing the quality of reads.

Please watch this video tutorial on FastQC.

Please also read the FastQC documentation in parallel with the two examples of FastQC output: for good and for bad Illumina data.

Slides Nov02: Quality control

Part 2. Short read alignment

Date: 09 November, Thursday, 18:00-19:30

We will start the meeting with the pairwise sequence alignment algorithm. Please have a look at the chapter, from the start up to 2.1.2 Local alignment.
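For those who prefer to read code, below is a minimal Python sketch of global pairwise alignment by dynamic programming (the Needleman-Wunsch approach), assuming that this is what the opening sections of the chapter cover. The scoring values (match +1, mismatch -1, gap -1) and the function name are illustrative assumptions, not taken from the chapter.

```python
from typing import Tuple


def global_alignment(a: str, b: str, match: int = 1,
                     mismatch: int = -1, gap: int = -1) -> Tuple[int, str, str]:
    """Return (score, aligned_a, aligned_b) for one optimal global alignment."""
    n, m = len(a), len(b)

    def sub(i: int, j: int) -> int:
        # Score for aligning a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    # score[i][j] = best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(score[i - 1][j - 1] + sub(i, j),  # (mis)match
                              score[i - 1][j] + gap,            # gap in b
                              score[i][j - 1] + gap)            # gap in a

    # Trace back from the bottom-right corner to recover one alignment.
    top, bottom = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            top.append(a[i - 1]); bottom.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            top.append(a[i - 1]); bottom.append("-"); i -= 1
        else:
            top.append("-"); bottom.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(top)), "".join(reversed(bottom))


if __name__ == "__main__":
    print(global_alignment("ACGT", "AGT"))
```

The table has (len(a)+1) x (len(b)+1) cells, which is why this exact approach is too slow for mapping millions of short reads to a genome and why dedicated short-read aligners are needed.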

We’ll then discuss the first successful attempt at fast short-read alignment:

Paper 4: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome (Bowtie).

We will also discuss the adaptation of the method to mapping RNA-seq reads:

Paper 5: TopHat: discovering splice junctions with RNA-Seq. 

Slides Nov09: Alignment


Part 3. Variant calling: theory and practice

Date: 16 November, Thursday, 18:00-19:30

What is paired-end sequencing? We will review the pros and cons of this technology and the specifics of paired-end read mapping.

Please watch the following presentation: “Paired end and Mate pair Sequencing: What is it and How is it done?”

We will then discuss the concept of variant calling and how it is performed.

Finally, we will discuss the Sequence Alignment/Map (SAM) format. Please have a look at the SAM format specification.
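To make the specification easier to follow, here is a minimal Python sketch that splits one SAM alignment line into its 11 mandatory tab-separated fields and decodes some of the FLAG bits relevant to paired-end reads. The example read is made up, and the function names are hypothetical; for real work a dedicated library is the better choice.

```python
from typing import Dict, List

# The 11 mandatory fields of a SAM alignment line, in order.
SAM_FIELDS = ["QNAME", "FLAG", "RNAME", "POS", "MAPQ", "CIGAR",
              "RNEXT", "PNEXT", "TLEN", "SEQ", "QUAL"]
INT_FIELDS = {"FLAG", "POS", "MAPQ", "PNEXT", "TLEN"}

# A subset of the FLAG bits defined in the SAM specification.
FLAG_BITS = {
    0x1: "paired",
    0x2: "proper_pair",
    0x4: "unmapped",
    0x8: "mate_unmapped",
    0x10: "reverse_strand",
    0x20: "mate_reverse_strand",
    0x40: "first_in_pair",
    0x80: "second_in_pair",
}


def parse_sam_line(line: str) -> Dict[str, object]:
    """Parse one alignment line (not a '@' header line) into a dict."""
    parts = line.rstrip("\n").split("\t")
    if len(parts) < len(SAM_FIELDS):
        raise ValueError("a SAM alignment line needs at least 11 fields")
    record: Dict[str, object] = {}
    for name, value in zip(SAM_FIELDS, parts):
        record[name] = int(value) if name in INT_FIELDS else value
    record["TAGS"] = parts[len(SAM_FIELDS):]  # optional TAG:TYPE:VALUE fields
    return record


def decode_flag(flag: int) -> List[str]:
    """List the names of the FLAG bits that are set."""
    return [name for bit, name in FLAG_BITS.items() if flag & bit]


if __name__ == "__main__":
    # Made-up example record for illustration only.
    line = "read1\t99\tchr1\t100\t60\t5M\t=\t200\t105\tACGTA\tIIIII\tNM:i:0"
    rec = parse_sam_line(line)
    print(rec["RNAME"], rec["POS"], decode_flag(rec["FLAG"]))
```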

Date: 23 November, Thursday, 18:00-19:30

We will start our meeting with a discussion of the variant calling tool HaplotypeCaller from the Genome Analysis Toolkit (GATK). Please have a look at the variant calling tutorial.

Then we will have some practice with HaplotypeCaller. Please try to use it and run the code. The data used in the tutorial can be found in the tutorials folder.

Finally, we’ll go over the SAM format specification.

Date: 30 November, Thursday, 18:00-19:30

After several meetings of unkept promises to discuss the SAM format, we’ll start the session with the SAM discussion.

Next, we’ll talk about the VCF format, in which the end result of variant calling is stored.
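As a preview, here is a minimal Python sketch that parses the eight fixed columns of one VCF data line (CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO). The example variant is made up, the function name is hypothetical, and per-sample genotype columns are ignored.

```python
from typing import Dict


def parse_vcf_line(line: str) -> Dict[str, object]:
    """Parse one VCF data line (not a '#' header line) into a dict."""
    parts = line.rstrip("\n").split("\t")
    if len(parts) < 8:
        raise ValueError("a VCF data line needs at least 8 columns")
    chrom, pos, var_id, ref, alt, qual, flt, info = parts[:8]

    # INFO is a ';'-separated list of KEY=VALUE pairs or bare flags.
    info_dict: Dict[str, object] = {}
    if info != ".":
        for item in info.split(";"):
            key, sep, value = item.partition("=")
            info_dict[key] = value if sep else True

    return {
        "CHROM": chrom,
        "POS": int(pos),                     # 1-based position
        "ID": var_id,
        "REF": ref,
        "ALT": alt.split(","),               # several ALT alleles are allowed
        "QUAL": None if qual == "." else float(qual),
        "FILTER": flt,
        "INFO": info_dict,
    }


if __name__ == "__main__":
    # Made-up example record for illustration only.
    line = "chr1\t12345\trs001\tA\tG,T\t50.0\tPASS\tDP=30;AF=0.5,0.25;DB"
    print(parse_vcf_line(line))
```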

Finally, we’ll introduce genome-wide association studies (GWAS) and, if time is left, look at how such a study is carried out.

Please watch these videos (part1, part2), in which a cool lady with blue hair talks about GWAS.

Part 4. EdX course: RNA-Seq

To enroll in the EdX course “Case Studies in Functional Genomics”, see the EdX instructions.

If you registered for this course previously, please note that it has started again from the beginning; it is better to enroll again than to view the archived materials.

For the next class, please review Week 1 of RNA-seq and do the homework assignments.

Date: 18 January, 2018, Thursday, 18:30-20:00

Part 5. New algorithms in RNA-Seq analysis: pseudoalignment

Materials: TBD


Part 6. EdX course: ChIP-Seq

Materials: see EdX instructions


Part 7. EdX course: Bisulfite-Seq

Materials: see EdX instructions



When and where

Where: The course will be held at BIG, in the Institute of Molecular Biology, on the 3rd floor. Turn left from the elevator, find our sign on one of the doors on the right side of the corridor, and enter.

Note: some classes will be held at ISTC. Please call 094601703 (Lilit) to find the room.

When: Every Thursday, from 18:30 to 20:00. First session: 26 October.


