Lesson 2. Sequencing and assembly

Here, you will be working on the sequencing and assembly of your genome. If your species has been sequenced multiple times, focus on just one of the articles. For this topic, focus on the information from the article and not on what you can find in databases, where there may be results from different sequencing projects for the same species.

(a) Which sequencing technology/technologies were used? Provide the most relevant data, such as the number and size of the reads, the total amount of sequence obtained, redundancy, and average quality per base (Phred score).

(b) Are these next-generation sequencing technologies or classical methods? What are the main advantages and disadvantages of the technology/technologies used?

(c) What type of assembly was performed, de novo or reference-based mapping? Which program(s) were used? What strategy was followed to achieve scaffolding? Provide the most relevant data from the obtained assembly, such as the number and size of contigs, the number and size of scaffolds, and the average quality per base (Phred score) of the final sequence.

(d) Based on what you know about your genome, do you think the assembly was difficult? How confident are you in the obtained result? Justify your answer.