HW2

1. Use the assemblies.csv file you made in lab and this webpage, http://hibberdlab.com/transrate/metrics.html, to understand what the numbers in that file mean. There is no need to copy the definitions for the homework, but using what you know about how those assemblies were created, discuss the metrics that differ and explain why they might differ.

  1. List the reasons why genome assembly may be complicated.
  2. Describe digital normalization and the metric used for deciding whether or not a read is retained or discarded.
  3. What sequencing strategy would one choose for sequencing a large genome, assuming unlimited $$?