Learning objectives

Introduction

This lab follows the Data Carpentry’s Data Wrangling and Processing for Genomics. Arianna Kazemi has tested it on the MGPHCC. She had the following issues.

  1. It took a long time to down the genome files using curl. If you are having trouble downloading the E.coli genome files. You can download them from here, unzip the folder and then use scp to transfer them to your MGHPC https://drive.google.com/file/d/1UYj7H05p5AtkCCxlmalE-et0MMOlAJ58/view?usp=sharing . See the See the MGHPCC wiki for scp instructions

  2. When running trimmomatic, since you need the path to the jar file and because the path the workshop supplies to the adapter files doesn’t work. These commands worked on MGHPCC
  1. Download IGV to your own computer. I tried to use it on the MGHPCC but have run into memory issues.

Exercises

For now work through the pipeline so that you have data that can be visualized in IGV.