The subterranaut (pronounced: subterrainnot) was inspired by my work on high-throughput sequence data from the deep terrestrial subsurface.
Coming from a background in biochemistry, I realized that there was a steep learning curve to working with high-throughput sequence data. Raw data files were larger than my iTunes library, and scanning through endless rows in an Excel file felt like a hopeless task. I knew that there must be a better way to handle and analyze my large datasets. This blog contains the steps I took to understand my sequence data. Many of the posts are direct answers to the wide range of questions that I had as I entered this field. I hope others can use this blog as a roadmap to ease their transition from working in the lab to working with high-throughput sequence data.
I hope you enjoy!