Today, GenomeWeb wrote a piece on the STORMSeq pipeline (Scalable Tools for Open source Read Mapping), our newest project in the goal of enabling the public to explore their own personal genetic data. In this pipeline, users upload reads to Amazon S3 and start a webserver in Amazon EC2, where they can set parameters for read mapping and variant calling, all in a graphical user interface. Once they click “GO!”, the pipeline runs and progress and quality control metrics can be monitored, and the final results of the pipeline are uploaded back to Amazon S3. The pipeline itself is free, though the user pays for storage (currently $0.1 per Gb-month) and compute time (currently estimated about $1-2 per exome, $25-35 per genome) on the Amazon cloud. A publication with details about the pipeline is forthcoming, but the pipeline is ready to use now (currently in version 0.8.5) with instructions for use at www.stormseq.org (and the code is available on github.com/konradjk/stormseq).
- Refining the role of de novo protein-truncating variants in neurodevelopmental disorders by using population reference samples. Kosmicki JA, Samocha KE, Howrigan DP, Sanders SJ, Slowikowski K, Lek M, Karczewski KJ, Cutler DJ, Devlin B, Roeder K, Buxbaum JD, Neale BM, MacArthur DG, Wall DP, Robinson EB, Daly MJ
- RT @RobertCGreen: GATC "OG" twitter.com/razibkhan/stat… 1 week ago
- RT @edyong209: A yr ago, I started tracking the gender balance of the sources in my stories. I've since raised it from 25% women to 50%. An… 2 weeks ago
- RT @fdmts: @dgmacarthur Likewise! The ATGU weekly was my favorite recurring meeting. Thanks for letting me sit in. 2 weeks ago
- RT @cdbustamante: @HankGreelyLSJU @carlzimmer And @konradjk just entered the story... 2 weeks ago
- RT @ryanlcollins13: Thrilled to see our paper on SV in disease out today in @GenomeBiology! Huge team effort by @TalkowskiLab over ~4yrs ht… 2 weeks ago