The software you need for bioinformatics or high-performance computing is probably already available for you to use with a few quick commands. No need to compile software from scratch.
Room scheduling for ATRB (Advanced Teaching and Research Building).
Please contact us and we'll be happy to help you find the right storage solution.
The Genome Informatics Facility of the Iowa State University Office of Biotechnology
Split the input sequence file into multiple pieces and run the BLAST search simultaneously on each of the split file.
This site gives a good explanation of how to get started using Gnu Parallel, which can be used to break up data files into chunks that can be run concurrently rather than serially processing through the entire file.
R is a tool for statistics and data modeling. The R programming language is elegant, versatile, and has a highly expressive syntax designed around working with data. R is more than that, though — it also includes extremely powerful graphics capabilities. If you want to easily manipulate your data and present it in compelling ways, R is the tool for you.
An explanation of why vectorizing your R code often results in improved performance.
…the catchphrase used to describe information sets that have extreme volume, variety, and velocity. Additional descriptors that have emerged include veracity, variability, and complexity. ‘Big Data analysis’ conveys the concept of extreme information management. Research conducted by ISU scientists is complex, collaborative, and multidisciplinary.
The High Performance Computing facility in Durham Center provides access to petascale systems that support large-scale computing needs for scientific and instructional endeavors at Iowa State University. View a list of systems and equipment.
Omics Pipe is a Python framework for automating ‘best practice’ next generation sequencing pipelines. Omics Pipe can be run from the command-line by providing it with a YAML parameter file specifying your directory structure and software specific parameters. This executes a parallel automated pipeline on a Distributed Resource Management system (local cluster or Amazon Web Services (AWS)) that efficiently handles job resource allocation, monitoring and restarting.
Learn the basics of common Git workflows used by enterprise teams and assess their ability to fit your individual needs in this Atlassian Git tutorial.