One of the Top 10 Medical Universities in the United States
Given the exponential increases in human knowledge across a myriad of scientific disciplines, we found ourselves facing an unprecedented data explosion. Our lab uses a combination of recombinant DNA, DNA sequencing methods, and bioinformatics to analyze the structures and functions of genomes. As knowledge in field of genomics increases, our datasets and analysis result have increased exponentially. We found ourselves at a crossroad in terms of how to mitigate this data challenge. We sent out a request for proposals to many of the top vendors and they returned with some very interesting and cutting edge solutions that would stem the data flood, albeit with a six figures tag. However, I posed one simple question to the vendors, “How will this scale once we pass the inflexion point on our data growth?”. They returned with several upgrade paths that we could transition to over the years which would require us to migrate to newer software versions and hardware architectures. This was a bit of a sticking point for us. Software upgrades? Yes. Entirely new architectures? No.
I had looked at MooseFS several years ago and I liked the replication methods and trash retention system. However, at the time, our existing solution was tied to the hardware manufacture. When I looked at MooseFS this time, their new features such as Atomic Snapshots, Redundancy with Erasure Coding, Fast Disk Recovery, and Fault Tolerance, made me have to get them on the phone! They explained their solutions in detail and let me set up clusters in our own data center to see for ourselves just how powerful their solution could be. They had me sold and we have been using their products ever since. I can add on any hardware that I wish. Data on failed drives is automatically shuttled to other disks so I can hot swap disks from my multiple 90+ disk arrays almost at will. The Redundancy with Erasure Coding allows me to minimize space considerably. No vendor tie in. Use the disks and architectures that you want whether cutting edge and expensive or cheap and after-market.
Our research areas include Statistical Genetics, Genetic Epidemiology, Longevity Genetics, Cardiovascular Disease Genetics and we partner with numerous studies, academics institutions, and medical professionals around the world. During this period of COVID-19 pandemic, we have been tasked to double our research efforts. Our storage requirements have doubled in the span of a week. The MooseFS team reached out to us and asked if we were involved in COVID-19 research. When I confirmed this, they gave us additional tools and extended our license to enable us to face these data growth challenges, today. That is an experience that I never had with our other vendors.
I highly recommend MooseFS if you are looking for a new, dynamic, and cost-effective storage solution.
A top-10 US medical university runs MooseFS as the storage backbone for genomics research — statistical genetics, genetic epidemiology, longevity genetics and cardiovascular-disease genetics — absorbing exponential data growth without vendor lock-in. Atomic snapshots and erasure coding underpin the lab’s reproducibility and capacity strategy across 90+ disk arrays.