Starfish Storage Marks 10 Years Leading in Unstructured Data Management

Starfish Storage, a leader in metadata-driven unstructured data management, is honored to celebrate its decade-long journey. Concurrently, the company reveals it now oversees an excess of one exabyte of capacity among its clientele. This encompasses major players such as eight out of the top ten pharmaceutical companies globally, seven of the eight Ivy League institutions, supercomputing sites under the Department of Energy, and prominent corporations spanning semiconductor, oil and gas, fintech, automotive, healthcare, consumer products, and media-entertainment sectors.

Starfish services the largest and most demanding file environments in the world. The typical Starfish customer uses high-performance parallel file systems and scale-out NAS to service the needs of scientific computing, AI/ML, engineering, rendering, and other highly demanding production workloads. These environments consist of billions of files, tens and sometimes hundreds of petabytes of capacity, and have a myriad of data management challenges.

Starfish’s metadata-driven approach sets it aside from traditional file management solutions that rely on timestamps and coarse-grained analytics to make policy decisions. At the heart of the Starfish platform is a data catalog specifically designed for unstructured data. The metadata and analytics capabilities of the catalog allow Starfish to service a wide variety of use cases (well beyond the table-stakes use cases of archiving and data movement) and address the nuanced needs of each set of stakeholders. Most importantly, Starfish enables the end users who create and consume files to manage their own data with the appropriate security and safeguards.
Company founder, Jacob Farmer, explains, “In these large, diverse computing facilities where Starfish plays there are a myriad of use cases, often with nuanced implementation details. There are also many stakeholders including the users who create and consume the files as well those who are responsible for paying for storage capacity, various aspects of compliance, and ensuring proper data curation.” Some of the use cases Farmer refers to include archiving, data protection, migrations, cloud bursting, cost accounting, data disposition, ROT (Redundant, Obsolete, and Trivial) cleanup, FAIR data management, and AI/ML workflows.
Related News:

About Author

Taylor Graham, marketing grad with an inner nature to be a perpetual researchist, currently all things IT. Personally and professionally, Taylor is one to know with her tenacity and encouraging spirit. When not working you can find her spending time with friends and family.