You may have large amounts of data that you have to manage –examples are video, text, images, IoT, spreadsheets etc.. What if that data initially needs to be accessed frequently by your customers but is then set aside, only to be accessed maybe once a year afterward (if that)?
For example, if an image file is active for analysis in the first month but then is only needed for future correlation, how can you manage both that storage capacity and demand with the responsiveness that customers have come to expect from their data? This data is considered “hot data” because it will be accessed by the customer frequently, and it comprises only a very small percent of the overall amount of data that you may have.
Conversely, what happens when the video or file becomes stale and needs to be stored securely, but with the option to be pulled on-demand when requested? Furthermore, what if this video or file stays archived for over a year or more? This is called “cold data.” That sort of ability to scale up or down in accordance with the volume of customer demand is challenging, particularly when it comes to extremely large files. Managing, storing and providing these large files of cold data can also be very expensive.
So how can you scale up and down to meet demand, keep these files readily available and then store them in an economically viable manner where they can be accessed on-demand, if needed?
Basically, how can you keep your customer happy, while keeping your costs in check?
The hot and cold data storage solution
IBM Storage Scale and Storage Scale System has integration with tape with IBM Storage Archive and can enable up to 125GB/s access to your data. Our purpose is to improve the Total Cost of Ownership (TCO) for your data. Data accessibility can use a large amount of power, which correlates into a potentially significant cost for you. Hot data is saved on your storage locally on Flash storage or high-performance disks/disk arrays through IBM Storage Scale.
We then save cold files to the IBM TS4500 Tape Library, which lowers your cost because the tape drive only uses power when it is accessed (making it a more efficient, greener solution). The TCO to save the data on tape is much lower, as the power and corresponding cost required to access the data is a fraction of the cost when compared to traditional storage. Another important advantage of using our solution is that it helps you quickly and easily access the data.
With these three tools, we are able to provide a rapid, flexible and low-cost solution:
- IBM Storage Scale provides a global data platform for high-performance, next-generation data services. Accelerate your AI initiatives with parallel access to Yottabytes of file and object data using multiple APIs concurrently to the same data.
- IBM Storage Archive is a universal file system that can store large amounts of data that can be scaled up easily (with storage capabilities from petabytes to exabytes).
- IBM TS4500 Tape Library provides flexible and secure storage capacity.
Our overall solution consists of integrating our data lake solution — which includes the IBM TS4500 Tape Library — with IBM Storage Scale and IBM Spectrum Archive into your existing data platform. We use the IBM TS4500 Tape Library, where we can save a huge amount of data. IBM Storage Archive has the capacity to host up to 10,000 cartridges, and each cartridge has a 12-terabyte storage capacity and IBM Storage Scale and Scale System can storage up to 633 YB of file and object data.
Get started
Learn more how to lower the cost of your data lake and data bakehouse : https://www.ibm.com/products/storage-scale