DataCenterNews Asia
Specialist data center news for Asia
Data gravity and its impact on data storage infrastructure

By Contributor
Wed 1 Sep 2021

Article by Seagate country manager for A/NZ Jeff Park.
 

Data gravity affects the entire IT infrastructure; it should be a major consideration when planning data management strategies. It’s important to ensure that no single data set exerts an uncontrollable force on the rest of the IT and application ecosystem.

Data is now an essential asset to businesses in every vertical, just as physical capital and intellectual property are. With ever-increasing quantities of both structured and unstructured data, data growth will continue at unprecedented rates in the coming years. 

Meanwhile, data sprawl — the increasing degree to which business data no longer resides in one location but is scattered across data centers and geographies — adds complexity to the challenges of managing data’s growth, movement, and activation.

Enterprises must implement a strategy to efficiently manage mass data across cloud, edge, and endpoint environments. And it’s more critical than ever to develop a calculated plan when designing data storage infrastructure at scale.

As enterprises aim to overcome the cost and complexity of storing, moving, and activating data at scale, they should seek better economics, less friction, and a simpler experience: in short, a new way to handle data.

The concept of data gravity is a vital element to consider in these efforts.

According to a new Seagate-sponsored report from IDC, as the storage associated with massive data sets continues to grow, so will its gravitational force on other elements within the IT universe.

Data gravity is a consequence of data’s volume and level of activation. Basic physics provides a suitable analogy: a body with greater mass has a greater gravitational effect on the bodies surrounding it. “Workloads with the largest volumes of stored data exhibit the largest mass within their ‘universe,’ attracting applications, services, and other infrastructure resources into their orbit,” according to the IDC report.

A large and active dataset will necessarily affect the location and treatment of the smaller datasets that need to interact with it. Data gravity therefore reflects data lifecycle dynamics and should inform IT architecture decisions.

Consider two datasets: one is 1 petabyte, and the other is 1 gigabyte. To integrate the two sets, it is more efficient to move the smaller dataset to the location of the larger dataset. As a result, the storage system with the 1-petabyte set now stores the 1-gigabyte set as well. Because large datasets will ‘attract’ other smaller datasets, large databases tend to accrete data, further increasing their overall data gravity.
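To put rough numbers on this comparison, the sketch below estimates how long each dataset would take to move over a network link. The 10 Gbit/s link speed, the decimal units and the function name are illustrative assumptions, not figures from the IDC report, and protocol overhead is ignored.

```python
# Illustrative only: the link speed and decimal units are assumptions,
# not figures from the IDC report.
GIGABYTE = 10**9   # bytes (decimal)
PETABYTE = 10**15  # bytes (decimal)


def transfer_seconds(size_bytes: int, link_bits_per_second: float) -> float:
    """Ideal transfer time for size_bytes over a link, ignoring protocol overhead."""
    return size_bytes * 8 / link_bits_per_second


LINK_10_GBPS = 10 * 10**9  # hypothetical 10 Gbit/s link

small = transfer_seconds(GIGABYTE, LINK_10_GBPS)
large = transfer_seconds(PETABYTE, LINK_10_GBPS)

print(f"1 GB: {small:.1f} seconds")
print(f"1 PB: {large / 86_400:.1f} days")
```

Under these assumptions, the smaller set moves in under a second while the larger one would need more than nine days, which is why the smaller dataset is the one that travels.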

Managing, analysing and activating data also relies on applications and services, whether those are provided by a private or public cloud vendor or an on-prem data management team. Applications collect and generate data, and much of their processing has to happen on that data. Naturally, the more massive a data set grows, the harder it is to use that data unless it is close to the applications, so applications are often moved close to the data sets. From on-premises data centers to public clouds and edge computing, data gravity is a property that spans the entire IT infrastructure.

But according to the IDC report, such massive data sets can become like black holes, “trapping stored data, applications, and services in a single location, unless IT environments are architected to allow the migration and management of stored data, along with the applications and services that rely on it, regardless of operational location.”

Because data gravity can affect an entire IT infrastructure, it should be a major design consideration when planning data management strategies. An important goal in designing a data ecosystem, according to IDC, is to “ensure that no single data set exerts uncontrollable force on the rest of the IT and application ecosystem.”
 

Ensuring applications have access to data, regardless of location

IT architecture strategy should put mass storage and data movement at its centre. This begins with optimising data location. A data-centred architecture brings applications, services and user interaction closer to the location where data resides, rather than relying on time-consuming and often costly long-distance transfers of mass data to and from centralised service providers.

IDC notes that “one way to mitigate the impact of data gravity is to ensure that stored data is colocated adjacent to applications regardless of location.”

This model can be accomplished by leveraging colocated data centers that bring together multiple private and public cloud service providers.

The fundamental goal of a data-centred architecture is data accessibility. Accessibility can impact future business innovation, improve the ability to generate metadata and new datasets, enable search and discovery, and further empower data scientists to deploy data for machine learning and AI.

Putting data at the centre of IT architecture can also improve application performance. The overall reliability and durability of the data benefit as well: reliability is the ability to access data when needed, and durability is the ability to preserve data over extended periods.
 

Put data at the centre of IT strategy

Altogether, these considerations have significant implications for enterprise data management planning, from defining an overall IT strategy to formulating a business initiative. Planning out the necessary workloads and jobs means accounting for data gravity.

Key questions to ask include: 

  • What is the volume of data being generated or consumed? 
  • How is data distributed across the data center, private clouds, public clouds, edge devices, and remote and branch offices? 
  • What is the velocity of the data being transmitted across the entire IT ecosystem? 

Addressing these considerations will increase the efficiency of the data infrastructure and can reduce costly data pipeline issues down the line.

IDC advises in its report, “Don’t let a single workload or operational location dictate the movement of storage or data resources.” Because data has gravity, data infrastructure must be designed to prevent large individual workloads from exerting a dominant gravitational pull on storage resources.

This means always maintaining awareness about which datasets are being pulled where, the most efficient path to move the data, and what helps those workloads run the best. This can also mean automating the movement of data to reduce storage costs, or moving lower-performing datasets that are not immediately needed. 
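As one hypothetical illustration of such automation, the sketch below flags datasets that have not been accessed recently as candidates for a cheaper storage tier. The `Dataset` fields, the 90-day idle threshold, the tier labels and the sample catalogue are all assumptions made for illustration, not recommendations from IDC or Seagate.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass
class Dataset:
    name: str
    size_tb: float
    last_accessed: date
    tier: str = "hot"  # hypothetical tier labels: "hot" or "cold"


def cold_candidates(datasets, today, idle_days=90):
    """Return hot-tier datasets not accessed within idle_days (assumed threshold)."""
    cutoff = today - timedelta(days=idle_days)
    return [d for d in datasets if d.tier == "hot" and d.last_accessed < cutoff]


# Made-up sample catalogue for demonstration purposes.
today = date(2021, 9, 1)
catalogue = [
    Dataset("sensor-archive", 400.0, date(2021, 1, 15)),
    Dataset("orders-live", 12.0, date(2021, 8, 30)),
]

for d in cold_candidates(catalogue, today):
    print(f"move {d.name} ({d.size_tb} TB) to cold tier")
```

A real policy would also weigh the cost of the move itself and how close the destination sits to the applications that still read the data.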

Putting these ideas into action means deploying data architecture, infrastructure and management processes that are adaptive. While an organisation may have a good idea of its data gravity considerations today, those considerations may not be the same five years from now.

“Not every enterprise manages multiple massive data sets, but many already do,” IDC notes in the report. “And, given the pace of digitisation of business and the importance placed on the value of enterprise data and data gathering, many organisations will find themselves managing massive data sets in the near future.”

Every data management system should be able to change to accommodate new data requirements. Data management, and the data architecture that supports it, must be agile and adapt to shifting business needs.
