{"id":617005,"date":"2023-03-12T08:57:24","date_gmt":"2023-03-12T13:57:24","guid":{"rendered":"https:\/\/news.sellorbuyhomefast.com\/index.php\/2023\/03\/12\/3-ways-data-teams-can-avoid-a-tragedy-of-the-cloud-commons\/"},"modified":"2023-03-12T08:57:24","modified_gmt":"2023-03-12T13:57:24","slug":"3-ways-data-teams-can-avoid-a-tragedy-of-the-cloud-commons","status":"publish","type":"post","link":"https:\/\/newsycanuse.com\/index.php\/2023\/03\/12\/3-ways-data-teams-can-avoid-a-tragedy-of-the-cloud-commons\/","title":{"rendered":"3 ways data teams can avoid a tragedy of the cloud commons"},"content":{"rendered":"<div>\n<section>\n<p><time title=\"2023-03-11T16:20:00+00:00\" datetime=\"2023-03-11T16:20:00+00:00\">March 11, 2023 8:20 AM<\/time>\n\t\t\t<\/p>\n<\/section>\n<div>\n<p><img loading=\"lazy\" decoding=\"async\" width=\"750\" height=\"379\" src=\"https:\/\/venturebeat.com\/wp-content\/uploads\/2022\/08\/GettyImages-940308630.jpg?fit=750%2C379&#038;strip=all\" alt=\"Humorous mobile cloud computing, securing identities conceptual image.\"><\/p>\n<div>\n<p><em>Image Credit: gremlin\/Getty<\/em><\/p>\n<\/div><\/div>\n<\/p><\/div>\n<div id=\"primary\" role=\"main\">\n<article id=\"post-2845632\">\n<div>\n<p>In 1833, British economist William Forster Lloyd coined the term \u201cTragedy of the Commons\u201d to describe a situation in which individual users who have open access to a collective resource, unimpeded by formal rules that govern access and use, will act according to their own self-interest and contrary to the common good.<\/p>\n<p>In Lloyd\u2019s famous hypothetical, a group of individual herders share a public pasture for grazing their cattle. As each herder seeks to optimize his or her own economic gain by giving more of his or her cows access to graze, the commons eventually becomes depleted to the detriment of all.\u00a0<\/p>\n<p>In other words, when an infinite and seemingly \u201cfree\u201d resource is offered up to be used with little consideration of cost or consequence, it becomes unsustainable.\u00a0<\/p>\n<p>There\u2019s a similar phenomenon happening in today\u2019s cloud-first data operations (dataops) environment. The \u201ccommons\u201d in this case is the <a href=\"https:\/\/venturebeat.com\/data-infrastructure\/multicloud-isnt-working-bring-on-the-supercloud\/\" target=\"_blank\" rel=\"noreferrer noopener\">public cloud<\/a>, a shared resource that appears to be free to the data teams using it since they have little visibility into what their cloud usage actually costs.\u00a0<\/p>\n<h2 id=\"h-crisis-in-the-cloud\">Crisis in the cloud<\/h2>\n<p>Industry analysts estimate that at least <a href=\"https:\/\/devops.com\/the-cloud-is-booming-but-so-is-cloud-waste\/\">30% of<\/a><a href=\"https:\/\/devops.com\/the-cloud-is-booming-but-so-is-cloud-waste\/\" target=\"_blank\" rel=\"noreferrer noopener\"> <\/a><a href=\"https:\/\/devops.com\/the-cloud-is-booming-but-so-is-cloud-waste\/\">cloud spend is \u201cwasted\u201d<\/a> each year \u2014 some $17.6 billion. For modern data pipelines in the cloud, the percentage of waste is significantly higher, estimated at closer to 50%.<\/p>\n<p>It\u2019s not hard to understand how we got here. Public cloud services like AWS and GCP have made it easy to spin resources up and down at will, as they\u2019re needed. Having unfettered access to a \u201climitless\u201d pool of computing resources has truly transformed how businesses create new products and services and bring them to market.\u00a0<\/p>\n<p>For modern data teams, this \u201cdemocratization of IT\u201d facilitated by the public cloud has been a game-changer. For one thing, it\u2019s enabled them to be far more agile as they don\u2019t need to negotiate and justify a business case with the IT department to buy or repurpose a server in the corporate data center. And as an operational expenditure, the pay-by-the-drip model of the cloud makes budget planning seem more flexible.<\/p>\n<p>However, the ease with which we can spin up a cloud instance doesn\u2019t come without a few unintentional consequences \u2014 forgotten workloads, over-provisioned or underutilized resources \u2014 with results including spiraling and unpredictable costs. Near-infinite cloud resources make it easy to simply throw additional compute resources at inefficient queries.<\/p>\n<p>The practice of FinOps has emerged in part as a response to this democratization of IT. The unifying principle of FinOps is that by bringing finance, engineering and business teams together to make better decisions around cost and performance, they will act in a more responsible manner \u2014 provided they have access to the right data to inform their decision-making.<\/p>\n<p>According to the <a href=\"https:\/\/data.finops.org\/\">2022 State of FinOps <\/a><a href=\"https:\/\/data.finops.org\/\" target=\"_blank\" rel=\"noreferrer noopener\">report<\/a>, the biggest challenge facing organizations trying to establish a FinOps culture is \u201cgetting engineers to take action on cost optimization.\u201d The authors go on to say that with so many data projects on their backlog and nearly unlimited cloud resources at their disposal, it\u2019s understandable that data engineers naturally prioritize new data pipeline creation and timely data delivery over resource optimization.<\/p>\n<p>While this is sound advice, this type of generalized guidance glosses over just how difficult a task this can be, and begs the question: How can data engineers be accountable if they can\u2019t capture accurate and easy-to-understand metrics about actual usage requirements? Moreover, how do you encourage this type of accountability without sacrificing cloud agility?<\/p>\n<h2 id=\"h-empowering-data-teams-via-feedback-loops\">Empowering data teams via feedback loops<\/h2>\n<p>One powerful mechanism to change behavior is providing people with information about their actions in real time so they can alter their behavior accordingly. This is the fundamental premise of a positive feedback loop.<\/p>\n<p>For instance, think about the black box that is residential electricity consumption. Few of us have real-time access to utility pricing or a sense of how much it really costs us to run a household appliance. But connect a smart meter to an outlet and suddenly you can just look at an app on your phone and understand at a much more granular level precisely how much energy each device that\u2019s plugged in is using and therefore what it\u2019s costing you.<\/p>\n<p>It\u2019s also important to consider the role that behavior theory and incentives play in shaping how we make decisions. In the context of cloud consumption, the incentives at work for a data engineer are quite different from those of the finance director. The data engineer is primarily motivated by and held accountable to metrics related to performance and reliability. They want to know: Are my applications running reliably, on time, every time?<\/p>\n<p>In the engineer\u2019s calculus, they\u2019ve become conditioned to overestimate the resources an application might require rather than having to \u201cguesstimate\u201d their perceived capacity requirements. It\u2019s not that they are intentionally over-provisioning resources; rather, they simply don\u2019t know exactly how many or what size resources are actually needed, so they guess, erring on the side of too much rather than too little.<\/p>\n<p>In order for engineers to take action on cost optimization, they need to be given the granular-level usage details that enable them to make informed and defensible choices \u2014 and do so without worrying that they will fall short on their service-level obligations.<\/p>\n<p>Getting at this information, however, is anything but easy. The data pipelines that feed modern data apps are enormously complex and the sheer size and scale of the data workloads only amplifies the challenge of identifying cost-saving opportunities.<\/p>\n<h2>A flight path to cloud usage observability<\/h2>\n<p>This is the problem that full stack observability, informed by <a href=\"https:\/\/venturebeat.com\/2022\/06\/15\/what-is-artificial-intelligence\/\" target=\"_blank\" rel=\"noreferrer noopener\">AI<\/a> algorithms and <a href=\"https:\/\/venturebeat.com\/ai\/artificial-intelligence-ai-vs-machine-learning-ml-key-comparisons\/\" target=\"_blank\" rel=\"noreferrer noopener\">machine learning<\/a> models, was designed to address. There are a several ways in which the deep visibility that observability enables can help data teams more fully understand their usage costs and nudge their behavior to become more cost-conscious. <\/p>\n<ul>\n<li><strong>Start at the job level: <\/strong>While most cloud cost control measures take a top-down approach that gives a bird\u2019s-eye aggregated view of spending, they don\u2019t really help users identify exactly where the cost-saving opportunities lie. Controlling cloud costs starts at the job level, as there are typically thousands of jobs running on more expensive instances than necessary. Without deep visibility into the actual resource requirements of each job over time, data teams are just guessing as to what they think they will need.<\/li>\n<\/ul>\n<ul>\n<li><strong>Enable showback to align IT value with cost<\/strong>: To help connect the dots between what data teams are consuming and what they are spending, a growing number of organizations are using observability to generate showback and\/or chargeback reports \u2014 itemized bills of materials that show precisely who is consuming what resource and what it costs. With this type of intelligence, cost allocations can be put into a context that makes sense to all \u2014 whether that\u2019s breaking down costs by department, team, project or application all the way down to the individual job or user level.\u00a0<\/li>\n<\/ul>\n<ul>\n<li><strong>Provide users with prescriptive recommendations<\/strong>: \u200b\u200bIt\u2019s not enough to simply throw a bunch of charts and metrics at engineers and expect them to puzzle everything out to make the right choices. Instead they need to be served up actionable and <a href=\"https:\/\/venturebeat.com\/data-infrastructure\/what-is-data-analytics-definition-models-life-cycle-and-application-best-practices\/\" target=\"_blank\" rel=\"noreferrer noopener\">prescriptive<\/a> recommendations that tell them in plain English precisely what steps they should take. This level of self-service will empower engineers to make more cost-effective decisions on their own so they can take individual responsibility and be held accountable for their cloud usage.<\/li>\n<\/ul>\n<p>One of the enduring lessons from the Tragedy of the Commons analogy is that when everyone is responsible, no one is responsible. It\u2019s not enough to tell stakeholders to be accountable; you need to provide them with the tools, insights and incentives that are needed to change their behavior.<\/p>\n<p><em><a href=\"https:\/\/www.linkedin.com\/in\/clintonwford\/\">Clinton Ford<\/a> is DataOps champion at <a href=\"http:\/\/www.unraveldata.com\">Unravel Data<\/a><\/em>.<\/p>\n<div id=\"boilerplate_2736392\">\n<h3 id=\"h-datadecisionmakers\">DataDecisionMakers<\/h3>\n<p>Welcome to the VentureBeat community!<\/p>\n<p>DataDecisionMakers is where experts, including the technical people doing data work, can share data-related insights and innovation.<\/p>\n<p>If you want to read about cutting-edge ideas and up-to-date information, best practices, and the future of data and data tech, join us at DataDecisionMakers.<\/p>\n<p>You might even consider\u00a0<a rel=\"noreferrer noopener\" target=\"_blank\" href=\"https:\/\/venturebeat.com\/contribute-to-datadecisionmakers\/\">contributing an article<\/a>\u00a0of your own!<\/p>\n<p><a rel=\"noreferrer noopener\" href=\"https:\/\/venturebeat.com\/category\/DataDecisionMakers\/\" target=\"_blank\">Read More From DataDecisionMakers<\/a><\/p>\n<\/div><\/div>\n<\/p><\/div>\n<p><a href=\"https:\/\/venturebeat.com\/data-infrastructure\/3-ways-data-teams-can-avoid-a-tragedy-of-the-cloud-commons\/\" class=\"button purchase\" rel=\"nofollow noopener\" target=\"_blank\">Read More<\/a><br \/>\n Clinton Ford, Unravel Data<\/p>\n","protected":false},"excerpt":{"rendered":"<p>March 11, 2023 8:20 AM Image Credit: gremlin\/Getty In 1833, British economist William Forster Lloyd coined the term \u201cTragedy of the Commons\u201d to describe a situation in which individual users who have open access to a collective resource, unimpeded by formal rules that govern access and use, will act according to their own self-interest and<\/p>\n","protected":false},"author":1,"featured_media":617006,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[22402,22152,46],"tags":[],"class_list":{"0":"post-617005","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-avoid","8":"category-teams","9":"category-technology"},"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/617005","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/comments?post=617005"}],"version-history":[{"count":0,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/617005\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media\/617006"}],"wp:attachment":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media?parent=617005"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/categories?post=617005"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/tags?post=617005"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}