{"id":596427,"date":"2023-01-11T05:49:47","date_gmt":"2023-01-11T11:49:47","guid":{"rendered":"https:\/\/news.sellorbuyhomefast.com\/index.php\/2023\/01\/11\/how-observability-designed-for-data-teams-can-unlock-the-promise-of-dataops\/"},"modified":"2023-01-11T05:49:47","modified_gmt":"2023-01-11T11:49:47","slug":"how-observability-designed-for-data-teams-can-unlock-the-promise-of-dataops","status":"publish","type":"post","link":"https:\/\/newsycanuse.com\/index.php\/2023\/01\/11\/how-observability-designed-for-data-teams-can-unlock-the-promise-of-dataops\/","title":{"rendered":"How observability designed for data teams can unlock the promise of DataOps"},"content":{"rendered":"<div id=\"boilerplate_2682874\">\n<p><em>Check out all the on-demand sessions from the Intelligent Security Summit <a href=\"https:\/\/avolio.swapcard.com\/intelligentsecuritysummit2022\/registrations\/Start?utm_source=vb&#038;utm_medium=boiler&#038;utm_content=ondemand&#038;utm_campaign=IS22_BoilerPlates\" data-type=\"URL\" data-id=\"https:\/\/avolio.swapcard.com\/intelligentsecuritysummit2022\/registrations\/Start?utm_source=vb&#038;utm_medium=boiler&#038;utm_content=ondemand&#038;utm_campaign=IS22_BoilerPlates\">here<\/a><\/em>.<\/p>\n<hr>\n<\/div>\n<p>These days, it\u2019s no exaggeration to say that every company is a data company. And if they\u2019re not, they need to be. That\u2019s why more organizations are investing in the modern data stack (think: Databricks and Snowflake, Amazon EMR, BigQuery, Dataproc). <\/p>\n<p>However, these new technologies and the increasing business-criticality of their <a href=\"https:\/\/venturebeat.com\/data-infrastructure\/how-development-data-security-operations-can-benefit-the-enterprise\/\" target=\"_blank\" rel=\"noreferrer noopener\">data initiatives<\/a> introduce significant challenges. Not only must today\u2019s data teams deal with the sheer volume of data being ingested on a daily basis from a wide array of sources, but they must also be able to manage and monitor the tangle of thousands of interconnected and interdependent data applications.\u00a0<\/p>\n<p>The biggest challenge comes down to managing the complexity of the intertwined systems that we call the modern data stack. And as anyone who has spent time in the data trenches knows, deciphering data app performance, getting cloud costs under control and mitigating data quality issues is no small task.\u00a0<\/p>\n<p>When something breaks down in these Byzantine <a href=\"https:\/\/venturebeat.com\/data-infrastructure\/data-warehouses-and-lakes-will-merge\/\" target=\"_blank\" rel=\"noreferrer noopener\">data<\/a> pipelines, without a single source of truth to refer back to, the finger-pointing begins with data scientists blaming operations, operations blaming engineering, engineering blaming developers \u2014 and so forth and so on in perpetuity.\u00a0<\/p>\n<div><body><\/p>\n<div id=\"boilerplate_2803147\">\n<h3>Event<\/h3>\n<div>\n<p><span>Intelligent Security Summit On-Demand<\/span><\/p>\n<p><span>Learn the critical role of AI &#038; ML in cybersecurity and industry specific case studies. Watch on-demand sessions today.<\/span><\/p>\n<\/div>\n<p><a href=\"https:\/\/avolio.swapcard.com\/intelligentsecuritysummit2022\/registrations\/Start?utm_source=vb&#038;utm_medium=incontent&#038;utm_content=ondemand&#038;utm_campaign=IS22_InContent\"><br \/>\n                Watch Here            <\/a>\n                        <\/p>\n<\/div>\n<p><\/body><\/p>\n<p>Is it the code? Insufficient infrastructure resources? A scheduling coordination problem? Without a single source of truth for everyone to rally around, everybody uses their own tool, working in silos. And different tools give different answers  \u2014 and untangling the wires to get to the heart of the problem takes hours (even days).<\/p>\n<h2 id=\"h-why-modern-data-teams-need-a-modern-approach\">Why modern data teams need a modern approach<\/h2>\n<p>Data teams today are facing many of the same challenges that software teams once did: A fractured team working in silos, under the gun to keep up with the accelerated pace of delivering more, faster, without enough people, in an increasingly complex environment.\u00a0<\/p>\n<p>Software teams successfully tackled those obstacles via the discipline of DevOps. A big part of what enables DevOps teams to succeed is the observability provided by the new generation of application performance management (APM). Software teams are able to accurately and efficiently diagnose the root cause of problems, work collaboratively from a single source of truth, and enable developers to address problems early on \u2014 before software goes into production \u2014 without having to throw issues over the fence to the <a href=\"https:\/\/venturebeat.com\/datadecisionmakers\/dataops-still-an-unsolved-challenge-for-many-organizations\/\" target=\"_blank\" rel=\"noreferrer noopener\">Ops<\/a> team.\u00a0<\/p>\n<p>So why are data teams struggling when software teams aren\u2019t? They\u2019re using basically the same tools to solve essentially the same problem. <\/p>\n<p>Because, despite the generic similarities, observability for data teams is a completely different animal than observability for software teams.\u00a0<\/p>\n<h2 id=\"h-cost-control-is-critical\">Cost control is critical<\/h2>\n<p>First off, consider that in addition to understanding a data pipeline\u2019s performance and reliability, data teams must also grapple with the question of data quality \u2014 how can they be assured that they are feeding their analytics engines with high-quality inputs? And, as more workloads move to an assortment of public clouds, it\u2019s also vital that teams are able to understand their data pipelines through the lens of cost.<\/p>\n<p>Unfortunately, data teams find it difficult to get the information they need. Different teams have different questions they need answered, and everybody is myopically focused on solving their particular piece of the puzzle, using their own particular tool of choice, and different tools yield different answers.<\/p>\n<p>Troubleshooting issues is challenging. The problem could be anywhere along a highly complex and interconnected application\/pipeline for any one of a thousand reasons. And, while web app <a href=\"https:\/\/venturebeat.com\/data-infrastructure\/introduction-to-observability-what-is-observability-and-why-is-it-important\/\" target=\"_blank\" rel=\"noreferrer noopener\">observability<\/a> tools have their purpose, they were never intended to absorb and correlate the performance details buried within a modern data stack\u2019s components or \u201cuntangle the wires\u201d among a data application\u2019s upstream or downstream dependencies.\u00a0<\/p>\n<p>Moreover, as more data workloads migrate to the cloud, the cost of running data pipelines can quickly spiral out of control. An organization with 100,000-plus data jobs in the cloud has innumerable decisions to make about where, when, and how to run these jobs. And each decision carries a price tag.\u00a0<\/p>\n<p>As organizations cede centralized control over infrastructure, it\u2019s essential for both data engineers and FinOps to understand where the money is going and identify opportunities to reduce\/control costs.<\/p>\n<p>To get fine-grained insight into performance, cost, and data quality, data teams are forced to cobble together information from a variety of tools. And, as organizations scale their data stacks, the vast amount of information (and sources) makes it extraordinarily difficult to see the entirety of the data forest when you\u2019re sitting in the trees.\u00a0<\/p>\n<p>Most of the granular details needed are available \u2014 unfortunately, they\u2019re often hidden in plain sight. Each tool provides some of the information required, but not all. What\u2019s needed is observability that pulls together all these details and presents them in a context that makes sense and speaks the language of data teams. <\/p>\n<p>Observability that is designed from the ground up specifically for data teams allows them to see how everything fits together holistically. And while there is a slew of cloud-vendor-specific, open-source, and proprietary data observability tools that provide details about one layer or system in isolation, ideally, a full-stack observability solution can stitch it all together into a workload-aware context. Solutions that leverage deep AI are further able\u00a0to show not just where and why an issue exists but how it affects other data pipelines \u2014 and, finally, what to do about it.<\/p>\n<p>Just like <a href=\"https:\/\/venturebeat.com\/security\/5-ways-to-secure-devops\/\" target=\"_blank\" rel=\"noreferrer noopener\">DevOps<\/a> observability provides the foundational underpinnings to help improve the speed and reliability of the software development lifecycle, DataOps observability can do the same for the data application\/pipeline lifecycle. But \u2014 \u200aand this is a big <em>but<\/em> \u2014 \u200aDataOps observability as a technology has to be designed from the ground up to meet the different needs of data teams.<\/p>\n<p>DataOps observability cuts across multiple domains:<\/p>\n<ul>\n<li><strong>Data application\/pipeline\/model observability <\/strong>ensures that data analytics applications\/pipelines are running on time, every time, without errors.<\/li>\n<li><strong>Operations observability <\/strong>enables data teams to understand how the entire platform is running end to end, offering a unified view of how everything is working together, both horizontally and vertically.\u00a0<\/li>\n<li><strong>Business observability <\/strong>has two parts: profit and cost.<strong> <\/strong>The first is about ROI and monitors and correlates the performance of data applications with business outcomes. The second part is <strong>FinOps observability<\/strong>, where organizations use real-time data to govern and control their cloud costs, understand where the money is going, set budget guardrails, and identify opportunities to optimize the environment to reduce costs.<\/li>\n<li><strong>Data observability<\/strong> looks at the datasets themselves, running quality checks to ensure correct results. It tracks lineage, usage, and the integrity and quality of data.<\/li>\n<\/ul>\n<p>Data teams can\u2019t be singularly focused because problems in the modern data stack are interrelated. Without a unified view of the entire data sphere, the promise of DataOps will go unfulfilled.<\/p>\n<h2 id=\"h-observability-for-the-modern-data-stack\">Observability for the modern data stack<\/h2>\n<p>Extracting, correlating, and analyzing everything at a foundational layer in a data team\u2013centric, workload-aware context delivers five capabilities that are the hallmarks of a mature DataOps observability function:<\/p>\n<ul>\n<li><strong>End-to-end visibility<\/strong> correlates telemetry data and metadata from across the full data stack to give a unified, in-depth understanding of the behavior, performance, cost, and health of your data and data workflows.\u00a0<\/li>\n<li><strong>Situational awareness<\/strong> puts this aggregated information into a meaningful context.<\/li>\n<li><strong>Actionable intelligence<\/strong> tells you not just what\u2019s happening but why. Next-gen observability platforms go a step further and provide prescriptive AI-powered recommendations on what to do next.<\/li>\n<li>Everything either happens through or enables a <strong>high degree of automation<\/strong>.<\/li>\n<li>This proactive capability is <strong>governance <\/strong>in action, where the system applies the recommendations automatically \u2014 no human intervention is needed.\u00a0<\/li>\n<\/ul>\n<p>As more and more innovative technologies make their way into the modern data stack \u2014 and ever more workloads migrate to the cloud \u2014 it\u2019s increasingly necessary to have a unified DataOps observability platform with the flexibility to comprehend the growing complexity and the intelligence to provide a solution. That\u2019s true DataOps observability.<\/p>\n<p><em>Chris Santiago is VP of solutions engineering for <a href=\"https:\/\/www.unraveldata.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">Unravel<\/a><\/em>. <\/p>\n<div id=\"boilerplate_2736392\">\n<h3 id=\"h-datadecisionmakers\">DataDecisionMakers<\/h3>\n<p>Welcome to the VentureBeat community!<\/p>\n<p>DataDecisionMakers is where experts, including the technical people doing data work, can share data-related insights and innovation.<\/p>\n<p>If you want to read about cutting-edge ideas and up-to-date information, best practices, and the future of data and data tech, join us at DataDecisionMakers.<\/p>\n<p>You might even consider\u00a0<a rel=\"noreferrer noopener\" target=\"_blank\" href=\"https:\/\/venturebeat.com\/contribute-to-datadecisionmakers\/\">contributing an article<\/a>\u00a0of your own!<\/p>\n<p><a rel=\"noreferrer noopener\" href=\"https:\/\/venturebeat.com\/category\/DataDecisionMakers\/\" target=\"_blank\">Read More From DataDecisionMakers<\/a><\/p>\n<\/div><\/div>\n<p><a href=\"https:\/\/venturebeat.com\/enterprise-analytics\/how-observability-designed-for-data-teams-can-unlock-the-promise-of-dataops\/\" class=\"button purchase\" rel=\"nofollow noopener\" target=\"_blank\">Read More<\/a><br \/>\n Chris Santiago, Unravel<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Check out all the on-demand sessions from the Intelligent Security Summit here. These days, it\u2019s no exaggeration to say that every company is a data company. And if they\u2019re not, they need to be. That\u2019s why more organizations are investing in the modern data stack (think: Databricks and Snowflake, Amazon EMR, BigQuery, Dataproc). However, these<\/p>\n","protected":false},"author":1,"featured_media":596428,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[22226,74017,46],"tags":[],"class_list":{"0":"post-596427","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-designed","8":"category-observability","9":"category-technology"},"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/596427","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/comments?post=596427"}],"version-history":[{"count":0,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/596427\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media\/596428"}],"wp:attachment":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media?parent=596427"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/categories?post=596427"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/tags?post=596427"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}