{"id":301,"date":"2021-04-07T09:59:17","date_gmt":"2021-04-07T09:59:17","guid":{"rendered":"https:\/\/blog.spike.sh\/2021\/04\/07\/introduction-to-slo-and-sla\/"},"modified":"2025-06-06T14:06:26","modified_gmt":"2025-06-06T08:36:26","slug":"introduction-to-slo-and-sla","status":"publish","type":"post","link":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/","title":{"rendered":"Introduction to SLO, SLI and SLA"},"content":{"rendered":"\n<nav aria-label=\"Table of Contents\" class=\"wp-block-table-of-contents\"><ol><li><a class=\"wp-block-table-of-contents__entry\" href=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#slo-and-sla\">SLO and SLA<\/a><\/li><li><a class=\"wp-block-table-of-contents__entry\" href=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#why-set-slos\">Why set SLOs?<\/a><\/li><li><a class=\"wp-block-table-of-contents__entry\" href=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#why-your-slo-targets-should-not-be-100\">Why your SLO targets should not be 100%<\/a><\/li><li><a class=\"wp-block-table-of-contents__entry\" href=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#setting-and-measuring-slos\">Setting and measuring SLOs<\/a><\/li><li><a class=\"wp-block-table-of-contents__entry\" href=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#tracking-slos\">Tracking SLOs<\/a><\/li><li><a class=\"wp-block-table-of-contents__entry\" href=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#slo-ownership\">SLO ownership<\/a><\/li><li><a class=\"wp-block-table-of-contents__entry\" href=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#conclusion\">Conclusion<\/a><\/li><\/ol><\/nav>\n\n\n\n<p class=\"wp-block-paragraph\">When you start researching how to improve the reliability of your software, you will soon run into terms like SLOs and SLAs. It can sound intimidating, but it&#8217;s quite straightforward to understand. In this post, we will introduce these terms, the differences between them and how to start using them to make your systems more reliable.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"slo-and-sla\">SLO and SLA<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">SLO (Service Level Objective) for a service is a reliability target that teams want to achieve (e.g. 99.9% uptime for APIs). SLOs are set for internal teams and they are different from SLA (Service Level Agreement) that the business team will sign with customers, along with some penalties that may be payable if these SLA targets are not met. If you have SLAs with your customers, your SLOs will be derived from that.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"why-set-slos\">Why set SLOs?<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Having clear SLOs can make it easy to take decisions about reliability of your systems. Otherwise, you will never have the data to help prioritise engineering team effort between features and reliability related tasks. SLOs also give all stakeholders a clear understanding of the performance of their systems. <\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"why-your-slo-targets-should-not-be-100\">Why your SLO targets should not be 100%<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Your SLOs should be aligned with customer satisfaction. They should be set at a level that your customers will be satisfied with your product. The initial temptation for teams is to set SLO targets at 100% e.g. Website uptime should be 100%, APIs should meet their target response time 100% of the time etc. But you should avoid setting SLOs at 100%, because it is not possible to achieve. If you set SLOs at 100%, all your engineering effort will be spent on making sure that your services never go down. This will slow down your feature velocity and lead to customer unhappiness.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The following table of nines might be helpful to understand what your SLOs should be. Every extra nine that you add to the SLO will have additional cost associated with it, while most customers might never notice it.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><!--kg-card-begin: markdown--><\/p>\n\n\n\n<figure class=\"wp-block-table\"><table class=\"has-fixed-layout\"><thead><tr><th>SLO %<\/th><th>Nines<\/th><th>Downtime per month<\/th><\/tr><\/thead><tbody><tr><td>99 %<\/td><td>2 nines<\/td><td>7.2 hours<\/td><\/tr><tr><td>99.9 %<\/td><td>3 nines<\/td><td>43.2 minutes<\/td><\/tr><tr><td>99.99 %<\/td><td>4 nines<\/td><td>4.32 minutes<\/td><\/tr><tr><td>99.999 %<\/td><td>5 nines<\/td><td>25.9 seconds<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\"><!--kg-card-end: markdown--><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"setting-and-measuring-slos\">Setting and measuring SLOs<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">To understand what parts of the your system should SLOs cover, you should think about what do users care about more in your service. e.g. If you are an email provider like Gmail, being available all the time is very important so your SLOs should cover availability. If you are an online multiplayer game, users really care about lag between their actions and the impact it has on the game, so your latency based SLOs will be very important.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">You can check whether you are meeting your SLOs by checking the SLIs. Service Level Indicator (SLI) is a measure of your service. Depending on what metric your SLO covers, you will have a corresponding SLI which calculates that metric. The type of SLI tells you what aspect of the service are you measuring. The different types are &#8211; availability, latency, correctness, freshness, quality, coverage and durability.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Here are the different aspects of your service that you can measure &#8211;<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Availability<\/strong>: This is one of the most popular SLAs because users really care about a service being available any time that they want to use it. The SLI here can be [(number of successful requests \/ number of total requests) * 100]. You can measure this using uptime monitoring services like <a href=\"https:\/\/www.checklyhq.com\/\">Checkly<\/a>.<\/li>\n\n\n\n<li><strong>Latency<\/strong>: This covers the speed of your service and is an important one for many types of products. If your product is slow, you will start losing customers and revenue. Example SLO can be that 99% of website page requests should be loaded in less than 100ms. You can measure latency using performance monitoring tools like <a href=\"https:\/\/www.datadoghq.com\/\">Datadog<\/a>.<\/li>\n\n\n\n<li><strong>Quality<\/strong>: When your services are overloaded, you should still make the service available to users in degraded state. You should measure how many user interactions resulted in lower quality responses. E.g. If Instagram is facing overload, it can decide to show images in lower resolution.<\/li>\n\n\n\n<li><strong>Freshness<\/strong>: This metric measures the \u201crecency\u201d of the information accessed by the user. E.g. If your application updates reports data only every hour, \u00a0then user could be accessing older data when they check the reports.<\/li>\n\n\n\n<li><strong>Correctness<\/strong>: If you have a service which takes in data and performs computations on it, then correctness will measure the number of times the output is correct based on the input.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"tracking-slos\">Tracking SLOs<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">SLOs should be tracked over a time window, which can be a calendar time (1 month) or rolling window (last 4 weeks). Shorter windows will allow you to respond to SLO violations faster. This SLO data can help you make decisions about task prioritisation and resource allocation. It will also help you with deciding the different aspects of your <a href=\"https:\/\/spike.sh\/blog\/introduction-on-call-schedule\/\">on-call schedules<\/a> like coverage, escalation times etc.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"slo-ownership\">SLO ownership<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">It&#8217;s important that reliability become a team effort and so maintaining SLOs should not only be the responsibility of development or operations team members only. Doing this will help you create a blameless culture so team members are empowered to speak up and point out real problems in achieving your reliability goals.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"conclusion\">Conclusion<\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">I hope this post helps you get started with SLOs to make your systems more reliable. To understand how to set up alerting so you don&#8217;t miss your SLOs and SLAs, sign up at <a href=\"https:\/\/spike.sh\">Spike.sh<\/a> or email us at <a href=\"mailto:hello@spike.sh\">hello@spike.sh.<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>When you start researching how to improve the reliability of your software, you will soon run into terms like SLOs and SLAs. It can sound intimidating, but it&#8217;s quite straightforward to understand. In this post, we will introduce these terms, the differences between them and how to start using them to make your systems more [&hellip;]<\/p>\n","protected":false},"author":263547074,"featured_media":874,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_crdt_document":"","_import_markdown_pro_load_document_selector":0,"_import_markdown_pro_submit_text_textarea":"","_lmt_disableupdate":"","_lmt_disable":"","_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2},"_wpas_customize_per_network":false,"jetpack_post_was_ever_published":false},"categories":[1428],"tags":[],"class_list":["post-301","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-incident-management"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Introduction to SLO, SLI and SLA<\/title>\n<meta name=\"description\" content=\"Understand SLO, SLI and SLA\u2014key concepts for measuring, tracking, and improving software reliability and customer satisfaction.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Introduction to SLO, SLI and SLA\" \/>\n<meta property=\"og:description\" content=\"Understand SLO, SLI and SLA\u2014key concepts for measuring, tracking, and improving software reliability and customer satisfaction.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/\" \/>\n<meta property=\"og:site_name\" content=\"Spike&#039;s blog\" \/>\n<meta property=\"article:published_time\" content=\"2021-04-07T09:59:17+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-06-06T08:36:26+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/blog.spike.sh\/wp-content\/uploads\/2021\/04\/c.png\" \/>\n\t<meta property=\"og:image:width\" content=\"2400\" \/>\n\t<meta property=\"og:image:height\" content=\"960\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Pruthvi\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Pruthvi\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/\"},\"author\":{\"name\":\"Pruthvi\",\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/#\\\/schema\\\/person\\\/2c9fa677c459b8f4fb26f1a02b90b5ec\"},\"headline\":\"Introduction to SLO, SLI and SLA\",\"datePublished\":\"2021-04-07T09:59:17+00:00\",\"dateModified\":\"2025-06-06T08:36:26+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/\"},\"wordCount\":917,\"commentCount\":0,\"image\":{\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/blog.spike.sh\\\/wp-content\\\/uploads\\\/2021\\\/04\\\/c.png\",\"articleSection\":[\"Incident Management\"],\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/\",\"url\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/\",\"name\":\"Introduction to SLO, SLI and SLA\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/blog.spike.sh\\\/wp-content\\\/uploads\\\/2021\\\/04\\\/c.png\",\"datePublished\":\"2021-04-07T09:59:17+00:00\",\"dateModified\":\"2025-06-06T08:36:26+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/#\\\/schema\\\/person\\\/2c9fa677c459b8f4fb26f1a02b90b5ec\"},\"description\":\"Understand SLO, SLI and SLA\u2014key concepts for measuring, tracking, and improving software reliability and customer satisfaction.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/#primaryimage\",\"url\":\"https:\\\/\\\/blog.spike.sh\\\/wp-content\\\/uploads\\\/2021\\\/04\\\/c.png\",\"contentUrl\":\"https:\\\/\\\/blog.spike.sh\\\/wp-content\\\/uploads\\\/2021\\\/04\\\/c.png\",\"width\":2400,\"height\":960},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/introduction-to-slo-and-sla\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/blog.spike.sh\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Introduction to SLO, SLI and SLA\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/#website\",\"url\":\"https:\\\/\\\/blog.spike.sh\\\/\",\"name\":\"Spike&#039;s blog\",\"description\":\"Learnings and opinions in a changing world\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/blog.spike.sh\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/blog.spike.sh\\\/#\\\/schema\\\/person\\\/2c9fa677c459b8f4fb26f1a02b90b5ec\",\"name\":\"Pruthvi\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e9476164464b4c9fb3455f2ee4879aad90f1790dce018e71caeaca2cbd548637?s=96&d=robohash&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e9476164464b4c9fb3455f2ee4879aad90f1790dce018e71caeaca2cbd548637?s=96&d=robohash&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e9476164464b4c9fb3455f2ee4879aad90f1790dce018e71caeaca2cbd548637?s=96&d=robohash&r=g\",\"caption\":\"Pruthvi\"},\"url\":\"https:\\\/\\\/blog.spike.sh\\\/author\\\/pruthvi\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Introduction to SLO, SLI and SLA","description":"Understand SLO, SLI and SLA\u2014key concepts for measuring, tracking, and improving software reliability and customer satisfaction.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/","og_locale":"en_GB","og_type":"article","og_title":"Introduction to SLO, SLI and SLA","og_description":"Understand SLO, SLI and SLA\u2014key concepts for measuring, tracking, and improving software reliability and customer satisfaction.","og_url":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/","og_site_name":"Spike&#039;s blog","article_published_time":"2021-04-07T09:59:17+00:00","article_modified_time":"2025-06-06T08:36:26+00:00","og_image":[{"width":2400,"height":960,"url":"https:\/\/blog.spike.sh\/wp-content\/uploads\/2021\/04\/c.png","type":"image\/png"}],"author":"Pruthvi","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Pruthvi","Estimated reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#article","isPartOf":{"@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/"},"author":{"name":"Pruthvi","@id":"https:\/\/blog.spike.sh\/#\/schema\/person\/2c9fa677c459b8f4fb26f1a02b90b5ec"},"headline":"Introduction to SLO, SLI and SLA","datePublished":"2021-04-07T09:59:17+00:00","dateModified":"2025-06-06T08:36:26+00:00","mainEntityOfPage":{"@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/"},"wordCount":917,"commentCount":0,"image":{"@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#primaryimage"},"thumbnailUrl":"https:\/\/blog.spike.sh\/wp-content\/uploads\/2021\/04\/c.png","articleSection":["Incident Management"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/","url":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/","name":"Introduction to SLO, SLI and SLA","isPartOf":{"@id":"https:\/\/blog.spike.sh\/#website"},"primaryImageOfPage":{"@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#primaryimage"},"image":{"@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#primaryimage"},"thumbnailUrl":"https:\/\/blog.spike.sh\/wp-content\/uploads\/2021\/04\/c.png","datePublished":"2021-04-07T09:59:17+00:00","dateModified":"2025-06-06T08:36:26+00:00","author":{"@id":"https:\/\/blog.spike.sh\/#\/schema\/person\/2c9fa677c459b8f4fb26f1a02b90b5ec"},"description":"Understand SLO, SLI and SLA\u2014key concepts for measuring, tracking, and improving software reliability and customer satisfaction.","breadcrumb":{"@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/"]}]},{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#primaryimage","url":"https:\/\/blog.spike.sh\/wp-content\/uploads\/2021\/04\/c.png","contentUrl":"https:\/\/blog.spike.sh\/wp-content\/uploads\/2021\/04\/c.png","width":2400,"height":960},{"@type":"BreadcrumbList","@id":"https:\/\/blog.spike.sh\/introduction-to-slo-and-sla\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/blog.spike.sh\/"},{"@type":"ListItem","position":2,"name":"Introduction to SLO, SLI and SLA"}]},{"@type":"WebSite","@id":"https:\/\/blog.spike.sh\/#website","url":"https:\/\/blog.spike.sh\/","name":"Spike&#039;s blog","description":"Learnings and opinions in a changing world","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/blog.spike.sh\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":"Person","@id":"https:\/\/blog.spike.sh\/#\/schema\/person\/2c9fa677c459b8f4fb26f1a02b90b5ec","name":"Pruthvi","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/secure.gravatar.com\/avatar\/e9476164464b4c9fb3455f2ee4879aad90f1790dce018e71caeaca2cbd548637?s=96&d=robohash&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/e9476164464b4c9fb3455f2ee4879aad90f1790dce018e71caeaca2cbd548637?s=96&d=robohash&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e9476164464b4c9fb3455f2ee4879aad90f1790dce018e71caeaca2cbd548637?s=96&d=robohash&r=g","caption":"Pruthvi"},"url":"https:\/\/blog.spike.sh\/author\/pruthvi\/"}]}},"modified_by":"Sreekar","jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/blog.spike.sh\/wp-content\/uploads\/2021\/04\/c.png","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/pfMe4Q-4R","jetpack-related-posts":[{"id":3800,"url":"https:\/\/blog.spike.sh\/sla-slo-sli\/","url_meta":{"origin":301,"position":0},"title":"SLA, SLO, and SLI: Understanding the Foundations of Service Reliability","author":"Samyati Mohanty","date":"28th October, 2025","format":false,"excerpt":"SLA, SLO, and SLI are the backbone of service reliability. Discover how these metrics work together, what each one measures, and why your DevOps team depends on them to deliver consistent, trustworthy performance every single day.","rel":"","context":"In &quot;Industry Knowledge&quot;","block_context":{"text":"Industry Knowledge","link":"https:\/\/blog.spike.sh\/category\/industry-knowledge\/"},"img":{"alt_text":"Blog cover titled \"SLA, SLO, and SLI\"","src":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/10\/b-2.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/10\/b-2.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/10\/b-2.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/10\/b-2.png?resize=700%2C400&ssl=1 2x"},"classes":[]},{"id":314,"url":"https:\/\/blog.spike.sh\/sre-role-2021-analysed-30-job-postings\/","url_meta":{"origin":301,"position":1},"title":"What is expected in the SRE role? We analyzed 30 job postings to find out.","author":"Pruthvi","date":"21st September, 2021","format":false,"excerpt":"IntroductionFindingsDeploy and maintain infrastructureDefine and manage SLO, SLI and error budgetsSetup monitoring and alertsBe on call, respond to incidents and conduct post-mortemsBuild tools and automationsOther responsibilitiesConclusion Introduction In 2016, Google released the definitive book on Site Reliability Engineering (SRE) - a practice that had originated in the company to take\u2026","rel":"","context":"In &quot;Industry Insights&quot;","block_context":{"text":"Industry Insights","link":"https:\/\/blog.spike.sh\/category\/industry-insights\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2021\/09\/003.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2021\/09\/003.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2021\/09\/003.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2021\/09\/003.png?resize=700%2C400&ssl=1 2x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2021\/09\/003.png?resize=1050%2C600&ssl=1 3x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2021\/09\/003.png?resize=1400%2C800&ssl=1 4x"},"classes":[]},{"id":3956,"url":"https:\/\/blog.spike.sh\/uptime-vs-availability\/","url_meta":{"origin":301,"position":2},"title":"Uptime vs. Availability: Why the Difference Matters (and How They Shape SLAs)","author":"Samyati Mohanty","date":"5th November, 2025","format":false,"excerpt":"Uptime shows if systems run; availability shows if users can rely on them. This guide explains why availability matters more for SLAs and real-world reliability.","rel":"","context":"In &quot;Industry Knowledge&quot;","block_context":{"text":"Industry Knowledge","link":"https:\/\/blog.spike.sh\/category\/industry-knowledge\/"},"img":{"alt_text":"Blog cover titled \"Uptime vs Availability: Key Differences & SLA Impact\"","src":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/11\/19744.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/11\/19744.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/11\/19744.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/11\/19744.png?resize=700%2C400&ssl=1 2x"},"classes":[]},{"id":2896,"url":"https:\/\/blog.spike.sh\/pagerduty-alternatives-for-incident-management\/","url_meta":{"origin":301,"position":3},"title":"5 Better PagerDuty Alternatives for Incident Management (2026)","author":"Sreekar","date":"19th August, 2025","format":false,"excerpt":"When I reviewed PagerDuty, I found that it gets the core incident management features right: Strong Slack integration, powerful bi-directional Jira sync, service-based on-call handoff notifications, and detailed incident timelines. But you\u2019re here, looking for PagerDuty alternatives, likely for a reason. Maybe its automation is too expensive, or setting up\u2026","rel":"","context":"In &quot;Comparison&quot;","block_context":{"text":"Comparison","link":"https:\/\/blog.spike.sh\/category\/comparison\/"},"img":{"alt_text":"Blog cover image titled \"5 Better PagerDuty Alternatives for Incident Management\"","src":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/08\/background-37-1.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/08\/background-37-1.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/08\/background-37-1.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/08\/background-37-1.png?resize=700%2C400&ssl=1 2x"},"classes":[]},{"id":322,"url":"https:\/\/blog.spike.sh\/what-is-an-incident\/","url_meta":{"origin":301,"position":4},"title":"What the Heck is an Incident?","author":"Kaushik","date":"19th September, 2022","format":false,"excerpt":"Incident management is all about incidents but what the heck is an incident anyway?","rel":"","context":"In &quot;Incident Management&quot;","block_context":{"text":"Incident Management","link":"https:\/\/blog.spike.sh\/category\/incident-management\/"},"img":{"alt_text":"","src":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2022\/09\/INCIDENT-MEME-1-1.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2022\/09\/INCIDENT-MEME-1-1.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2022\/09\/INCIDENT-MEME-1-1.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2022\/09\/INCIDENT-MEME-1-1.png?resize=700%2C400&ssl=1 2x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2022\/09\/INCIDENT-MEME-1-1.png?resize=1050%2C600&ssl=1 3x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2022\/09\/INCIDENT-MEME-1-1.png?resize=1400%2C800&ssl=1 4x"},"classes":[]},{"id":4237,"url":"https:\/\/blog.spike.sh\/jsm-alternatives-for-incident-management\/","url_meta":{"origin":301,"position":5},"title":"5 Better Jira Service Management (JSM) Alternatives for Incident Management (2026)","author":"Sreekar","date":"16th November, 2025","format":false,"excerpt":"OpsGenie is shutting down, and Atlassian recommends JSM. But is it right for you? This post covers 5 better Jira Service Management (JSM) Alternatives for incident management, with detailed comparisons.","rel":"","context":"In &quot;JSM&quot;","block_context":{"text":"JSM","link":"https:\/\/blog.spike.sh\/category\/comparison\/jsm\/"},"img":{"alt_text":"Blog cover titled \"JSM Alternatives for Incident Management\"","src":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/11\/background-45-1.png?resize=350%2C200&ssl=1","width":350,"height":200,"srcset":"https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/11\/background-45-1.png?resize=350%2C200&ssl=1 1x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/11\/background-45-1.png?resize=525%2C300&ssl=1 1.5x, https:\/\/i0.wp.com\/blog.spike.sh\/wp-content\/uploads\/2025\/11\/background-45-1.png?resize=700%2C400&ssl=1 2x"},"classes":[]}],"_links":{"self":[{"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/posts\/301","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/users\/263547074"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/comments?post=301"}],"version-history":[{"count":2,"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/posts\/301\/revisions"}],"predecessor-version":[{"id":875,"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/posts\/301\/revisions\/875"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/media\/874"}],"wp:attachment":[{"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/media?parent=301"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/categories?post=301"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.spike.sh\/wp-json\/wp\/v2\/tags?post=301"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}