Got a
DevOps horror story
? Tell us about your worst on-call nightmares this
Halloween and get featured!
Click Here
Product
Platform
Reliability AI
Revolutionize Incident Management with Innovative AI Capabilities.
New
Unified Incident Management
Combine on-call and incident response for efficient operations.
Service Reliability Management
Enhance reliability with automation, and analytics.
Workflows
Reduce manual work and resolve incidents faster.
Continuous Learning with AI & ML
Leverage reliability insights to fine-tune systems and protocols.
Enterprise Incident Management
Advanced reliability tools designed for scale.
On-Call Management
Event Intelligence
The latest industry news, updates and info.
Schedules & Escalations
Learn how our customers are making big changes.
Live Call Routing
The latest industry news, updates and info.
Intelligent Noise Reduction
Get up and running on new features and techniques.
Reliability Workflows
SLO & Error Budgets
We’re always looking for talented people. Join our team!
Service Health
We’re always looking for talented people. Join our team!
Incident Analytics & Reliability Insights
We’re always looking for talented people. Join our team!
Incident Response
Enhanced Collaboration
Learn about our story and our mission statement.
Runbooks
News and writings, press releases, and press resources.
Postmortems
We’re always looking for talented people. Join our team!
Status Pages
We’re always looking for talented people. Join our team!
Continuous Learning
Past Incidents
Get up and running on new features and techniques.
Mobile Incident Management
Explore
Experience Effortless Migration with Squadcast
See How
Reliability Automation Platform
Consolidate and automate workflows, while leveraging deep analytics for data-led decisions and continuous improvements.
Overview
We've just released an update!
Check out the all new dashboard view. Pages now load faster.
Changelog
Live Call Routing →
Connect Users with On-Call Engineers Instantly
We’re always looking for talented people. Join our team!
NEW
Bidirectional Integration with Squadcast
FEATURED INTEGRATION
Solutions
By Use Case
SRE and DevOps
IT Ops
Continuous Learning with AI & ML
Leverage reliability insights to fine-tune systems and protocols.
Business Resilience
Customer Experience
By Industry
Technology
The latest industry news, updates and info.
Finance & Banking
Learn how our customers are making big changes.
Retail
Get up and running on new features and techniques.
Service Providers
We’re always looking for talented people. Join our team!
Media & Gaming
We’re always looking for talented people. Join our team!
Explore Squadcast
Workflows
Noise Reduction
Postmortems
Service Health
FEATURED
Unified Incident Management: Merits of Combined On-Call and Incident Response
Integrations
Monitoring
IT Service Management
Collaboration / ChatOps
Developer Tools
View all Integrations
Public API
Webhooks
Terraform
Email
FEATURED
Squadcast and ServiceNow: Streamlining Incident Management with our latest Bidirectional integration
Pricing
Customers
Since the implementation of Squadcast, we’ve managed to reduce the number of incoming alerts from tens of thousands to hundreds, all thanks to the flexible deduplication mechanism. Squadcast brings simplicity and flexibility and has a direct effect on the decrease in alert fatigue and the increase of awareness.
Avner Yaacov
Senior Manager,
Read Case Study
How Charter Leveraged Squadcast to Drive Client Success With Robust Incident Management
Read Case Study
How Squadcast helped Tanner gain system insights and boost team productivity
Read Case Study
View all Case Studies
Community
Developer Resources
Incident Response Tools
Public API
FEATURED
Hear from our customers: Elevating reliability engineering to drive client success.
🏆 Category Leader in IT Alerting - G2 🏆
Resources
Continuous Learning with AI & ML
Leverage reliability insights to fine-tune systems and protocols.
Documentation
Changelog
The latest industry news, updates and info.
Blog
Learn about our story and our mission statement.
Community
Webinars
Learn how our customers are making big changes.
Developers
News and writings, press releases, and press resources.
SRE Best Practices
News and writings, press releases, and press resources.
Incident Response Tools
DevOps Best Practices
Explore Squadcast
Workflows
Noise Reduction
Postmortems
Service Health
Stay updated with the latest in Reliability Automation.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
FEATURED
Squadcast Ranks in the Top 10 Incident Management Tools Report by G2
Log in
Log in
Book a Demo
Start For Free
×
Name *
Email *
Phone Number *
Please fill in all the required fields.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Gigi Sayfan
Author of
"Mastering Kubernetes"
Understanding the landscape of AWS compute
July 10, 2020
SLOs for AWS-based infrastructure
July 8, 2020
Kubernetes Operators for Automated SRE
May 27, 2020
Using observability tools to set SLOs for Kubernetes Applications
April 16, 2020
The Age of Service Mesh
November 28, 2019
Kubernetes Capacity Planning and Autoscaling - Build Reliable Services
July 24, 2019
More Blogs
Get the latest scoop on Reliability insights. Delivered straight to your inbox.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
If you wish to unsubscribe, we won't hold it against you.
Privacy policy
.
Engage with the Squadcast community for effective Incident Response strategies.
Sign up today!
Product
Features
Integrations
Pricing
Mobile Incident Management
Product Demo
COMPARE
PagerDuty Alternative
Opsgenie Alternative
Solutions
SRE Tools
IT Alerting
IT Incident Management
Status Page
Runbooks
How to Reduce MTTR
Modern Incident Response Platform
Incident Postmortems
Company
About Us
Partners
Contact Us
Careers
Support
Getting Started
Submit a Ticket
Service Status
Resources
Blog
Case Studies
Developer Resources
Community
SRE Best Practices
Error Budget Calculator
Pagerduty to Squadcast: Savings Calculator
Privacy Policy
Responsible Disclosure
GDPR
Terms of Use
Security & Compliance
Copyright © Squadcast Inc. 2017-2024
Gigi Sayfan
Author of
"Mastering Kubernetes"
Squadcast way to resolve Incidents
TRY SQUADCAST for Free
schedule a demo
Subscribe to our latest updates
Enter your Email Id
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Understanding the landscape of AWS compute
In the second part of our "SLOs for AWS-based infrastructure" blog , Gigi Sayfan dives deeper into understanding the landscape of AWS compute by using the lens of Kubernetes to compare and contrast & covers in detail setting of SLOs for ECS, EKS, Fargate, and Lambda based services.
Read more
July 10, 2020
SLOs for AWS-based infrastructure
In our latest two-part series blog post, Gigi Sayfan, author of “Mastering Kubernetes”, discusses managing complex infrastructure on AWS with an eye towards SLOs (service level objectives). Though there are many ways to discuss the management of infrastructure, in this two-part series, he covers SLOs for AWS, Observability on AWS, Quotas Limits, and Optimizing cost on AWS and in the second part, he uses the lens of Kubernetes to compare and contrast compute infrastructure on AWS with Kubernetes.
Read more
July 8, 2020
Kubernetes Operators for Automated SRE
Mastering Kubernetes operators can help SRE teams to maintain large scale Kubernetes systems with thousands of services and help SRE teams achieve the goal of automated SRE
Read more
May 27, 2020
Using observability tools to set SLOs for Kubernetes Applications
Gigi Sayfan, author of “Mastering Kubernetes” explores Kubernetes observability tools like Prometheus, Grafana and Jaeger, how to utilize them to set proper SLOs and make sure the service meets its objectives.
Read more
April 16, 2020
The Age of Service Mesh
Ensure maximum observability and optimize your system performance with the latest technology - Service Mesh. In this blog, we explore the age of Service Mesh and how it revolutionizes the way you manage and maintain your system. Learn how it boosts observability and how you can get started with it.
Read more
November 28, 2019
Kubernetes Capacity Planning and Autoscaling - Build Reliable Services
Understand the importance of Kubernetes Capacity Planning and Autoscaling for building efficient and resilient cloud-native applications. Learn how Intent-based Capacity Planning & Autoscaling enables your organization to manage resources in a better way with Kubernetes.
Read more
July 24, 2019
No items found.