Monitoring and alerting checklist

Comprehensive list of monitoring best practices.

Monitor the uptime of websites and APIs
Monitoring your customer facing websites and critical API endpoints can make you aware of serious issues that should be resolved urgently. You should add uptime monitoring to check these endpoints at regular intervals and raise alerts if the endpoints are not responding or sending a wrong response.
Relevant Spike.sh integrations: Pingdom, Uptime Robot, Apex Ping
Monitor the uptime of your databases
Your database is a critical part of your applications and monitoring the uptime of the database is of critical importance.
Relevant Spike.sh integrations: Webhooks
Monitor the disk space utilization
Disk space can often fill up due to growing log files and is especially dangerous because it can be hard to keep track of. You can monitor disk space usage using the guides for AWS, Google Cloud Platform or Microsoft Azure below.
Alternately, you can create a script to check disk space and raise alert via webhook if the disk space utilisation goes above a certain threshold (usually 80-90%). A cron job should execute this script at regular intervals (at least once a day).
Relevant Spike.sh integrations: AWS, Google Cloud Platform, Microsoft Azure, Webhooks
Monitor your infrastructure
Keep track of utilization and load for your infrastructure by monitoring CPU and memory usage. High CPU utilisation can lead to programs slowing down or freezing altogether. High memory utilisation can lead to performance bottlenecks and inability to handle more users on your website and apps. You should also raise alerts when your network I/O usage goes up either due to user load or suspicious network activity.
Relevant Spike.sh integrations: AWS, Google Cloud Platform, Microsoft Azure, Prometheus, Sematext
Keep track of application errors
Keep track of important errors and exceptions in your web and mobile apps which affect your customers. Configure the error monitoring to raise alerts based on the importance and frequency of the errors.
Relevant Spike.sh integrations: Sentry, Bugsnag, Rollbar, Honeybadger
Monitor your cron jobs
Cron jobs form the backbone of your system and keep track of important tasks like DB backups, user data management etc. Cron job failures can often go unnoticed and cause havoc. Keep track of them and raise alerts when necessary.
Relevant Spike.sh integrations: Healthchecks, Cronitor
Monitor your application performance
Poor application performance can lead to a bad user experience and lead to users leaving your website and apps. Monitor the performance of your apps and alert your engineering and ops teams when performance thresholds are crossed.
Relevant Spike.sh integrations: Datadog, New Relic, Prometheus, Sematext, Grafana
Monitor your security alerts
Keep track of security events in your apps before they become security incidents or breaches. Configure your security products to raise alerts when serious security events take place.
Relevant Spike.sh integrations: Webhooks
Monitor your important business functionality
Raise alerts when important business critical features are facing issues. e.g. For e-commerce websites, raise alerts when cart or payment functionality errors arise.
Relevant Spike.sh integrations: Webhooks
Get Phone call, SMS, Slack and Email alerts from Pingdom for just $7/month

It takes only 5 minutes to setup Pingdom integration and start receiveing alerts.

Get started 
  • • No credit card required
  • • Risk free 14-day trial
  • • Cancel anytime
Get Phone call, SMS, Slack and Email alerts from Pingdom for just $7/month

It takes only 5 minutes to setup Pingdom integration and start receiveing alerts.

Get started 
  • • No credit card required
  • • Risk free 14-day trial
  • • Cancel anytime

Integrate seamlessly with all your stack

Each integration is built carefully in-house with focus on suppression, severity, priority, and auto-resolving incident.
Can't see an integration? No problem, we can build one in no time.