I have to say AWS Trusted Advisor is a great tool! AWS keeps improving it by adding useful new checks. Here is one that I got this morning: I set up health checks for some new records, but forgot to lower the TTL (it is 300 seconds by default). Now Trusted Advisor reminds … Continue reading Great AWS Trusted Advisor
Category: Monitoring
Elasticache Redis Unreachable Issue
We have an ElastiCache Redis replication group with two nodes: one primary and one replica. Last week, we noticed that the primary Redis node had suddenly stopped working: every connection to it eventually timed out. According to the log, there was a load burst, after which Redis rebooted itself. Unfortunately, … Continue reading Elasticache Redis Unreachable Issue
Run query via Sumologic API
Sumologic queries can also be run via the API. Here is a bash example that I wrote to get the nginx access logs. By default it searches the logs from the past 10 minutes, but you can override that by adding a parameter. Here is the script. It can also be found on my GitHub. #!/bin/bash … Continue reading Run query via Sumologic API
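The excerpt above is cut off before the script itself, so what follows is not the author's code. It is a minimal sketch of the idea described: a 10-minute default search window that a parameter can override. It assumes GNU `date`, and the function names and the sample query are hypothetical. The field names (`query`, `from`, `to`, `timeZone`) follow the Sumo Logic Search Job API request body.

```shell
#!/bin/bash
# Sketch only: builds the JSON body for a search job request.
# Function names are illustrative, not taken from the original script.

# Print a UTC ISO 8601 timestamp for N minutes ago (GNU date assumed).
iso_minutes_ago() {
  date -u -d "${1} minutes ago" +%Y-%m-%dT%H:%M:%S
}

# Build the request body.
#   $1 = query string (must not contain double quotes in this sketch)
#   $2 = look-back window in minutes, defaults to 10
build_payload() {
  local query="$1"
  local minutes="${2:-10}"
  printf '{"query":"%s","from":"%s","to":"%s","timeZone":"UTC"}\n' \
    "$query" "$(iso_minutes_ago "$minutes")" "$(iso_minutes_ago 0)"
}
```

The payload could then be posted with something like `curl -s -u "$ACCESS_ID:$ACCESS_KEY" -H 'Content-Type: application/json' -X POST -d "$(build_payload '_sourceCategory=nginx/access' 30)" https://api.sumologic.com/api/v1/search/jobs`, where the credentials variables and the source category are placeholders and the API host depends on your deployment.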
Troubleshoot high CPU usage java process
This is a real troubleshooting example from yesterday, for a Java application with high CPU usage. The application uses Tomcat and runs on AWS EC2. Log in to the box and switch to the root user so you can see all users' processes.
sudo su -
Install htop if you have not installed it before, then run it. … Continue reading Troubleshoot high CPU usage java process
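The excerpt stops before the remaining steps, so the following is not necessarily what the full post does. It sketches one common follow-up: once htop (or `top -H`) shows which thread is burning CPU, convert that thread ID to hex and look it up in a `jstack` thread dump, where it appears as `nid=0x...`. The helper name `tid_to_nid` is hypothetical.

```shell
#!/bin/bash
# Sketch only: convert a decimal Linux thread ID into the hex "nid"
# form that jstack prints for each Java thread.
tid_to_nid() {
  printf '0x%x\n' "$1"
}

# Typical use (the PID and TID are placeholders you read from htop):
#   jstack <pid> | grep -A 20 "nid=$(tid_to_nid <tid>)"
```

For example, `tid_to_nid 12345` prints `0x3039`, which is the value to search for in the thread dump.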
Send VPC flow logs to Sumologic
VPC Flow Logs is an AWS feature that allows you to capture the inbound and outbound traffic of all network interfaces in your VPC. To get the information, you have to go to CloudWatch and read the raw logs line by line, which is not very efficient. If you happen to use Sumologic, you can … Continue reading Send VPC flow logs to Sumologic