Smart Stop (AWS)
Overview
Smart Stop is an advanced feature of the Autofix module designed to eliminate "silent waste" in cloud environments. Unlike conventional fixed-schedule automation, Smart Stop monitors the actual state and utilization of resources to execute the automatic shutdown of idle workloads, especially in development, staging, and testing environments.
Business Objective
The primary focus is Utilization Efficiency. Smart Stop resolves unnecessary costs associated with:
Active instances outside business hours or on weekends.
Test environments forgotten after the completion of development cycles.
Resources with continuous low utilization that do not justify the cost of remaining powered on.
Operating Modes
Smart Stop offers two main approaches for resource management:
1. Only Stop (Filter-Based)
Shuts down AWS resources based on specific selection criteria (Tags, IDs, or Names). If the resource matches the configured filters, it is turned off to ensure compliance with the organization's policy.
Supported Resources: EC2 instances, RDS instances/clusters, and Auto Scaling Groups (in this case, Min, Max, and Desired capacities are reduced to 0).
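As a rough illustration of how Only Stop treats each supported resource type, the sketch below maps a matched resource to the action described above. The names plan_only_stop and AsgCapacity, and the shape of the returned dictionary, are hypothetical and are not part of the Pier platform.

```python
from dataclasses import dataclass


@dataclass
class AsgCapacity:
    """Capacity settings of an Auto Scaling Group (hypothetical helper type)."""
    min_size: int
    max_size: int
    desired: int


def plan_only_stop(resource_type, capacity=None):
    """Return the action an Only Stop run would take for one matched resource.

    Assumption: resource types are identified by the strings "ec2", "rds", "asg".
    """
    if resource_type in ("ec2", "rds"):
        # EC2 instances and RDS instances/clusters are simply stopped.
        return {"action": "stop"}
    if resource_type == "asg":
        if capacity is None:
            raise ValueError("an Auto Scaling Group needs its current capacity")
        # Min, Max, and Desired are all reduced to 0; the previous values are
        # kept so that the before/after state can be recorded.
        return {
            "action": "scale_to_zero",
            "before": capacity,
            "after": AsgCapacity(0, 0, 0),
        }
    raise ValueError(f"unsupported resource type: {resource_type}")
```

Keeping the "before" capacity alongside the "after" capacity mirrors the before/after states that the history is described as recording.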
2. Metric Analyzer (Utilization-Based)
Analyzes real performance before taking action. The resource is only shut down if the maximum CPU utilization during the configured period is below the defined threshold.
Shutdown Requirements: The instance must be running, must have existed for longer than the configured analysis period, and must not have protection tags or active Overrides.
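The shutdown requirements above can be sketched as a single eligibility check. This is a minimal illustration, not the platform's implementation: the Instance type, its field names, and the decision to take no action when no CPU data is available are all assumptions. Only the tag name comes from this document.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta

# Protection tag named in this document.
SKIP_TAG = "PIER_CLOUD_SMART_STOP_SKIP_RESOURCE"


@dataclass
class Instance:
    """Hypothetical view of a monitored instance."""
    state: str
    created_at: datetime
    cpu_samples: list  # CPU utilization percentages over the analysis period
    tags: dict = field(default_factory=dict)
    has_active_override: bool = False


def should_stop(instance, now, period, cpu_threshold):
    """Return True only if every Metric Analyzer requirement is met."""
    if instance.state != "running":
        return False
    # The instance must be older than the analysis period.
    if now - instance.created_at <= period:
        return False
    # Protection tag and active Overrides always win.
    if SKIP_TAG in instance.tags or instance.has_active_override:
        return False
    # Assumption: with no samples there is nothing to analyze, so do nothing.
    if not instance.cpu_samples:
        return False
    # The maximum CPU in the period must be below the threshold.
    return max(instance.cpu_samples) < cpu_threshold
```

Using the maximum rather than the average means a single busy interval inside the period is enough to keep the instance running.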
Control and Protection Mechanisms
Override (Temporary Restart)
Allows a resource shut down by Smart Stop to be temporarily restarted for a determined period.
The user defines the start time and the duration of the Override.
The system starts the resource at the scheduled time.
At the end of the period, the system executes the automatic shutdown again, recording the entire cycle in the history.
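The Override cycle described above (start at the scheduled time, stop again when the period ends) can be sketched as a small time-window check. The Override type and the override_action function are hypothetical names used for illustration, and treating the end of the window as exclusive is an assumption.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Override:
    """User-defined Override: a start time plus a duration."""
    start: datetime
    duration: timedelta

    @property
    def end(self):
        return self.start + self.duration

    def is_active(self, now):
        # Assumption: the window includes its start and excludes its end.
        return self.start <= now < self.end


def override_action(override, now, is_running):
    """Return "start", "stop", or "none" for the resource at time `now`."""
    if override.is_active(now):
        # Inside the window the resource should be running.
        return "none" if is_running else "start"
    if now >= override.end and is_running:
        # The window has ended, so the resource is shut down again.
        return "stop"
    return "none"
```

For an Override that starts at 09:00 and lasts two hours, this check yields "start" at 09:00 for a stopped resource and "stop" at 11:00 for a running one.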
Resource Protection (Safety Net)
To prevent the accidental shutdown of critical resources, the system respects the following rules:
Exception Tag: Any resource with the PIER_CLOUD_SMART_STOP_SKIP_RESOURCE tag will be permanently ignored by the automation.
Override Priority: Resources with an active Override are ignored by the standard shutdown cycle to avoid operational conflicts.
Configuration and Execution
How to Configure
In the Autofix → Smart Stop module, the process follows three steps:
Scope Definition: Selection of accounts, regions, and filters (Tags, ID, or Name).
Idleness Criteria: Definition of the CPU threshold and minimum inactivity period.
Activation: Activation via status toggle to begin monitoring.
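To make the three configuration steps concrete, the sketch below models a rule as a plain data structure with a basic validation pass. The SmartStopRule type, its field names, and the validation rules are hypothetical; the platform is configured through its interface, and this only illustrates which pieces of information a rule carries.

```python
from dataclasses import dataclass


@dataclass
class SmartStopRule:
    """Hypothetical in-memory representation of one Smart Stop rule."""
    name: str
    # Step 1: scope definition
    accounts: list
    regions: list
    filters: list
    # Step 2: idleness criteria
    cpu_threshold_percent: float
    inactivity_period_hours: int
    # Step 3: activation (status toggle)
    enabled: bool = False

    def validate(self):
        """Return a list of problems; an empty list means the rule is usable."""
        errors = []
        if not self.accounts:
            errors.append("select at least one account")
        if not self.regions:
            errors.append("select at least one region")
        if not 0 < self.cpu_threshold_percent <= 100:
            errors.append("CPU threshold must be between 0 and 100")
        if self.inactivity_period_hours <= 0:
            errors.append("inactivity period must be positive")
        return errors
```

A rule is created disabled by default here, which matches the flow in which activation is an explicit final step.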
Frequency and Audit
Verification: The system analyzes resources every 1 minute.
History: All actions (successes, failures, ASG capacity changes, and before/after states) are recorded with detailed logs for technical auditing.
FinOps Impact
The implementation of Smart Stop directly reflects on the financial health of the cloud operation:
Waste Reduction: Immediate cut in unused Compute costs.
Accountability Culture: Encourages engineering teams to maintain lean and tagged environments.
Continuous Optimization: Replaces manual audits with automated and resilient governance.
Best Practices
Start by applying Smart Stop in non-production environments (Dev/Sandbox).
Use tag-based scoping to ensure production resources are not affected.
Monitor "Monthly Recurring Savings" metrics generated by the first executions before expanding to the entire organization.
For use and access in the Pier platform:
Smart Stop is an Autofix feature that automates the shutdown of existing resources in the environment, locating them through key and value filters.
Smart Stop supports the following AWS services: EC2, RDS, and Auto Scaling Groups.
Accessing Smart Stop
In the Autofix side menu, select Smart Stop.

Select the EC2, RDS, or Auto Scaling Group service:

Creating an action
On the main screen, click the "Add" button.

Properties
The system will open a screen to create the action
Add a name to the Smart Stop action:

Set the Smart Stop type:


Metric Analyzer: Sets metrics so that Smart Stop can monitor resources and stop those that fall outside the configured metric.
Only Stop: Stops the selected resources without using specific metrics.
Define the regions:

The system allows you to select more than one region.

Define accounts:
The system allows you to select more than one account.

Filters
By clicking the next button, the system will advance to the filters screen, where it is possible to add the desired filters.

Filters define how Smart Stop locates resources by key and value.
In "Filter Type", the resource can be located in three ways: tag, resource name, or resource ID:

Set the expression, either CONTAINS or NOT CONTAIN:

In the key and value fields, enter the data that identifies the resource (for example, the resource ID) so it can be found.

After the filter is added, it is displayed on the screen.
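A filter combining a type, an expression, and a key/value pair can be sketched as below. The Resource and Filter types are hypothetical, and two behaviours are assumptions rather than documented facts: CONTAINS is treated as a substring match, and a resource without the tag key counts as not containing the value.

```python
from dataclasses import dataclass, field


@dataclass
class Resource:
    """Hypothetical view of a resource as seen by the filter step."""
    resource_id: str
    name: str
    tags: dict = field(default_factory=dict)


@dataclass
class Filter:
    filter_type: str  # "tag", "resource_name", or "resource_id"
    expression: str   # "CONTAINS" or "NOT CONTAIN"
    value: str
    key: str = ""     # tag key; only used when filter_type is "tag"


def matches(resource, flt):
    """Return True if the resource satisfies the filter."""
    if flt.filter_type == "tag":
        candidate = resource.tags.get(flt.key)
    elif flt.filter_type == "resource_name":
        candidate = resource.name
    elif flt.filter_type == "resource_id":
        candidate = resource.resource_id
    else:
        raise ValueError(f"unknown filter type: {flt.filter_type}")

    # Assumption: substring match; a missing tag never "contains" the value.
    found = candidate is not None and flt.value in candidate
    if flt.expression == "CONTAINS":
        return found
    if flt.expression == "NOT CONTAIN":
        return not found
    raise ValueError(f"unknown expression: {flt.expression}")
```

A NOT CONTAIN filter on a value such as "prod" is one way to express the best practice of keeping production resources out of scope.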

Metrics
By clicking the next button, the system will advance to the Metrics screen.
Set the metric that the Smart Stop action will check, for example vCPU usage:

Note: Smart Stop will monitor the vCPU usage of resources.
Set the period during which vCPU usage must remain below the defined value for Smart Stop to act:

Set the "maximum percentage less than" value, which is the vCPU usage threshold below which Smart Stop will stop the resource:

Schedule
By clicking the next button, the system will advance to the Schedule screen.
Set the Smart Stop monitoring schedule:

The execution interval defines how often Smart Stop checks resources to find those outside the defined metric.
Once the rule has been created, its configuration is visible on the Smart Stop home screen, where the rule can be activated or deactivated in the “Status” column.

Edit
Allows the user to edit any of the information configured in the action.

Delete
Allows the user to delete the created action.
