A technique used to control how many API requests a user or system can make in a given time window. It's essential for scalability and security.