easylimit

A simple, precise Python rate limiter with built-in context manager support for hassle-free API throttling.

Features

Simple and Intuitive: Easy-to-use context manager interface
Precise Rate Limiting: Token bucket algorithm ensures accurate timing
Call Tracking: Optional tracking of calls, delays, and efficiency metrics
Thread-Safe: Safe for use in multi-threaded applications
Zero Dependencies: No runtime dependencies, lightweight and fast
Flexible Configuration: Support for any positive rate (calls per second)
Comprehensive Testing: Extensive test suite with 100% coverage

Installation

pip install easylimit

Quick Start

from easylimit import RateLimiter

# Create a rate limiter for 2 calls per second
limiter = RateLimiter(max_calls_per_second=2)

# Use with context manager (recommended)
with limiter:
    make_api_call()

# Or acquire tokens manually
if limiter.try_acquire():
    make_api_call()

# Enable call tracking for monitoring
tracked_limiter = RateLimiter(max_calls_per_second=2, track_calls=True)
with tracked_limiter:
    make_api_call()

print(f"Total calls: {tracked_limiter.call_count}")
print(f"Efficiency: {tracked_limiter.get_efficiency():.1f}%")

Usage Examples

Basic API Rate Limiting

import requests
from easylimit import RateLimiter

# Limit API calls to 2 per second
api_limiter = RateLimiter(max_calls_per_second=2)

def fetch_user_data(user_id):
    with api_limiter:
        response = requests.get(f"https://api.example.com/users/{user_id}")
        return response.json()

# Make multiple API calls - automatically rate limited
for user_id in range(1, 11):
    user_data = fetch_user_data(user_id)
    print(f"User {user_id}: {user_data['name']}")

Handling Multiple APIs with Different Limits

from easylimit import RateLimiter

# Different rate limits for different APIs
github_limiter = RateLimiter(max_calls_per_second=1)    # 1 call/sec
twitter_limiter = RateLimiter(max_calls_per_second=0.5)  # 1 call every 2 seconds

def fetch_github_data():
    with github_limiter:
        # GitHub API call
        pass

def fetch_twitter_data():
    with twitter_limiter:
        # Twitter API call
        pass

Non-blocking Token Acquisi 8000 tion

from easylimit import RateLimiter

limiter = RateLimiter(max_calls_per_second=1)

# Try to acquire a token without blocking
if limiter.try_acquire():
    print("Token acquired, making API call")
    make_api_call()
else:
    print("Rate limit reached, skipping call")

Timeout-based Acquisition

from easylimit import RateLimiter

limiter = RateLimiter(max_calls_per_second=1)

# Wait up to 5 seconds for a token
if limiter.acquire(timeout=5.0):
    print("Token acquired within timeout")
    make_api_call()
else:
    print("Timeout reached, no token available")

Call Tracking and Monitoring

from easylimit import RateLimiter, CallStats

# Enable call tracking with custom history window
limiter = RateLimiter(
    max_calls_per_second=2.0,
    track_calls=True,
    history_window_seconds=1800  # Keep 30 minutes of history
)

# Make some API calls
for i in range(5):
    with limiter:
        make_api_call(f"request_{i}")

# Check call statistics
print(f"Total calls made: {limiter.call_count}")
print(f"Calls in last 5 minutes: {limiter.calls_in_window(300)}")
print(f"Current efficiency: {limiter.get_efficiency(60):.1f}%")

# Get detailed statistics
stats: CallStats = limiter.stats
print(f"Average delay per call: {stats.average_delay_seconds:.3f}s")
print(f"Maximum delay encountered: {stats.max_delay_seconds:.3f}s")
print(f"Overall calls per second: {stats.calls_per_second_average:.2f}")

# Reset tracking data
limiter.reset_call_count()

from easylimit import RateLimiter

# Before: Manual estimation (error-prone)
def process_users_old_way(users):
    total_api_calls = 0
    for user in users:
        if user.get("mfa_status") == "ENABLED":
            total_api_calls += (2 if user.get("email") else 3)
        elif user.get("mfa_status") in ["ALREADY_ENABLED", "DATA_MISMATCH"]:
            total_api_calls += 1
    
    print(f"Estimated API calls: {total_api_calls}")

# After: Automatic tracking (accurate)
def process_users_with_tracking(users):
    limiter = RateLimiter(max_calls_per_second=2.0, track_calls=True)
    
    for user in users:
        with limiter:
            process_user_mfa(user)  # Actual API calls
    
    print(f"Actual API calls made: {limiter.call_count}")
    print(f"Efficiency: {limiter.get_efficiency():.1f}%")

API Reference

RateLimiter

class RateLimiter:
    def __init__(
        self, 
        max_calls_per_second: float = 1.0,
        track_calls: bool = False,
        history_window_seconds: int = 3600
    ) -> None:
        """
        Initialise the rate limiter.
        
        Args:
            max_calls_per_second: Maximum number of calls allowed per second
            track_calls: Enable call tracking (default: False)
            history_window_seconds: How long to keep call history for windowed queries
            
        Raises:
            ValueError: If max_calls_per_second is not positive
        """

Methods

acquire(timeout: Optional[float] = None) -> bool

Acquire a token, blocking if necessary.
- timeout: Maximum time to wait for a token (None for no timeout)
- Returns: True if token was acquired, False if timeout occurred
try_acquire() -> bool

Try to acquire a token without blocking.
- Returns: True if token was acquired, False otherwise
available_tokens() -> float

Get the current number of available tokens.
- Returns: Number of tokens currently available

Call Tracking Properties (when `track_calls=True`)

call_count -> int

Total number of calls made through this rate limiter.
- Returns: Total call count
- Raises: ValueError if call tracking is not enabled
stats -> CallStats

Detailed statistics about calls and timing.
- Returns: CallStats object with comprehensive metrics
- Raises: ValueError if call tracking is not enabled

Call Tracking Methods (when `track_calls=True`)

reset_call_count() -> None

Reset the call counter and all tracking data to zero.
- Raises: ValueError if call tracking is not enabled
calls_in_window(window_seconds: int) -> int

Return number of calls made in the last N seconds.
- window_seconds: Time window in seconds (must be positive)
- Returns: Number of calls in the specified window
- Raises: ValueError if call tracking is not enabled or window_seconds is not positive
get_efficiency(window_seconds: int = 60) -> float

Calculate rate limit efficiency as percentage (0.0 to 100.0).
- window_seconds: Time window for efficiency calculation (default: 60)
- Returns: Efficiency percentage (0.0 = no calls, 100.0 = maximum rate utilisation)
- Raises: ValueError if call tracking is not enabled or window_seconds is not positive

Context Manager Support

The RateLimiter can be used as a context manager:

with limiter:
    # This block will only execute after acquiring a token
    make_api_call()

CallStats

When call tracking is enabled, the stats property returns a CallStats object:

from easylimit import CallStats

@dataclass
class CallStats:
    """Statistics about rate limiter usage."""
    total_calls: int                    # Total number of calls made
    total_delay_seconds: float          # Cumulative delay time
    average_delay_seconds: float        # Average delay per call
    max_delay_seconds: float            # Maximum delay encountered
    calls_per_second_average: float     # Average call rate
    efficiency_percentage: float        # Rate limit efficiency (0-100%)
    tracking_start_time: datetime       # When tracking began
    last_call_time: Optional[datetime]  # Timestamp of most recent call

How It Works

Rate Limiting Algorithm

easylimit uses a token bucket algorithm to provide precise rate limiting:

Token Bucket: A bucket holds tokens, with a maximum capacity equal to max_calls_per_second
Token Refill: Tokens are added to the bucket at a constant rate over time
Token Consumption: Each operation consumes one token from the bucket
Rate Limiting: When the bucket is empty, operations must wait for new tokens

This approach allows for:

Burst Handling: Initial burst of calls up to the bucket capacity
Sustained Rate: Steady rate limiting after the initial burst
Precise Timing: Accurate rate control based on elapsed time

Call Tracking System

When track_calls=True, the rate limiter maintains detailed statistics:

Call Counting: Each successful token acquisition increments the call counter
Delay Tracking: Records the time spent waiting for tokens
Timestamp History: Maintains a sliding window of call timestamps for windowed queries
Memory Management: Automatically cleans up old timestamps beyond the history window
Thread Safety: All tracking operations are protected by the same lock as rate limiting

Efficiency Calculation: Efficiency is calculated as (actual_calls / max_possible_calls) * 100% for a given time window, helping you understand how well you're utilising your rate limit.

Thread Safety

easylimit is fully thread-safe and can be used safely in multi-threaded applications. All operations, including call tracking, are protected by internal locking mechanisms using threading.RLock().

import threading
from easylimit import RateLimiter

# Thread-safe rate limiting with tracking
limiter = RateLimiter(max_calls_per_second=5, track_calls=True)

def worker(worker_id):
    for i in range(10):
        with limiter:
            # Thread-safe API call with automatic tracking
            make_api_call(f"worker_{worker_id}_call_{i}")

# Create multiple threads
threads = [threading.Thread(target=worker, args=(i,)) for i in range(3)]
for thread in threads:
    thread.start()

for thread in threads:
    thread.join()

# All tracking data is consistent across threads
print(f"Total calls from all threads: {limiter.call_count}")
print(f"Thread-safe efficiency: {limiter.get_efficiency():.1f}%")

Development

Setting Up Development Environment

# Clone the repository
git clone https://github.com/man8/easylimit.git
cd easylimit

# Install uv (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install dependencies
uv sync --all-extras --dev

Running Tests

# Run unit tests only (default)
uv run pytest

# Run all tests including integration tests
uv run pytest -m ""

# Run only integration tests
uv run pytest -m integration

# Run with coverage
uv run pytest --cov=easylimit --cov-report=html

Code Quality

# Lint code
uv run ruff check .

# Format code
uv run ruff format .

# Type checking
uv run mypy src/

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Make your changes
Add tests for your changes
Ensure all tests pass (uv run pytest)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Changelog

0.2.0 (2025-06-03)

Initial release
Token bucket rate limiting algorithm
Context manager support
Thread-safe implementation
Call tracking capabilities
- Optional call counting and statistics
- Time-windowed call history tracking
- Efficiency metrics and delay analysis
- Thread-safe tracking with bounded memory usage
Comprehensive test suite with 37 tests
Zero runtime dependencies

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github/workflows		.github/workflows
src/easylimit		src/easylimit
tests		tests
.gitignore		.gitignore
.tool-versions		.tool-versions < 8000 /td>
LICENSE		LICENSE
Pipfile		Pipfile
README.md		README.md
demo.py		demo.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

License

man8/easylimit

Folders and files

Latest commit

History

Repository files navigation

easylimit

Features

Installation

Quick Start

Usage Examples

Basic API Rate Limiting

Handling Multiple APIs with Different Limits

Non-blocking Token Acquisi 8000 tion

Timeout-based Acquisition

Call Tracking and Monitoring

API Reference

RateLimiter

Methods

Call Tracking Properties (when track_calls=True)

Call Tracking Methods (when track_calls=True)

Context Manager Support

CallStats

How It Works

Rate Limiting Algorithm

Call Tracking System

Thread Safety

Development

Setting Up Development Environment

Running Tests

Code Quality

Contributing

License

Changelog

0.2.0 (2025-06-03)

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Call Tracking Properties (when `track_calls=True`)

Call Tracking Methods (when `track_calls=True`)

Packages