AI‑WAF

A self‑learning, Django‑friendly Web Application Firewall
with rate‑limiting, anomaly detection, honeypots, UUID‑tamper protection, dynamic keyword extraction, file‑extension probing detection, exempt path awareness, and daily retraining.

System Requirements

No GPU needed—AI-WAF runs entirely on CPU with just Python 3.8+, Django 3.2+, a single vCPU and ~512 MB RAM for small sites; for moderate production traffic you can bump to 2–4 vCPUs and 2–4 GB RAM, offload the daily detect-and-train job to a worker, and rotate logs to keep memory use bounded.

📁 Package Structure

aiwaf/
├── __init__.py
├── blacklist_manager.py
├── middleware.py
├── trainer.py                   # exposes train()
├── utils.py
├── template_tags/
│   └── aiwaf_tags.py
├── resources/
│   ├── model.pkl                # pre‑trained base model
│   └── dynamic_keywords.json    # evolves daily
├── management/
│   └── commands/
│       └── detect_and_train.py  # `python manage.py detect_and_train`
└── LICENSE

🚀 Features

IP Blocklist
Instantly blocks suspicious IPs (supports CSV fallback or Django model).
Rate Limiting
Sliding‑window blocks flooders (> AIWAF_RATE_MAX per AIWAF_RATE_WINDOW), then blacklists them.
AI Anomaly Detection
IsolationForest trained on:
- Path length
- Keyword hits (static + dynamic)
- Response time
- Status‑code index
- Burst count
- Total 404s
Dynamic Keyword Extraction & Cleanup
- Every retrain adds top 10 keyword segments from 4xx/5xx paths
- If a path is added to AIWAF_EXEMPT_PATHS, its keywords are automatically removed from the database
File‑Extension Probing Detection
Tracks repeated 404s on common extensions (e.g. .php, .asp) and blocks IPs.
Honeypot Field
Hidden field for bot detection → IP blacklisted on fill.
UUID Tampering Protection
Blocks guessed or invalid UUIDs that don’t resolve to real models.
Exempt Path Awareness
Fully respects AIWAF_EXEMPT_PATHS across all modules — exempt paths are:
- Skipped from keyword learning
- Immune to AI blocking
- Ignored in log training
- Cleaned from DynamicKeyword model automatically
Daily Retraining
Reads rotated logs, auto‑blocks 404 floods, retrains the IsolationForest, updates model.pkl, and evolves the keyword DB.

⚙️ Configuration (`settings.py`)

INSTALLED_APPS += ["aiwaf"]

Database Setup

After adding aiwaf to your INSTALLED_APPS, run the following to create the necessary tables:

python manage.py makemigrations aiwaf
python manage.py migrate

Required

AIWAF_ACCESS_LOG = "/var/log/nginx/access.log"

Optional (defaults shown)

AIWAF_MODEL_PATH         = BASE_DIR / "aiwaf" / "resources" / "model.pkl"
AIWAF_HONEYPOT_FIELD     = "hp_field"
AIWAF_RATE_WINDOW        = 10         # seconds
AIWAF_RATE_MAX           = 20         # max requests per window
AIWAF_RATE_FLOOD         = 10         # flood threshold
AIWAF_WINDOW_SECONDS     = 60         # anomaly detection window
AIWAF_FILE_EXTENSIONS    = [".php", ".asp", ".jsp"]
AIWAF_EXEMPT_PATHS       = [          # optional but highly recommended
    "/favicon.ico",
    "/robots.txt",
    "/static/",
    "/media/",
    "/health/",
]

Note: You no longer need to define AIWAF_MALICIOUS_KEYWORDS or AIWAF_STATUS_CODES — they evolve dynamically.

🧱 Middleware Setup

Add in this order to your MIDDLEWARE list:

MIDDLEWARE = [
    "aiwaf.middleware.IPAndKeywordBlockMiddleware",
    "aiwaf.middleware.RateLimitMiddleware",
    "aiwaf.middleware.AIAnomalyMiddleware",
    "aiwaf.middleware.HoneypotMiddleware",
    "aiwaf.middleware.UUIDTamperMiddleware",
    # ... other middleware ...
]

🕵️ Honeypot Field (in your template)

{% load aiwaf_tags %}

<form method="post">
  {% csrf_token %}
  {% honeypot_field %}
  <!-- your real fields -->
</form>

Renders a hidden <input name="hp_field" style="display:none">.
Any non‑empty submission → IP blacklisted.

🔁 Running Detection & Training

python manage.py detect_and_train

What happens:

Read access logs (incl. rotated or gzipped)
Auto‑block IPs with ≥ 6 total 404s
Extract features & train IsolationForest
Save model.pkl
Extract top 10 dynamic keywords from 4xx/5xx
Remove any keywords associated with newly exempt paths

🧠 How It Works

Middleware	Purpose
IPAndKeywordBlockMiddleware	Blocks requests from known blacklisted IPs and Keywords
RateLimitMiddleware	Enforces burst & flood thresholds
AIAnomalyMiddleware	ML‑driven behavior analysis + block on anomaly
HoneypotMiddleware	Detects bots filling hidden inputs in forms
UUIDTamperMiddleware	Blocks guessed/nonexistent UUIDs across all models in an app

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

👤 Credits

AI‑WAF by Aayush Gauba

“Let your firewall learn and evolve — keep your site a fortress.”

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.github/workflows		.github/workflows
aiwaf		aiwaf
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI‑WAF

System Requirements

📁 Package Structure

🚀 Features

⚙️ Configuration (`settings.py`)

Database Setup

Required

Optional (defaults shown)

🧱 Middleware Setup

🕵️ Honeypot Field (in your template)

🔁 Running Detection & Training

What happens:

🧠 How It Works

📄 License

👤 Credits

About

Uh oh!

Releases 11

Packages

Uh oh!

Languages

License

aayushgauba/aiwaf

Folders and files

Latest commit

History

Repository files navigation

AI‑WAF

System Requirements

📁 Package Structure

🚀 Features

⚙️ Configuration (settings.py)

Database Setup

Required

Optional (defaults shown)

🧱 Middleware Setup

🕵️ Honeypot Field (in your template)

🔁 Running Detection & Training

What happens:

🧠 How It Works

📄 License

👤 Credits

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 11

Packages 0

Uh oh!

Languages

⚙️ Configuration (`settings.py`)

Packages