🏗️ Rust Forward Proxy Architecture

This document provides a comprehensive overview of the Rust Forward Proxy architecture, implementation details, and data flows.

🎯 Overview

The Rust Forward Proxy is designed as a high-performance, production-grade HTTP/HTTPS proxy with advanced features:

🔒 Complete HTTPS Interception - TLS termination, certificate generation, and re-encryption
⚡ High Performance - Async architecture with certificate caching and connection pooling
🛡️ Production Ready - Comprehensive logging, monitoring, error handling, and scalability
🧩 Modular Design - Clean separation of concerns with extensible middleware system

📁 Project Structure

rust-forward-proxy/
├── 📦 src/                                    # Core implementation (3,247 lines)
│   ├── 🌐 proxy/                              # HTTP/HTTPS proxy logic (523 lines)
│   │   ├── server.rs                         # Main proxy server (402 lines)
│   │   ├── http_client.rs                    # Optimized HTTP client (78 lines)
│   │   └── streaming.rs                      # Smart body handler (43 lines)
│   │
│   ├── 🔒 tls/                                # TLS & Certificate System (1,156 lines)
│   │   ├── server.rs                         # HTTPS termination server (189 lines)
│   │   ├── cert_gen.rs                       # Certificate generation (515 lines)
│   │   ├── cache.rs                          # Certificate caching (350 lines)
│   │   ├── config.rs                         # TLS configuration (72 lines)
│   │   └── mod.rs                            # TLS module exports (30 lines)
│   │
│   ├── ⚙️ config/                             # Configuration Management (166 lines)
│   │   ├── settings.rs                       # Complete config structs (161 lines)
│   │   └── mod.rs                            # Config exports (5 lines)
│   │
│   ├── 📋 logging/                            # Production Logging (296 lines)
│   │   └── mod.rs                            # Structured logging system
│   │
│   ├── 🛠️ utils/                              # HTTP/URL/Time Utilities (366 lines)
│   │   ├── http.rs                           # HTTP processing (197 lines)
│   │   ├── logging.rs                        # Logging utilities (111 lines)
│   │   ├── url.rs                            # URL parsing (23 lines)
│   │   ├── time.rs                           # Time utilities (24 lines)
│   │   └── mod.rs                            # Utility exports (11 lines)
│   │
│   ├── 🎮 cli/                                # Command-Line Interface (291 lines)
│   │   ├── server.rs                         # Server management commands (130 lines)
│   │   ├── cert.rs                           # Certificate CLI tools (118 lines)
│   │   └── mod.rs                            # CLI exports (43 lines)
│   │
│   ├── 📊 models/                             # Data Structures (137 lines)
│   │   └── mod.rs                            # Request/Response/Log models
│   │
│   ├── ❌ error/                              # Error Handling (50 lines)
│   │   └── mod.rs                            # Custom error types
│   │
│   ├── main.rs                               # Server entry point (134 lines)
│   ├── main_cli.rs                           # CLI entry point (43 lines)
│   └── lib.rs                                # Library exports (22 lines)
│
├── 📚 docs/                                   # Comprehensive Documentation
├── 🐳 docker-compose.yml                     # Docker deployment
├── 📋 Makefile                               # Development/production commands
├── 🔧 Cargo.toml                             # Dependencies and features
└── 🧪 scripts/                               # Testing and setup automation

🌊 Data Flow Architecture

1. HTTP Request Flow

┌─────────────┐    ┌─────────────────────────────────────┐    ┌──────────────────┐
│             │    │                                     │    │                  │
│  HTTP       │    │  Rust Forward Proxy                 │    │  Upstream        │
│  Client     │───▶│  ┌─────────────────────────────────┐ │───▶│  HTTP Server     │
│             │    │  │  HTTP Request Handler           │ │    │                  │
│             │◀───│  │  • Parse headers/body/cookies   │ │◀───│                  │
└─────────────┘    │  │  • Extract form data           │ │    └──────────────────┘
                   │  │  • Log complete request        │ │
                   │  │  • Forward to upstream         │ │
                   │  │  • Log complete response       │ │
                   │  └─────────────────────────────────┘ │
                   └─────────────────────────────────────┘
                                     │
                                     ▼
                            ┌─────────────────┐
                            │  Complete       │
                            │  HTTP Traffic   │
                            │  Visibility     │
                            └─────────────────┘

Detailed HTTP Flow:

Client Request → handle_request() in proxy/server.rs
Request Parsing → extract_request_data() extracts headers, cookies, body
Logging → log_incoming_request() creates clean INFO log
Upstream Forward → build_forwarding_request() constructs upstream request
Response Handling → Complete response logging and return to client

2. Utility Modules (`src/utils/`) - 366 lines total

HTTP Utilities (`src/utils/http.rs`) - 197 lines

// CONNECT handling
parse_connect_target() -> Result<(String, u16), String>
build_error_response(status: StatusCode, message: &str) -> Response<Body>

// Request processing  
extract_headers(req_headers: &HeaderMap, request_data: &mut RequestData)
extract_cookies_to_request_data(req_headers: &HeaderMap, request_data: &mut RequestData)
should_extract_body(req_headers: &HeaderMap, method: &str) -> (bool, Option<String>)
extract_body(body: Body, request_data: &mut RequestData)

// Request building
build_forwarding_request(request_data: &RequestData) -> Result<Request<Body>>

Logging Utilities (`src/utils/logging.rs`) - 111 lines

// Request logging
log_incoming_request(method: &str, uri: &str, remote_addr: &SocketAddr)
log_forwarding_request(request_data: &RequestData)

// CONNECT logging
log_connect_request(uri: &str)
log_connect_success(host: &str, port: u16, connect_time: u128)
log_connect_failure(host: &str, port: u16, connect_time: u128, error: &str)

// HTTP logging  
log_http_success(method: &str, path: &str, status: StatusCode, total_time: u128)
log_http_failure(method: &str, path: &str, total_time: u128, error: &anyhow::Error)

// Transaction logging
create_connect_transaction(request_data, response_data, error) -> ProxyLog

URL/Time Utilities (`src/utils/url.rs`, `src/utils/time.rs`) - 47 lines

URL parsing and manipulation functions
Time-related utility functions

3. Data Models (`src/models/mod.rs`) - 137 lines

Core Data Structures:

pub struct RequestData {
    // HTTP request metadata
    method: String,
    url: String, 
    path: String,
    query_string: Option<String>,
    headers: HashMap<String, String>,
    cookies: HashMap<String, String>,
    body: Vec<u8>,
    form_data: HashMap<String, String>,
    
    // Connection metadata
    client_ip: IpAddr,
    client_port: u16,
    timestamp: DateTime<Utc>,
    is_https: bool,
    // ... other fields
}

pub struct ResponseData {
    status_code: u16,
    status_text: String,  
    headers: Vec<(String, String)>,
    body: Vec<u8>,
    response_time_ms: u64,
    // ... other fields
}

pub struct ProxyLog {
    request: RequestData,
    response: Option<ResponseData>,
    error: Option<String>,
}

4. Logging System (`src/logging/mod.rs`) - 296 lines

Two-Tier Logging Architecture:

INFO Level (Production)

📥 GET http://httpbin.org/get from 127.0.0.1
🔄 Forwarding GET to upstream  
📤 Upstream response: 200 (156ms)
✅ GET /get → 200 OK (158ms)
##################################

🔐 CONNECT tunnel to example.com:443
✅ Tunnel established to example.com:443 (45ms)
🔌 Tunnel completed for example.com:443
##################################

DEBUG Level (Development)

🔍 REQUEST DETAILS:
  Method: GET
  URI: http://httpbin.org/get
  Remote: 127.0.0.1:54321
  Headers: 12

🔄 FORWARDING REQUEST:
  Method: GET
  URL: http://httpbin.org/get  
  Headers: 10
  Body Size: 0 bytes

📋 HTTP TRANSACTION:
{
  "request": { ... full RequestData struct ... },
  "response": { ... full ResponseData struct ... }
}

Data Flow Architecture

HTTP Request Flow

1. Client Request → handle_request()
2. log_incoming_request() → Clean INFO log
3. extract_request_data() → Parse headers, cookies, body
4. handle_http_request() → Process with full interception  
5. build_forwarding_request() → Construct upstream request
6. Forward to upstream → Get response
7. log_http_success() → Clean completion log
8. Return response to client

HTTPS CONNECT Flow

1. Client CONNECT → handle_request()
2. log_connect_request() → Log tunnel establishment
3. handle_connect_request() → Parse target host:port
4. TcpStream::connect() → Establish upstream connection
5. handle_connect_tunnel() → Return 200 OK + spawn tunnel task
6. hyper::upgrade::on() → Upgrade connection 
7. tunnel_bidirectional() → Raw TCP forwarding
8. log_connect_success() → Log tunnel completion

Modular Architecture Benefits

1. Clean Separation of Concerns

Server Logic: Pure request handling and flow control
HTTP Utilities: Reusable HTTP processing functions
Logging Utilities: Consistent logging across the application
Data Models: Clear data structure definitions

2. DRY Principle Implementation

No Code Duplication: Common patterns extracted into utility functions
Reusable Components: Utilities can be imported anywhere
Single Source of Truth: Each function has one clear responsibility

3. Testability & Maintainability

Unit Testing: Individual utility functions can be tested in isolation
Clear Dependencies: Easy to trace function dependencies
Focused Changes: Modifications affect only relevant modules

4. Performance Characteristics

Async Throughout: Full Tokio/Hyper async stack
Zero-Copy Where Possible: Minimal data copying in request/response handling
Connection Reuse: HTTP client reuses connections for efficiency
Streaming: Large request/response bodies handled efficiently

Extension Points

The current architecture supports extension in several areas:

Middleware System: Can be added to src/proxy/middleware/
Authentication: Auth utilities can be added to src/utils/
Rate Limiting: Rate limiting logic can be integrated into request flow
Connection Pooling: Upstream connection management can be enhanced
Protocol Support: WebSocket tunneling, HTTP/2 support can be added

Security Considerations

HTTPS Tunneling: Raw TCP passthrough maintains end-to-end encryption
Request Validation: Input validation in utility functions
Error Handling: Safe error propagation without information leakage
Memory Safety: Rust's ownership system prevents common security issues

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🏗️ Rust Forward Proxy Architecture

🎯 Overview

📁 Project Structure

🌊 Data Flow Architecture

1. HTTP Request Flow

2. Utility Modules (`src/utils/`) - 366 lines total

HTTP Utilities (`src/utils/http.rs`) - 197 lines

Logging Utilities (`src/utils/logging.rs`) - 111 lines

URL/Time Utilities (`src/utils/url.rs`, `src/utils/time.rs`) - 47 lines

3. Data Models (`src/models/mod.rs`) - 137 lines

4. Logging System (`src/logging/mod.rs`) - 296 lines

INFO Level (Production)

DEBUG Level (Development)

Data Flow Architecture

HTTP Request Flow

HTTPS CONNECT Flow

Modular Architecture Benefits

1. Clean Separation of Concerns

2. DRY Principle Implementation

3. Testability & Maintainability

4. Performance Characteristics

Extension Points

Security Considerations

FilesExpand file tree

architecture.md

Latest commit

History

architecture.md

File metadata and controls

🏗️ Rust Forward Proxy Architecture

🎯 Overview

📁 Project Structure

🌊 Data Flow Architecture

1. HTTP Request Flow

2. Utility Modules (src/utils/) - 366 lines total

HTTP Utilities (src/utils/http.rs) - 197 lines

Logging Utilities (src/utils/logging.rs) - 111 lines

URL/Time Utilities (src/utils/url.rs, src/utils/time.rs) - 47 lines

3. Data Models (src/models/mod.rs) - 137 lines

4. Logging System (src/logging/mod.rs) - 296 lines

INFO Level (Production)

DEBUG Level (Development)

Data Flow Architecture

HTTP Request Flow

HTTPS CONNECT Flow

Modular Architecture Benefits

1. Clean Separation of Concerns

2. DRY Principle Implementation

3. Testability & Maintainability

4. Performance Characteristics

Extension Points

Security Considerations

2. Utility Modules (`src/utils/`) - 366 lines total

HTTP Utilities (`src/utils/http.rs`) - 197 lines

Logging Utilities (`src/utils/logging.rs`) - 111 lines

URL/Time Utilities (`src/utils/url.rs`, `src/utils/time.rs`) - 47 lines

3. Data Models (`src/models/mod.rs`) - 137 lines

4. Logging System (`src/logging/mod.rs`) - 296 lines