Control Variate
...大约 1 分钟
Control Variate
layout: Slide sidebar: false breadcrumb: false pageInfo: false
Introduction to Control Variate
Target
Reduce the variance of a random variable .
Method
Generate an alternative random variable such that:
Approach and Proof
The control variate is defined as:
Where:
- is any other random variable (different from ) with known
- is a constant
Key Requirements
- must be such that is known a priori
- should be an inherent part of the simulation output for (generated "for free" with )
Control Variate in IMTSP
Problem Formulation
View MTSP as a bilevel optimization problem:
Upper Level:
- Optimize allocation network (city-agent assignment)
- Optimize surrogate network
Lower Level:
- Optimize TSP solver (single-agent routing)
Notation Breakdown
Component | Description |
---|---|
Allocation network (parameters ) | |
Sampling function | |
TSP solver | |
TSP solver parameters | |
Euclidean distance cost function |
Gradient Estimation
Challenge 1: Non-Differentiability
Solution: Log-Derivative Trick
Implement gradient computation through:
- Allocation network
- TSP solver and parameters
Compact form:
Proof of Gradient Interchange
Under regularity conditions:
Rewritten as:
Variance Reduction
Challenge 2: High Variance
::: success Solution: Control Variate Introduce surrogate network to:
- Provide control variate for allocation network
- Minimize single-sample gradient variance :::
Surrogate Network Design
Input: Allocation matrix
Output: Maximum tour length
Powered by Waline v3.1.3