The setup is as follow:
- Random feature vectors .
- State space , possible classes.
- Decision space , where we have possible decisions.
- Loss function , the cost of deciding when the true state is .
The goal is to minimize the expected loss. We define the conditional risk for one sample as:
Now, we need to calculate the overall risk for all the samples.
Note that here is a decision rule that maps each to a decision. Next, to find the optimal decision rule ,
This might be hard to calculate, so we minimize the integral by minimizing the integrand at each pooint. This is called Minimal Risk Decision.
To calculate , same as the above equation actually.