There two distinct calibration procedures to be used, depending on the methodology used to build the scoring function.
- Logistic regression: already yields sample-dependent PD estimates, which need to be rescaled to each segment's average PD.
- Statistical and heuristic models (e.g. MDA): calibration assigns PD values to scores; rescaling may be necessary.
Calibration for logistic regression
Rescaling is done via the Relative Default Frequency, that is, the ration of defaulters to non-defaulters in the sample.
Procedure for rescaling the PD estimates from a logistic regression model:
Calibrating for Multiple Discriminant Analysis and other statistical and heuristic models
Procedure:
- Calculation of the mean PD of the non-defaulted sample.
- Conversion of mean PD in RDF(sample).
- Obtaining the mean PD of the portfolio (average default rate per segment).
- Representation of the model's PD estimates as RDF(unscaled).
- RDF scaling: RDF(scaled) = RDF(unscaled) * (RDF(portfolio)/RDF(sample))
- Conversion of RDF(scaled) in PD(scaled)
Calibrating for Multiple Discriminant Analysis and other statistical and heuristic models
Procedure:
- Division of score range into intervals
- Differences between mean PDs of different intervals are sufficiently large.
- 10 or more classes (min. 7 according to Directive 2006/48/EC)
- 100 cases per interval to make a reliable estimate of mean PD.
- Interval widths not necessarily identical.
- RDF(unscaled) is computed for each interval as the ratio of defaulters to non-defaulters.
- RDF scaling: RDF(scaled) = RDF(unscaled) * (RDF(portfolio)/RDF(sample))
- Conversion of RDF(scaled) in PD(scaled)
No comments:
Post a Comment