Processing math: 100%

PKH1982

Victor Navarro

The mathematics behind PKH1982

Another departure from global error term models such as RW1972 (Rescorla & Wagner, 1972), the PKH1982 model (Pearce et al., 1982) does not use an error term for learning excitatory associations (but does for inhibitory associations), and ties stimulus associability (α) to absolute global prediction error.

note: The implementation of this model closely follows the technical note from the CAL-R group where possible. Divergences are noted.

1 - Generating expectations

Let vk,j denote the excitatory strength from stimulus k to stimulus j, and vk,¯j the inhibitory strength from stimulus k to stimulus j (effectively, a “no j” representation). On any given trial, the net expectation of stimulus j, ej, is given by:

ej=Kkxkvk,jKkxkvk,¯j

where xk denotes the presence (1) or absence (0) of stimulus k, and the set K represents all stimuli in the design.

2 - Learning associations

Changes to the excitatory and inhibitory associations between stimuli are given by:

Δvi,j=δjxiβex,jαiλj Δvi,¯j=xiβin,jαi|¯λj|

where βex,j and βin,j represent learning rates for excitatory and inhibitory associations, respectively, as determined by stimulus j, αi is the associability of stimulus i, respectively, and λj and ¯λj are the excitatory asymptote and the overexpectation of stimulus j, respectively.

Importantly, δj in Eq.2a is a parameter that is equal to 1 if the expectation of stimulus j, is lower than its excitatory asymptote (i.e., ej<λj), but 0 if not. This implies that the model stops strengthening vi,j if the expectation of j is higher than its excitatory asymptote.

As mentioned in the introductory note, the PKH1982 model does not learn excitatory associations via correction error. However, the model does learn inhibitory associations via correction error, as the overexpectation term above, ¯λj is equal to min(λjej,0), where min is the minimum function. This implies ¯λj only takes non-zero values when the expectation of j is higher than its intensity on the trial (λj).

3 - Learning to attend

The associability parameter αi changes completely from trial to trial as a function of learning (note the lack of Δ below), with the change being equal to the difference of the absolute global error, via:

Δαi=xiKjγj(|λjej|) where γj denotes the contribution of the prediction error based on the jth stimulus. In this regard, it is important to note that Pearce et al. (1982) did not extend their model to account for the predictive power of within-compound associations, yet the implementation of the model in this package does. This can sometimes result in unexpected behaviour, and as such, Eq. 3 above includes the extra parameter γj (defaulting to 1/K) that denotes whether the expectation of stimulus j contributes to attentional learning. As such, the user can set these parameters manually to reflect the contribution of the different experimental stimuli. For example, in a simple “AB>(US)” design, setting γUS = 1 and γA=γB=0 leads to the behavior of the original model.

The PKH1982 model improves upon the Pearce & Hall (1980) model by adding an extra parameter that controls the rate at which associability changes. If we qualify the changes in associability determined by Eq.3 via Δαni (meaning they happened after trial n), then we can quantify the total associability of stimulus i after trial n via:

αni={(1θi)αn1i+θiΔαnj, if xi=1αni, otherwise where θi is a parameter determining both the rate at which associability decays (via 1θi), and the rate at which increments in attention occur. Note that changes in associability only apply to stimuli presented on the trial (i.e., xi=1); attention to absent stimuli remains unchanged.

4 - Generating responses

There is no specification of response-generating mechanisms in PKH1982. However, the simplest response function that can be adopted is the identity function on stimulus expectations. If so, the responses reflecting the nature of j, rj, are given by:

rj=ej

References

Pearce, J. M., & Hall, G. (1980). A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychological Review, 87, 532–552. https://doi.org/10.1037/0033-295X.87.6.532
Pearce, J. M., Kaye, H., & Hall, G. (1982). Predictive accuracy and stimulus associability: Development of a model for Pavlovian conditioning. In Quantitative analyses of behavior: Acquisition (Vol. 3, pp. 241–255). Ballinger.
Rescorla, R. A., & Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory. (pp. 64–69). Appleton-Century-Crofts.