Figure 4:
Suitably chosen intermediate ConvRNN circuits provide consistent predictions of primate ventral stream neural dynamics. (a) The y-axis indicates the median across neurons of the explained variance between predictions and ground-truth responses on held-out images divided by the square root of the internal consistencies of the neurons, defined in section A.6.3. Error bars indicates the s.e.m. across neurons (N=88 for V4, N=88 for pIT, N=80 for cIT/aIT) averaged across 10 ms time bins (N=4 each for the Early and Late designations). As can be seen, the intermediate-depth feedforward BaseNet model (first bars) is a poor predictor of the subset of late responses that are beyond the feedforward pass, but certain types of ConvRNN circuits (such as RGC Median, UGRNN, and GRU) added to the BaseNet are overall best predictive across visual areas at late time points (Wilcoxon test, with Bonferroni correction with feedforward BaseNet, p<0.001 for each visual area). See Figure S6 for the full time courses at the resolution of 10 ms bins. (b) For each ConvRNN circuit, we compare the average neural predictivity (averaged per neuron across early and late timepoints) averaged across areas, to the OST consistency. The ConvRNNs that have the best average neural predictivity also best match the OST consistency (RGC Median, UGRNN, and GRU).

Suitably chosen intermediate ConvRNN circuits provide consistent predictions of primate ventral stream neural dynamics. (a) The y-axis indicates the median across neurons of the explained variance between predictions and ground-truth responses on held-out images divided by the square root of the internal consistencies of the neurons, defined in section A.6.3. Error bars indicates the s.e.m. across neurons (N=88 for V4, N=88 for pIT, N=80 for cIT/aIT) averaged across 10 ms time bins (N=4 each for the Early and Late designations). As can be seen, the intermediate-depth feedforward BaseNet model (first bars) is a poor predictor of the subset of late responses that are beyond the feedforward pass, but certain types of ConvRNN circuits (such as RGC Median, UGRNN, and GRU) added to the BaseNet are overall best predictive across visual areas at late time points (Wilcoxon test, with Bonferroni correction with feedforward BaseNet, p<0.001 for each visual area). See Figure S6 for the full time courses at the resolution of 10 ms bins. (b) For each ConvRNN circuit, we compare the average neural predictivity (averaged per neuron across early and late timepoints) averaged across areas, to the OST consistency. The ConvRNNs that have the best average neural predictivity also best match the OST consistency (RGC Median, UGRNN, and GRU).

Close Modal

or Create an Account

Close Modal
Close Modal