====== A Bayesian Interpretation of the Light Gated Recurrent Unit ====== * **ID:** 20231012155231808-1184 * **Researcher:** Alexandre Bittar, Philip N. Garner * **WP:** Other * **PI:** null * **Abstract:** We summarise previous work showing that the basic sigmoid activation function arises as an instance of Bayes’s theorem, and that recurrence follows from the prior. We derive a layerwise recurrence without the assumptions of previous work, and show that it leads to a standard recurrence with modest modifications to reflect use of log-probabilities. The resulting architecture closely resembles the Li-GRU which is the current state of the art for ASR. Although the contribution is mainly theoretical, we show that it is able to outperform the state of the art on the TIMIT and AMI datasets. * **Publication DOI:** [[https://doi.org/10.1109/ICASSP39728.2021.9414259|https://doi.org/10.1109/ICASSP39728.2021.9414259]] * **Publication Link:** [[https://ieeexplore.ieee.org/document/9414259/|https://ieeexplore.ieee.org/document/9414259/]] * **Data Type:** null * **Data Format:** null * **Git:** [[None|None]]