Merge branch 'main' of https://whale.am28.uni-tuebingen.de/git/j_hartling/paper_2025

2026-06-17 16:15:52 +02:00
parent dab77a262c cc4da8aa05
commit 015ef529e5
2 changed files with 52 additions and 42 deletions
--- a/main.tex
+++ b/main.tex
@@ -1691,7 +1691,7 @@ additional certainty.
 \subsection{Invariant processing in the grasshopper auditory system}
 \label{sec:general_inv}

-% Invariance in the general (systemic) sense:
+% Invariance in the general (systemic) sense (could be skipped if too much):
 The notion of invariance is fundamental for sensory processing systems.
 Invariance, in the general sense, can be described as the property of a
 transformation to maintain variation across certain meaningful input parameters
@@ -1785,52 +1785,61 @@ that are robust to noise masking~(\bcite{einhaupl2011attractiveness}).
 % Trading SNR for log-HP intensity invariance (+variability, +general principle):
 The SNR of each song representation prior to $\adapt(t)$ increases
 monotonically with $\sca$~(excluding $0<\sca\ll1$, noise regime). These
-representations maintain and improve the initial SNR of $\raw(t)$ and hence
-never achieve intensity invariance. In contrast, the SNR of the
-intensity-invariant $\adapt(t)$ never exceeds its saturation level even for
-arbitrarily high $\sca$. The saturation level of $\adapt(t)$ varies across
-species and songs. This variability is likely rooted in the way in which
-logarithmic compression acts on the specific distribution of $\env(t)$, which
-depends on the $\fc$ of the lowpass filter as well as the temporal structure
-and frequency spectrum of the rectified $\filt(t)$. Overall, $\adapt(t)$ has
-never been observed to exceed a SNR of around~10 across all songs. The low SNR
-of $\adapt(t)$ partially results from the amplification of smaller values of
-$\env(t)$ by the logarithm, which raises the noise floor of $\adapt(t)$. Still,
-the reduction in SNR is substantial --- considering that the SNR of preceeding
-song representations has been orders of magnitude higher --- but is likely a
-necessary price to pay for the intensity invariance of $\adapt(t)$. After all,
-a transformation cannot compress a range of different input intensities into a
+representations maintain the full extent of the initial SNR of $\raw(t)$ and
+hence never achieve intensity invariance. In contrast, the SNR of $\adapt(t)$
+saturates for sufficiently high $\sca$. Accordingly, $\adapt(t)$ is
+intensity-invariant but cannot have a higher SNR than the saturation level,
+which indicates a fundamental trade-off. The saturation level of $\adapt(t)$
+varies across species and songs. This variability is likely rooted in the way
+in which logarithmic compression acts on the specific $\env(t)$, which depends
+on the $\fc$ of the lowpass filter as well as the temporal structure and
+frequency spectrum of the rectified $\filt(t)$. Across all songs, the
+saturation level of $\adapt(t)$ has never been observed to exceed a SNR of
+around~10. This is a substantial reduction in SNR, considering that the SNR of
+preceeding representations had been orders of magnitude higher. Part of this
+reduction stems from the amplification of smaller values of $\env(t)$ by
+logarithmic compression, which raises the noise floor of $\adapt(t)$ relative
+to the song. Accordingly, the low SNR of $\adapt(t)$ appears to be a necessary
+price to pay for its intensity invariance through logarithmic compression and
+adaptation. But the trade-off between intensity invariance and SNR likely goes
+beyond the particular mechanisms along the pathway. After all, a transformation
+is not expected to compress a range of different input intensities into a
 constant output intensity without sacrificing some of the corresponding input
-SNR. Accordingly, the trade-off between intensity invariance and SNR is not
-expected to be specific to the particular mechanisms along the pathway but
-presumably applies to any transformation that achieves or improves intensity
-invariance.
+SNR. This suggests that the trade-off is a more general principle that applies
+to any transformation that achieves or improves intensity invariance.

-Thresholding and temporal averaging renders feature $f_i(t)$
-intensity-invariant for sufficiently large $\sca$. The trade-off between
-intensity invariance and SNR is mediated by threshold value $\thr$. A lower
-$\thr$ ($\thr\to0$) improves intensity invariance by shifting the saturation
-point towards lower $\sca$ but also decreases the SNR of $f_i(t)$. The
-saturation level of $f_i(t)$ is independent of $\thr$ as long as the intensity
-invariance by the previous mechanism is neglected. The SNR of $f_i(t)$ is
-therefore determined solely by the pure-noise response of $f_i(t)$. The
-distribution $\pci$ of the pure-noise kernel response $c_i(t)$ is largely a
-normal distribution with mean $\mu\approx0$ for all kernels $k_i(t)$. The value
-of the pure-noise $f_i(t)$ is hence 0.5 for $\thr=0$ and decreases for higher
-$\thr$. If $\thr$ is set above the maximum of $c_i(t)$, the pure-noise feature
-value is 0, which results in an "unlimited" SNR of $f_i(t)$. In this case, any
-non-zero feature value that is sustained for a sufficient duration could serve
-as indicator for the presence of $\soc(t)$, although at the cost of a higher
-saturation point. The maximum of the pure-noise $c_i(t)$ is assumed to be very
+The second mechanism of intensity invariance consists of thresholding and
+temporal averaging of $c_i(t)$ into $f_i(t)$. Here, the trade-off between
+intensity invariance and SNR is mediated by the threshold value $\thr$. A lower
+$\thr$~($\thr\to0$) improves the intensity invariance of $f_i(t)$ by shifting
+the saturation point towards lower $\sca$. However, a lower $\thr$ also raises
+the noise floor of $f_i(t)$ by including more of the pure-noise $c_i(t)$, which
+decreases the SNR of $f_i(t)$. The distribution $\pci$ of the pure-noise
+$c_i(t)$ is very close to a normal distribution with mean $\mu\approx0$ for all
+kernels $k_i(t)$. The value of the pure-noise $f_i(t)$ is hence 0.5 for
+$\thr=0$ and decreases for higher $\thr$. If $\thr$ is set above the maximum of
+$c_i(t)$, the pure-noise feature value is 0, which results in an "unlimited"
+SNR of $f_i(t)$. In this case, any non-zero feature value that is sustained for
+a sufficient duration could serve as indicator for the presence of $\soc(t)$ in
+$\raw(t)$, although at the cost of a higher saturation point. Of course, this
+would require a fine evolutionary tuning of $\thr$ to the properties of the
+natural noise in a certain habitat to avoid false positives.
+
+The saturation level of $f_i(t)$ is independent of $\thr$ as long as the
+intensity invariance by the previous mechanism is neglected.
+
+
+If $f_i(t)$ can achieve an arbitrarily high SNR, it can counteract the
+comparably low SNR of $\adapt(t)$ 
+
+The maximum of the pure-noise $c_i(t)$ is assumed to be very
 small due to the various SNR improvements along the pathway, so that the
 required increase in $\thr$ and hence the saturation point of $f_i(t)$ is not
-expected to be substantial. However, exploiting the capacity of $f_i(t)$ for
-arbitrarily high SNR would certainly require a fine evolutionary tuning of
-$\thr$ to the properties of both the species-specific song and the natural
-noise in a certain habitat.
+expected to be substantial.

-\newpage
-\subsection{Intensity invariance versus intensity invariance}
+
+% \newpage
+% \subsection{Intensity invariance versus intensity invariance}

 Two consecutive mechanisms of intensity invariance do not necessarily add up to
 a stronger overall intensity invariance. If the first mechanism results in a