Good progress with cleaning up "IntInv vs SNR". Gonna merge with "IntInv vs IntInv" tomorrow.

2026-06-16 18:23:14 +02:00
parent 21e6ab4d64
commit 395cfb98ab
2 changed files with 35 additions and 38 deletions
--- a/main.pdf
+++ b/main.pdf
--- a/main.tex
+++ b/main.tex
@@ -1719,18 +1719,7 @@ degree of temporal integration~(Section\,\ref{sec:constant_feat}).
 \subsection{Intensity invariance versus SNR along the model pathway}
-% % Establishing the principle trade-off (should maybe come later?):
+% Building a sufficiently large SNR "buffer":
 % The output of a transformation is considered to be intensity-invariant if its
 % intensity measure saturates for sufficiently large scales $\sca$, which in turn
 % caps the output SNR to a constant value across these $\sca$. Otherwise, the
 % output SNR will increase monotonically with $\sca$. The trade-off between
 % intensity invariance and SNR refers to the principle that a transformation can
 % either improve intensity invariance or maintain SNR --- it cannot do both at
 % the same time. This principle is most likely not specific to the two mechanisms
 % along the model pathway but rather a general property of transformations that
 % equalize between different input intensities.
 % Building a sufficient SNR "buffer":
 A stridulating grasshopper generates a song with a specific initial intensity,
 which is steadily attenuated as the song propagates through the
 environment~(\bcite{michelsen1978sound}). A listening grasshopper receives a
@@ -1742,12 +1731,12 @@ filtering of $\raw(t)$ into $\filt(t)$ likely improves the SNR by attenuating
 frequencies outside the relevant range of grasshopper songs. The SNR is further
 improved by the rectification and lowpass filtering of $\filt(t)$ into
 $\env(t)$. The lower the cutoff frequency $\fc$ of the lowpass filter, the
-higher the SNR of $\env(t)$ at a given $\sca$, although $\fc$ must also be
+higher the SNR of $\env(t)$ for a given $\sca$, although $\fc$ must also be
-sufficiently high to preserve the amplitude dynamics of the song pattern.
+sufficiently high to preserve the amplitude dynamics of the song pattern. The
-Overall, the first processing steps along the pathway are not designed to
+first processing steps along the pathway are hence designed to improve the SNR
-achieve intensity invariance but rather to improve the SNR of the song
+of the song representation beyond the initial SNR of $\raw(t)$.
 representation beyond the initial SNR of $\raw(t)$.
 % Dependence of log-HP intensity invariance on sufficient SNR (+implications):
 The first mechanism of intensity invariance consists of logarithmic compression
 and adaptation of $\env(t)$ into $\adapt(t)$. In the absence of $\noc(t)$,
 $\adapt(t)$ is a perfectly intensity-invariant representation of $\soc(t)$. In
@@ -1757,28 +1746,36 @@ $\raw(t)$ to $\env(t)$ thus serve to improve the intensity invariance of
 $\adapt(t)$ by shifting the saturation point towards lower $\sca$. However,
 this effect is limited --- if the SNR of $\raw(t)$ at the receiver's position
 does not allow for a sufficiently high SNR of $\env(t)$, $\adapt(t)$ will not
-be intensity-invariant. The initial song intensity that the sender can achieve
+be intensity-invariant. In this case, the receiver is presumably less likely to
-therefore determines the distance at which $\adapt(t)$ is intensity-invariant
+recognize $\raw(t)$ as a conspecific song. The limitation of the intensity
-to the receiver.
+invariance of $\adapt(t)$ by the SNR of $\raw(t)$ might hence at least in parts
 be responsible for the limited maximum distance at which song recognition is
 possible~(\bcite{lang2000acoustic}) and the selection towards song patterns
 that are robust to noise masking~(\bcite{einhaupl2011attractiveness}).
-Assuming that intensity invariance of $\adapt(t)$ is required for reliable song
+% Trading SNR for log-HP intensity invariance (+variability, +general principle):
-recognition, 
+The SNR of each song representation prior to $\adapt(t)$ increases
-
+monotonically with $\sca$~(excluding $0<\sca\ll1$, noise regime). These
-
+representations maintain and improve the initial SNR of $\raw(t)$ and hence
-This might be a reason why robustness to noise masking is an
+never achieve intensity invariance. In contrast, the SNR of the
-attractive property of male calling songs~(\bcite{einhaupl2011attractiveness}).
+intensity-invariant $\adapt(t)$ never exceeds its saturation level even for
-
+arbitrarily high $\sca$. The saturation level of $\adapt(t)$ varies across
-The saturation level of $\adapt$,
+species and songs. This variability is likely rooted in the way in which
-unlike its saturation point, is independent of the SNR of $\env(t)$ because the
+logarithmic compression acts on the specific distribution of $\env(t)$, which
-influence of $\noc(t)$ is negligible for sufficiently large $\sca$. The output
+depends on the $\fc$ of the lowpass filter as well as the temporal structure
-SNR of $\adapt(t)$ saturates at a comparably low value of around 10. This might
+and frequency spectrum of the rectified $\filt(t)$. Overall, $\adapt(t)$ has
-in parts be a consequence of the logarithm, which compresses different higher
+never been observed to exceed a SNR of around~10 across all songs. The low SNR
-intensities but also amplifies lower intensities, including the noise floor.
+of $\adapt(t)$ partially results from the amplification of smaller values of
-Both the saturation level and the saturation point of $\adapt(t)$ vary between
+$\env(t)$ by the logarithm, which raises the noise floor of $\adapt(t)$. Still,
-different species and individual songs. These differences are likely rooted in
+the reduction in SNR is substantial --- considering that the SNR of preceeding
-the way in which logarithmic compression acts on the specific distribution of
+song representations has been orders of magnitude higher --- but is likely a
-$\env(t)$, which is determined by $\fc$ as well as the temporal structure and
+necessary price to pay for the intensity invariance of $\adapt(t)$. After all,
-frequency spectrum of the rectified $\filt(t)$.
+a transformation cannot compress a range of different input intensities into a
 constant output intensity without sacrificing some of the corresponding input
 SNR. Accordingly, the trade-off between intensity invariance and SNR is not
 expected to be specific to the particular mechanisms along the pathway but
 presumably applies to any transformation that achieves or improves intensity
 invariance.
 Thresholding and temporal averaging renders feature $f_i(t)$
 intensity-invariant for sufficiently large $\sca$. The trade-off between