Finished methods.

Attempting to clean up tex files.
2026-05-16 14:43:13 +02:00
parent 155fb1eecf
commit 915b8ea4ff
14 changed files with 47 additions and 11341 deletions
--- a/main.tex
+++ b/main.tex
@@ -458,14 +458,15 @@ the following feature extraction stage.

 \subsubsection{Feature extraction by individual neurons}

-The ascending neurons extract and encode a number of different features of the
-preprocessed signal, and hence represent the signal in a higher-dimensional
-space than the preceding receptor neurons and local interneurons. Each
-ascending neuron is assumed to scan the signal for a specific template pattern,
-which can be thought of as a kernel of a particular structure and on a
-particular time scale. This process, known as template matching, can be
-modelled as a convolution of the intensity-adapted envelope $\adapt(t)$ with a
-kernel $k_i(t)$ specific to the $i$-th ascending neuron:
+The population of ascending neurons extracts and encodes a number of different
+features of the preprocessed signal, and hence represents the signal in a
+higher-dimensional space than the preceding receptor neurons and local
+interneurons~(\bcite{clemens2011efficient}). Each ascending neuron is assumed
+to scan the signal for a specific template pattern, which can be thought of as
+a kernel of a particular structure and on a particular time scale. This
+process, known as template matching, can be modelled as a convolution of the
+intensity-adapted envelope $\adapt(t)$ with a kernel $k_i(t)$ specific to the
+$i$-th ascending neuron:
 \begin{equation}
    c_i(t)\,=\,\adapt(t)\,*\,k_i(t)
    = \infint \adapt(\tau)\,\cdot\,k_i(t\,-\,\tau)\,d\tau
@@ -495,14 +496,14 @@ $\rh$ relative to the maximum of the Gaussian:
 With this, an appropriate carrier frequency $\kfi$ for obtaining a Gabor kernel
 with width $\kwi$ and desired lobe number $\kni$ can be approximated as
 \begin{equation}
-    \kfi(\kni,\,\kwi,\,\rh)\,=\,\frac{0.5\,\cdot\,(\kni\,+\,\beta_0)}{\fdrm(\kwi,\,\rh)}, \qquad \kni\,\geq\,2\enspace\forall\enspace \kni\,\in\,\mathbb{Z}
+    \kfi(\kni,\,\kwi,\,\rh)\,=\,\frac{0.5\,\cdot\,\kni\,+\,\beta_0}{\fdrm(\kwi,\,\rh)}, \qquad \kni\,\geq\,2\enspace\forall\enspace \kni\,\in\,\mathbb{Z}
    \label{eq:gabor_freq}
 \end{equation}
 % \begin{equation}
-%     \kfi(\kni,\,\kwi,\,\rh)\,=\,\frac{0.5\,\cdot\,(\kni\,+\,\beta_0)}{2\,\cdot\,\sqrt{-2\,\cdot\,\ln \rh}\cdot\kwi}, \qquad \kni\,\geq\,2\enspace\forall\enspace \kni\,\in\,\mathbb{Z}
+%     \kfi(\kni,\,\kwi,\,\rh)\,=\,\frac{0.5\,\cdot\,\kni\,+\,\beta_0}{2\,\cdot\,\sqrt{-2\,\cdot\,\ln \rh}\cdot\kwi}, \qquad \kni\,\geq\,2\enspace\forall\enspace \kni\,\in\,\mathbb{Z}
 % \end{equation}
 The relationship between $\kfi$ and $\kni$ is approximately linear except for
-small $\kni$. The offset term $\beta_0\approx0.5$ was added to balance the
+small $\kni$. The offset term $\beta_0\approx0.26$ was added to balance the
 amplitudes of the $\kni$ desired lobes of the kernel --- which should be
 maximized --- against the amplitudes of the next-outer lobes, which should not
 exceed the threshold value determined by $\rh$. Note that simple Gaussian
@@ -549,23 +550,30 @@ response~(Fig.\,\ref{fig:stages_feat}b):
    \label{eq:binary}
 \end{equation}
 The thresholding of $c_i(t)$ into $b_i(t)$ can be thought of as a
-categorization into "relevant" and "irrelevant" response values.
-% It is unclear whether such a thresholding nonlinearity is actually implemented
-% either by the ascending neurons or at some point further downstream in the SEG.
-Finally, the responses of the ascending neurons are assumed to be integrated
-somewhere in the SEG~(\bcite{ronacher1986routes}; \bcite{bauer1987separate};
-\bcite{bhavsar2017brain}). This processing step can be approximated as temporal
-averaging of the binary responses $b_i(t)$ by a lowpass filter
+categorization into "relevant" and "irrelevant" response values. Similar
+thresholding nonlinearities have been a crucial processing step in previous
+models that deal with the extraction of behaviorally relevant song features in
+insects~(\bcite{clemens2013computational}; \bcite{clemens2013feature};
+\bcite{hennig2014time}; \bcite{ronacher2015computational}).
+% However, there is no direct physiological evidence that would allow to
+% determine the exact location or underlying mechanism of such a nonlinearity in
+% either the ascending neurons or at some point further downstream in the SEG.
+
+In the grasshopper, the responses of the ascending neurons are assumed to be
+integrated somewhere in the SEG~(\bcite{ronacher1986routes};
+\bcite{bauer1987separate}; \bcite{bhavsar2017brain}). In the model pathway,
+temporal integration is implemented as temporal averaging of the binary
+responses $b_i(t)$ by a lowpass filter with extremely low cutoff frequency:
 \begin{equation}
    f_i(t)\,=\,b_i(t)\,*\,\lp, \qquad \fc\,=\,1\,\text{Hz}
    \label{eq:lowpass}
 \end{equation}
-to obtain a final set of slowly changing kernel-specific features $f_i(t)$. In
-the resulting high-dimensional feature space, different species-specific song
-patterns are characterized by a distinct combination of feature values, which
-can be read out by a simple linear classifier.
-
-% Cite somewhere:
+This processing step results in a set of slowly changing kernel-specific
+features $f_i(t)$, which is the final representation along the model
+pathway~(Fig.\,\ref{fig:stages_feat}c). In the resulting high-dimensional
+feature space, different species-specific song patterns can be distinguished by
+their distinct combination of feature values, e.\,g. using Euclidian geometry
+or a simple linear classifier.
 \begin{figure}[!ht]
    \centering
    \includegraphics[width=\textwidth]{figures/fig_feat_stages.pdf}
@@ -770,6 +778,19 @@ stable position and height of the microphone array during recording. The
 resulting recordings were then processed through the model pathway and analyzed
 according to the procedure described in Section~\ref{sec:intensity_measures}.

+\subsection{Determining kernel-specific threshold values}
+
+Different kernels $k_i(t)$ result in specific kernel responses $c_i(t)$,
+Eq.\,\ref{eq:conv}, which are then transformed further into binary responses
+$b_i(t)$, Eq.\,\ref{eq:binary}, by thresholding nonlinearity $\nl$. The
+threshold value $\thr$ is specific to each $k_i(t)$. Across all analyses,
+$\thr$ has been specified as a multiple of the pure-noise reference standard
+deviation $\sigma_{c_i}$ for input $x(t)=\noc(t)$. This ensures that $\thr$ as
+well as the resulting $b_i(t)$ and $f_i(t)$ are comparable across different
+$k_i(t)$ because each pure-noise $c_i(t)$ approximately follows a normal
+distribution~(see appendix
+Figs.\,\ref{fig:app_thresh-lp_kern-sd}-\ref{fig:app_field_kern-sd}).
+
 \section{Results}

 \subsection{Mechanisms driving the emergence of intensity invariance}