[Submitted on 31 Oct 2025]
Expanding Activation Ranges in Transformer Feedforward Networks
Abstract: We present xSiLU, an improved activation function for transformer feedforward networks that learns an optimal gating range. Experimental results show consistent improvements over baseline methods.
Submission history
[v1] Fri, 31 Oct 2025 22:38 UTC