[Submitted on 30 Oct 2025]
Dynamic Adaptive Gating with Parallel Pathways
View PDFAbstract:We present Dynamic Adaptive Gating with Parallel Pathways (DAG-PP), a novel feedforward architecture for transformers that combines multiple activation functions through learned blending weights. Our approach achieves improved validation loss compared to standard baselines while maintaining computational efficiency.
Submission history
[v1] Thu, 30 Oct 2025 07:28 UTC