Search
5 results for "optimization"
- ai · arxiv/cs.AI · 8 min
GEM activation functions match ReLU speed with smoother gradients
Krause proposes rational activation functions with tunable smoothness that reduce optimization friction in deep networks while maintaining computational efficiency.
Apr 24, 2026 Read → - engineering · arxiv/cs.LG · 8 min
Routing Optimization for Satellite Federated Learning: Tractable Boundaries
Researchers map which routing problems in orbital federated learning can be solved efficiently and which are computationally hard.
Apr 22, 2026 Read → - ai · arxiv/cs.LG · 8 min
Simpler Optimizers Make LLM Unlearning More Robust
Research shows that using lower-order optimization methods during LLM unlearning produces forgetting that resists post-training attacks better than sophisticated gradient-based approaches.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 4 min
LLMs complement but don't replace classical hyperparameter optimization
A study comparing LLM agents to classical algorithms like CMA-ES and TPE finds hybrid approaches work best for tuning model hyperparameters under compute constraints.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 5 min
Rejection-Gated Policy Optimization replaces importance weighting with learned gates
A new reinforcement learning method selects trustworthy samples via differentiable gates instead of reweighting all samples, reducing variance and improving RLHF alignment.
Apr 17, 2026 Read →