Source
arxiv/cs.LG
59 insights rewritten from this source.
- ai · arxiv/cs.LG · 4 min
Synthetic Computers Enable Agent Training at Scale
Researchers create realistic digital workspaces to train AI agents on long-horizon productivity tasks, scaling from thousands to potentially billions of simulated user environments.
May 3, 2026 Read → - ai · arxiv/cs.LG · 4 min
ActiNet: Self-Supervised Model Improves Wrist Activity Classification
Open-source deep learning tool outperforms random forest baselines for extracting activity intensity from wearable accelerometer data in epidemiological research.
May 3, 2026 Read → - ai · arxiv/cs.LG · 8 min
Mixed Precision Training Stabilizes Neural ODEs
Researchers demonstrate a framework that reduces memory use by 50% and speeds up neural ODE training 2x by carefully mixing low and high precision arithmetic.
May 3, 2026 Read → - ai · arxiv/cs.LG · 4 min
Selective-Update RNNs Match Transformers While Using Less Memory
A new RNN architecture learns when to update internal state, preserving memory across long sequences and reducing computational waste on redundant input.
May 3, 2026 Read → - ai · arxiv/cs.LG · 8 min
Logic Rules Boost Generative ML Trustworthiness in Networks
NetNomos integrates formal logic constraints into generative models to enforce networking rules and reduce hallucinations in telemetry, forecasting, and synthetic data tasks.
May 3, 2026 Read → - ai · arxiv/cs.LG · 8 min
Model Architecture Controls Whether Errors Stay Hidden
Transformer design determines if internal decision signals remain observable after training, independent of output confidence metrics.
Apr 29, 2026 Read → - engineering · arxiv/cs.LG · 4 min
Graph Neural Networks Cut QAOA Query Cost by 87%
A trust-region method using GNNs to predict QAOA parameter distributions reduces circuit evaluations while preserving solution quality on small graphs.
Apr 29, 2026 Read → - ai · arxiv/cs.LG · 8 min
Web agents plateau on short tasks; Odysseys benchmark tests realistic multi-hour workflows
New benchmark reveals frontier AI models achieve only 44.5% success on long-horizon web tasks spanning multiple sites, exposing efficiency gaps in agent design.
Apr 29, 2026 Read → - engineering · arxiv/cs.LG · 3 min
CiteRadar maps researcher influence across institutions and geography
Open-source tool transforms Google Scholar profiles into structured citation networks with geographic visualization and author metadata enrichment.
Apr 29, 2026 Read → - ai · arxiv/cs.LG · 5 min
MotionBricks: Real-Time Motion Generation at 15,000 FPS
A modular generative framework scales motion synthesis to production speeds while supporting multi-modal control without requiring animation expertise.
Apr 29, 2026 Read → - ai · arxiv/cs.LG · 5 min
Frontier coding agents now autonomously build AlphaZero pipelines
Claude Opus 4.7 successfully implements end-to-end ML systems from task descriptions alone, matching external solvers on Connect Four within three hours.
Apr 29, 2026 Read → - ai · arxiv/cs.LG · 8 min
Log-odds aggregation handles unknown state spaces in forecast combining
Chen, Peng, and Tang propose a closed-form aggregator for combining expert forecasts when the underlying outcome range is unknown, achieving tighter regret bounds than prior methods.
Apr 28, 2026 Read → - engineering · arxiv/cs.LG · 8 min
Tessera: Cache-Line Encryption for Edge AI Without Bandwidth Loss
A hardware architecture that decrypts neural network weights at 64-byte granularity, hiding cryptographic overhead within DRAM fetch latency on shared-memory edge accelerators.
Apr 28, 2026 Read → - ai · arxiv/cs.LG · 4 min
Efficient Rationale Retrieval via Student-Teacher Distillation
Rabtriever reduces computational cost of LLM-based document ranking by distilling cross-encoder knowledge into independent query-document encoders.
Apr 28, 2026 Read → - ai · arxiv/cs.LG · 8 min
Agentic AI Security Requires Layered Defense, Not Just Prompt Guards
A new framework maps AI agent vulnerabilities across seven architectural layers and four time horizons, revealing that 93% of research ignores the slowest, most dangerous threats.
Apr 28, 2026 Read → - ai · arxiv/cs.LG · 8 min
Admissible Objectives for Hierarchical Clustering Formally Characterized
Tsukuba and Ando extend the theory of objective functions for hierarchical clustering, characterizing when functions recover ground-truth structures and introducing max-type variants.
Apr 28, 2026 Read → - engineering · arxiv/cs.LG · 8 min
Learning turbulence closures via nudging sidesteps solver backprop
A data-assimilation-inspired approach trains neural network turbulence models on DNS data without embedding them in solvers, reducing computational cost and improving stability.
Apr 28, 2026 Read → - ai · arxiv/cs.LG · 4 min
Hyperbolic neural networks outperform Euclidean models in quantum simulations
Researchers demonstrate that Poincaré and Lorentz recurrent architectures consistently beat standard neural quantum states on many-body physics benchmarks.
Apr 28, 2026 Read → - ai · arxiv/cs.LG · 8 min
Neural Networks and ODEs Compute Primitive Recursion via Dynamics, Not Composition
Bournez proves recurrent ReLU networks, polynomial ODEs, and discrete maps all express primitive recursive functions through continuous-time trajectories rather than symbolic subroutine chaining.
Apr 28, 2026 Read → - engineering · arxiv/cs.LG · 8 min
Sequential decision-making reduces error drift in modular digital twins
Researchers frame error propagation in digital twins as a Markov decision process, comparing model-based and model-free approaches to optimize maintenance interventions.
Apr 27, 2026 Read → - ai · arxiv/cs.LG · 8 min
Poisoning attacks on recommender systems gain potency through worst-case modeling
Researchers propose SharpAP, a method that optimizes fake user injection attacks by targeting worst-case model structures, improving cross-system transferability.
Apr 27, 2026 Read → - ai · arxiv/cs.LG · 4 min
LLMs use hidden confidence signals to detect and fix their own errors
Research shows large language models maintain a second-order evaluative signal that predicts error detection and self-correction beyond what their output probabilities reveal.
Apr 27, 2026 Read → - ai · arxiv/cs.LG · 4 min
Neural networks unmix single Raman spectra without multiple samples
A brain-inspired deep learning model solves the underdetermined problem of identifying chemical components from one noisy mixed spectrum, enabling rapid substance detection.
Apr 27, 2026 Read → - ai · arxiv/cs.LG · 8 min
Simple graph models match deep learning for molecular prediction
Classical topological indices enhanced with regularization and ensemble methods outperform neural networks on molecular property benchmarks without GPU requirements.
Apr 23, 2026 Read → - engineering · arxiv/cs.LG · 8 min
Multi-Agent Edge Systems Hit a Scaling Wall at 100+ Agents
A new framework addresses the Synergistic Collapse problem where performance degrades superlinearly as distributed agents grow, combining neural caching, action pruning, and hardware matching.
Apr 23, 2026 Read → - ai · arxiv/cs.LG · 8 min
Dataset Distillation Fails Without Hard Labels
Soft labels mask poor dataset quality in distillation methods, making random subsets nearly as effective as curated ones.
Apr 22, 2026 Read → - ai · arxiv/cs.LG · 8 min
Concept Bottleneck Models Hit Hard Ceiling in Dermoscopy Data
Rough-set analysis reveals 16% of concept profiles in Derm7pt are internally inconsistent, capping model accuracy at 92% regardless of architecture.
Apr 22, 2026 Read → - engineering · arxiv/cs.LG · 8 min
Routing Optimization for Satellite Federated Learning: Tractable Boundaries
Researchers map which routing problems in orbital federated learning can be solved efficiently and which are computationally hard.
Apr 22, 2026 Read → - ai · arxiv/cs.LG · 8 min
Simpler Optimizers Make LLM Unlearning More Robust
Research shows that using lower-order optimization methods during LLM unlearning produces forgetting that resists post-training attacks better than sophisticated gradient-based approaches.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 8 min
Three diffusion methods unified under population genetics framework
Researchers connect discrete, Gaussian, and simplicial diffusion models through Wright-Fisher theory, enabling stable cross-domain sequence generation.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 8 min
Theory for learning blind inverse problems with finite samples
Researchers establish sample complexity bounds and optimal estimators for blind inverse problems using linear minimum mean square estimation framework.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 4 min
LLMs complement but don't replace classical hyperparameter optimization
A study comparing LLM agents to classical algorithms like CMA-ES and TPE finds hybrid approaches work best for tuning model hyperparameters under compute constraints.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 4 min
Weak Labels Fail Across Time Even When Domain Transfer Works
A study of CRISPR experiments reveals supervision drift—where the labeling mechanism itself shifts—causes model collapse in temporal transfer despite strong in-domain performance.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 8 min
Chain-of-Thought Supervision Eliminates Sample Complexity Growth
New theoretical analysis shows intermediate reasoning steps remove dependence on generation length, while end-to-end learning scales unpredictably with sequence depth.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 6 min
Automating Dataset Creation with LLMs and Search Engines
Researchers propose ADC, a method to build large labeled datasets automatically using language models and web search, reducing manual annotation work and cost.
Apr 21, 2026 Read → - engineering · arxiv/cs.LG · 4 min
Kernel-Level LLM Safety via Logit Inspection
ProbeLogits reads token probabilities before generation to enforce safety policies at the OS level, achieving parity with learned classifiers at 2.5x speed.
Apr 21, 2026 Read → - ai · arxiv/cs.LG · 4 min
Neural CTMC decouples discrete diffusion into timing and direction
A new parameterization for discrete diffusion models separates when and where tokens jump, aligning training with mathematical structure.
Apr 20, 2026 Read → - ai · arxiv/cs.LG · 8 min
Chromatic Clustering Requires New Algorithms to Match Standard Performance
Adding color constraints to correlation clustering increases computational difficulty; a new coupled approach recovers optimal approximation bounds.
Apr 20, 2026 Read → - ai · arxiv/cs.LG · 4 min
Quantum-LSTM hybrid cuts physics model training data by 100×
Federated learning with quantum-enhanced LSTM achieves classical accuracy on SUSY classification using 20K samples instead of 2M, with under 300 parameters.
Apr 20, 2026 Read → - engineering · arxiv/cs.LG · 8 min
ML predicts nonlinear distortion in massive MIMO arrays
Machine learning models forecast signal degradation from power amplifier nonlinearity in 5G/6G systems, enabling 12% throughput gains via adaptive power allocation.
Apr 20, 2026 Read → - engineering · arxiv/cs.LG · 4 min
Hybrid PINNs: Finite-Difference Regularization for Physics Solvers
Adding weak finite-difference gradient penalties to physics-informed neural networks improves boundary accuracy without replacing automatic-differentiation residuals.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 3 min
Framework uses AI outputs as features, not proxies, for labeled data
Generative Augmented Inference treats LLM predictions as informative signals rather than direct substitutes, reducing human labeling needs by 75–90% across operations tasks.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Foundation Models vs. Task-Specific ML in Electricity Price Forecasting
Time series foundation models outperform traditional deep learning on probabilistic forecasts, but well-tuned conventional models remain competitive at lower computational cost.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
LLM Panels Match Expert Clinicians in Medical Diagnosis Scoring
A study of three frontier AI models scoring real hospital cases shows calibrated LLM juries can reliably replace human expert panels for medical AI evaluation.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 5 min
Rejection-Gated Policy Optimization replaces importance weighting with learned gates
A new reinforcement learning method selects trustworthy samples via differentiable gates instead of reweighting all samples, reducing variance and improving RLHF alignment.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
INT4 Quantization Fails After FP32 Convergence in Predictable Phases
Post-training quantization assumes converged models are ready to compress, but INT4 quantization collapses in a three-phase pattern tied to weight updates, not learning rate decay.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Distilling Transformers into Mamba via Linearized Attention
A two-stage knowledge transfer method preserves Transformer performance in State Space Models by routing through linearized attention as an intermediate step.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Three-Phase Transformer: Structural Prior for Decoder Efficiency
A residual-stream architecture using cyclic channel partitioning and phase-aligned rotations achieves 7% perplexity gains with minimal parameter overhead.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 6 min
Speech Models Fail Safety Tests That Text Passes
VoxSafeBench reveals speech language models recognize social norms in text but ignore them when cues arrive through voice, speaker identity, or environment.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 6 min
Speech Models Fail Safety Tests That Text Models Pass
A new benchmark reveals that speech language models drop safety, fairness, and privacy protections when cues arrive as audio rather than text.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 4 min
Retrieval-Augmented Set Completion for Clinical Code Authoring
A two-stage approach retrieves similar clinical value sets then classifies candidates, outperforming direct LLM generation on standardized medical vocabularies.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 4 min
Retrieval beats memorization for clinical code selection
A two-stage retrieval-then-classify method outperforms direct LLM generation for assembling clinical value sets from large standardized vocabularies.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Machine Learning Maps Drug Binding to Viral RNA Pseudoknot
Spectral map analysis reveals how small-molecule inhibitors distort SARS-CoV-2 RNA structure in topology-dependent ways, with protonation state determining mechanism.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Action Aliasing Breaks Safe RL Differently Depending on Filter Placement
A formal comparison of two projection-based safety strategies reveals that embedding safeguards in the policy creates gradient rank deficiency, while environment-level filters distribute the problem to the critic.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 3 min
Transformer models outperform CNNs in prostate MRI segmentation
SwinUNETR achieves 5-point Dice improvement over standard UNet when trained on mixed-reader datasets, suggesting transformer attention handles annotation variability better.
Apr 17, 2026 Read → - engineering · arxiv/cs.LG · 8 min
Queueing Model Reveals How AI Automation Paradoxically Worsens Cyber Risk
Research from Yun et al. shows that symmetric automation in attack and defense can increase exploit success rates, with heavy-tailed patching delays creating persistent vulnerability backlogs.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Quantum kernel inference cuts query cost by removing data-size dependence
New algorithm reduces quantum machine learning inference complexity from O(N) to O(1) in data size, achieving query-optimal bounds via amplitude estimation.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Formalizing How Much Data Proves a Learning Model Right
Researchers formalize identifying information—the bits needed to confirm or reject a hypothesis—bridging information theory with practical sample complexity.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Estimating classification ceiling without perfect labels
Ushio et al. show how to measure the theoretical best-case error rate in binary classification using imperfect soft labels and calibration techniques.
Apr 17, 2026 Read →