Source

arxiv/cs.LG

59 insights rewritten from this source.

ai · arxiv/cs.LG · 4 min

Synthetic Computers Enable Agent Training at Scale

Researchers create realistic digital workspaces to train AI agents on long-horizon productivity tasks, scaling from thousands to potentially billions of simulated user environments.

May 3, 2026
ai · arxiv/cs.LG · 4 min

ActiNet: Self-Supervised Model Improves Wrist Activity Classification

Open-source deep learning tool outperforms random forest baselines for extracting activity intensity from wearable accelerometer data in epidemiological research.

May 3, 2026
ai · arxiv/cs.LG · 8 min

Mixed Precision Training Stabilizes Neural ODEs

Researchers demonstrate a framework that reduces memory use by 50% and speeds up neural ODE training 2x by carefully mixing low and high precision arithmetic.

May 3, 2026
ai · arxiv/cs.LG · 4 min

Selective-Update RNNs Match Transformers While Using Less Memory

A new RNN architecture learns when to update internal state, preserving memory across long sequences and reducing computational waste on redundant input.

May 3, 2026
ai · arxiv/cs.LG · 8 min

Logic Rules Boost Generative ML Trustworthiness in Networks

NetNomos integrates formal logic constraints into generative models to enforce networking rules and reduce hallucinations in telemetry, forecasting, and synthetic data tasks.

May 3, 2026
ai · arxiv/cs.LG · 8 min

Model Architecture Controls Whether Errors Stay Hidden

Transformer design determines if internal decision signals remain observable after training, independent of output confidence metrics.

Apr 29, 2026
engineering · arxiv/cs.LG · 4 min

Graph Neural Networks Cut QAOA Query Cost by 87%

A trust-region method using GNNs to predict QAOA parameter distributions reduces circuit evaluations while preserving solution quality on small graphs.

Apr 29, 2026
ai · arxiv/cs.LG · 8 min

Web agents plateau on short tasks; Odysseys benchmark tests realistic multi-hour workflows

New benchmark reveals frontier AI models achieve only 44.5% success on long-horizon web tasks spanning multiple sites, exposing efficiency gaps in agent design.

Apr 29, 2026
engineering · arxiv/cs.LG · 3 min

CiteRadar maps researcher influence across institutions and geography

Open-source tool transforms Google Scholar profiles into structured citation networks with geographic visualization and author metadata enrichment.

Apr 29, 2026
ai · arxiv/cs.LG · 5 min

MotionBricks: Real-Time Motion Generation at 15,000 FPS

A modular generative framework scales motion synthesis to production speeds while supporting multi-modal control without requiring animation expertise.

Apr 29, 2026
ai · arxiv/cs.LG · 5 min

Frontier coding agents now autonomously build AlphaZero pipelines

Claude Opus 4.7 successfully implements end-to-end ML systems from task descriptions alone, matching external solvers on Connect Four within three hours.

Apr 29, 2026
ai · arxiv/cs.LG · 8 min

Log-odds aggregation handles unknown state spaces in forecast combining

Chen, Peng, and Tang propose a closed-form aggregator for combining expert forecasts when the underlying outcome range is unknown, achieving tighter regret bounds than prior methods.

Apr 28, 2026
engineering · arxiv/cs.LG · 8 min

Tessera: Cache-Line Encryption for Edge AI Without Bandwidth Loss

A hardware architecture that decrypts neural network weights at 64-byte granularity, hiding cryptographic overhead within DRAM fetch latency on shared-memory edge accelerators.

Apr 28, 2026
ai · arxiv/cs.LG · 4 min

Efficient Rationale Retrieval via Student-Teacher Distillation

Rabtriever reduces computational cost of LLM-based document ranking by distilling cross-encoder knowledge into independent query-document encoders.

Apr 28, 2026
ai · arxiv/cs.LG · 8 min

Agentic AI Security Requires Layered Defense, Not Just Prompt Guards

A new framework maps AI agent vulnerabilities across seven architectural layers and four time horizons, revealing that 93% of research ignores the slowest, most dangerous threats.

Apr 28, 2026
ai · arxiv/cs.LG · 8 min

Admissible Objectives for Hierarchical Clustering Formally Characterized

Tsukuba and Ando extend the theory of objective functions for hierarchical clustering, characterizing when functions recover ground-truth structures and introducing max-type variants.

Apr 28, 2026
engineering · arxiv/cs.LG · 8 min

Learning turbulence closures via nudging sidesteps solver backprop

A data-assimilation-inspired approach trains neural network turbulence models on DNS data without embedding them in solvers, reducing computational cost and improving stability.

Apr 28, 2026
ai · arxiv/cs.LG · 4 min

Hyperbolic neural networks outperform Euclidean models in quantum simulations

Researchers demonstrate that Poincaré and Lorentz recurrent architectures consistently beat standard neural quantum states on many-body physics benchmarks.

Apr 28, 2026
ai · arxiv/cs.LG · 8 min

Neural Networks and ODEs Compute Primitive Recursion via Dynamics, Not Composition

Bournez proves recurrent ReLU networks, polynomial ODEs, and discrete maps all express primitive recursive functions through continuous-time trajectories rather than symbolic subroutine chaining.

Apr 28, 2026
engineering · arxiv/cs.LG · 8 min

Sequential decision-making reduces error drift in modular digital twins

Researchers frame error propagation in digital twins as a Markov decision process, comparing model-based and model-free approaches to optimize maintenance interventions.

Apr 27, 2026
ai · arxiv/cs.LG · 8 min

Poisoning attacks on recommender systems gain potency through worst-case modeling

Researchers propose SharpAP, a method that optimizes fake user injection attacks by targeting worst-case model structures, improving cross-system transferability.

Apr 27, 2026
ai · arxiv/cs.LG · 4 min

LLMs use hidden confidence signals to detect and fix their own errors

Research shows large language models maintain a second-order evaluative signal that predicts error detection and self-correction beyond what their output probabilities reveal.

Apr 27, 2026
ai · arxiv/cs.LG · 4 min

Neural networks unmix single Raman spectra without multiple samples

A brain-inspired deep learning model solves the underdetermined problem of identifying chemical components from one noisy mixed spectrum, enabling rapid substance detection.

Apr 27, 2026
ai · arxiv/cs.LG · 8 min

Simple graph models match deep learning for molecular prediction

Classical topological indices enhanced with regularization and ensemble methods outperform neural networks on molecular property benchmarks without GPU requirements.

Apr 23, 2026
engineering · arxiv/cs.LG · 8 min

Multi-Agent Edge Systems Hit a Scaling Wall at 100+ Agents

A new framework addresses the Synergistic Collapse problem, in which performance degrades superlinearly as the number of distributed agents grows, by combining neural caching, action pruning, and hardware matching.

Apr 23, 2026
ai · arxiv/cs.LG · 8 min

Dataset Distillation Fails Without Hard Labels

Soft labels mask poor dataset quality in distillation methods, making random subsets nearly as effective as curated ones.

Apr 22, 2026
ai · arxiv/cs.LG · 8 min

Concept Bottleneck Models Hit Hard Ceiling in Dermoscopy Data

Rough-set analysis reveals 16% of concept profiles in Derm7pt are internally inconsistent, capping model accuracy at 92% regardless of architecture.

Apr 22, 2026
engineering · arxiv/cs.LG · 8 min

Routing Optimization for Satellite Federated Learning: Tractable Boundaries

Researchers map which routing problems in orbital federated learning can be solved efficiently and which are computationally hard.

Apr 22, 2026
ai · arxiv/cs.LG · 8 min

Simpler Optimizers Make LLM Unlearning More Robust

Research shows that using lower-order optimization methods during LLM unlearning produces forgetting that resists post-training attacks better than sophisticated gradient-based approaches.

Apr 21, 2026
ai · arxiv/cs.LG · 8 min

Three diffusion methods unified under population genetics framework

Researchers connect discrete, Gaussian, and simplicial diffusion models through Wright-Fisher theory, enabling stable cross-domain sequence generation.

Apr 21, 2026
ai · arxiv/cs.LG · 8 min

Theory for learning blind inverse problems with finite samples

Researchers establish sample complexity bounds and optimal estimators for blind inverse problems using a linear minimum mean square estimation framework.

Apr 21, 2026
ai · arxiv/cs.LG · 4 min

LLMs complement but don't replace classical hyperparameter optimization

A study comparing LLM agents to classical algorithms like CMA-ES and TPE finds hybrid approaches work best for tuning model hyperparameters under compute constraints.

Apr 21, 2026
ai · arxiv/cs.LG · 4 min

Weak Labels Fail Across Time Even When Domain Transfer Works

A study of CRISPR experiments reveals supervision drift—where the labeling mechanism itself shifts—causes model collapse in temporal transfer despite strong in-domain performance.

Apr 21, 2026
ai · arxiv/cs.LG · 8 min

Chain-of-Thought Supervision Eliminates Sample Complexity Growth

New theoretical analysis shows intermediate reasoning steps remove dependence on generation length, while end-to-end learning scales unpredictably with sequence depth.

Apr 21, 2026
ai · arxiv/cs.LG · 6 min

Automating Dataset Creation with LLMs and Search Engines

Researchers propose ADC, a method to build large labeled datasets automatically using language models and web search, reducing manual annotation work and cost.

Apr 21, 2026
engineering · arxiv/cs.LG · 4 min

Kernel-Level LLM Safety via Logit Inspection

ProbeLogits reads token probabilities before generation to enforce safety policies at the OS level, achieving parity with learned classifiers at 2.5x speed.

Apr 21, 2026
ai · arxiv/cs.LG · 4 min

Neural CTMC decouples discrete diffusion into timing and direction

A new parameterization for discrete diffusion models separates when and where tokens jump, aligning training with mathematical structure.

Apr 20, 2026
ai · arxiv/cs.LG · 8 min

Chromatic Clustering Requires New Algorithms to Match Standard Performance

Adding color constraints to correlation clustering increases computational difficulty; a new coupled approach recovers optimal approximation bounds.

Apr 20, 2026
ai · arxiv/cs.LG · 4 min

Quantum-LSTM hybrid cuts physics model training data by 100×

Federated learning with quantum-enhanced LSTM achieves classical accuracy on SUSY classification using 20K samples instead of 2M, with under 300 parameters.

Apr 20, 2026
engineering · arxiv/cs.LG · 8 min

ML predicts nonlinear distortion in massive MIMO arrays

Machine learning models forecast signal degradation from power amplifier nonlinearity in 5G/6G systems, enabling 12% throughput gains via adaptive power allocation.

Apr 20, 2026
engineering · arxiv/cs.LG · 4 min

Hybrid PINNs: Finite-Difference Regularization for Physics Solvers

Adding weak finite-difference gradient penalties to physics-informed neural networks improves boundary accuracy without replacing automatic-differentiation residuals.

Apr 17, 2026
ai · arxiv/cs.LG · 3 min

Framework uses AI outputs as features, not proxies, for labeled data

Generative Augmented Inference treats LLM predictions as informative signals rather than direct substitutes, reducing human labeling needs by 75–90% across operations tasks.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

Foundation Models vs. Task-Specific ML in Electricity Price Forecasting

Time series foundation models outperform traditional deep learning on probabilistic forecasts, but well-tuned conventional models remain competitive at lower computational cost.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

LLM Panels Match Expert Clinicians in Medical Diagnosis Scoring

A study of three frontier AI models scoring real hospital cases shows calibrated LLM juries can reliably replace human expert panels for medical AI evaluation.

Apr 17, 2026
ai · arxiv/cs.LG · 5 min

Rejection-Gated Policy Optimization replaces importance weighting with learned gates

A new reinforcement learning method selects trustworthy samples via differentiable gates instead of reweighting all samples, reducing variance and improving RLHF alignment.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

INT4 Quantization Fails After FP32 Convergence in Predictable Phases

Post-training quantization assumes converged models are ready to compress, but INT4 quantization collapses in a three-phase pattern tied to weight updates, not learning rate decay.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

Distilling Transformers into Mamba via Linearized Attention

A two-stage knowledge transfer method preserves Transformer performance in State Space Models by routing through linearized attention as an intermediate step.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

Three-Phase Transformer: Structural Prior for Decoder Efficiency

A residual-stream architecture using cyclic channel partitioning and phase-aligned rotations achieves 7% perplexity gains with minimal parameter overhead.

Apr 17, 2026
ai · arxiv/cs.LG · 6 min

Speech Models Fail Safety Tests That Text Passes

VoxSafeBench reveals speech language models recognize social norms in text but ignore them when cues arrive through voice, speaker identity, or environment.

Apr 17, 2026
ai · arxiv/cs.LG · 6 min

Speech Models Fail Safety Tests That Text Models Pass

A new benchmark reveals that speech language models drop safety, fairness, and privacy protections when cues arrive as audio rather than text.

Apr 17, 2026
ai · arxiv/cs.LG · 4 min

Retrieval-Augmented Set Completion for Clinical Code Authoring

A two-stage approach retrieves similar clinical value sets then classifies candidates, outperforming direct LLM generation on standardized medical vocabularies.

Apr 17, 2026
ai · arxiv/cs.LG · 4 min

Retrieval beats memorization for clinical code selection

A two-stage retrieval-then-classify method outperforms direct LLM generation for assembling clinical value sets from large standardized vocabularies.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

Machine Learning Maps Drug Binding to Viral RNA Pseudoknot

Spectral map analysis reveals how small-molecule inhibitors distort SARS-CoV-2 RNA structure in topology-dependent ways, with protonation state determining mechanism.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

Action Aliasing Breaks Safe RL Differently Depending on Filter Placement

A formal comparison of two projection-based safety strategies reveals that embedding safeguards in the policy creates gradient rank deficiency, while environment-level filters distribute the problem to the critic.

Apr 17, 2026
ai · arxiv/cs.LG · 3 min

Transformer models outperform CNNs in prostate MRI segmentation

SwinUNETR achieves a 5-point Dice improvement over a standard UNet when trained on mixed-reader datasets, suggesting transformer attention handles annotation variability better.

Apr 17, 2026
engineering · arxiv/cs.LG · 8 min

Queueing Model Reveals How AI Automation Paradoxically Worsens Cyber Risk

Research from Yun et al. shows that symmetric automation in attack and defense can increase exploit success rates, with heavy-tailed patching delays creating persistent vulnerability backlogs.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

Quantum kernel inference cuts query cost by removing data-size dependence

New algorithm reduces quantum machine learning inference complexity from O(N) to O(1) in data size, achieving query-optimal bounds via amplitude estimation.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

Formalizing How Much Data Proves a Learning Model Right

Researchers formalize identifying information—the bits needed to confirm or reject a hypothesis—bridging information theory with practical sample complexity.

Apr 17, 2026
ai · arxiv/cs.LG · 8 min

Estimating classification ceiling without perfect labels

Ushio et al. show how to measure the theoretical best-case error rate in binary classification using imperfect soft labels and calibration techniques.

Apr 17, 2026
