Search
5 results for "inference"
- engineering · arxiv/cs.AI · 6 min
Vibration Gestures on Furniture via Efficient FPGA Neural Networks
Researchers compress neural networks for gesture recognition on low-power FPGAs, eliminating complex preprocessing and cutting energy use to under 1.2 mJ per inference.
Apr 22, 2026 Read → - engineering · hackernoon · 7 min
LLMesh routes local LLM requests across machines via one endpoint
A distributed inference broker lets teams share GPU hardware without changing application code between dev, staging, and production.
Apr 18, 2026 Read → - ai · arxiv/cs.AI · 8 min
Small Models Match Large Ones via Inference Scaffolding
McClendon et al. show that role-based prompt structuring at inference time doubles small-model performance on complex tasks without retraining.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 3 min
Framework uses AI outputs as features, not proxies, for labeled data
Generative Augmented Inference treats LLM predictions as informative signals rather than direct substitutes, reducing human labeling needs by 75–90% across operations tasks.
Apr 17, 2026 Read → - ai · arxiv/cs.LG · 8 min
Quantum kernel inference cuts query cost by removing data-size dependence
New algorithm reduces quantum machine learning inference complexity from O(N) to O(1) in data size, achieving query-optimal bounds via amplitude estimation.
Apr 17, 2026 Read →