Top Headlines

Microsoft Proposes SageServe to Cut GPU Costs for LLM Inference

Researchers Highlight Gaps in CPU Isolation Against Microarchitectural Leaks

DroidSpeak Boosts Multi‑LLM Inference Efficiency by Up to Threefold

HarvestContainers Enables Up to 75% CPU Core Sharing for Latency‑Sensitive Kubernetes Pods

Researchers Test Brain Foundation Models for Continuous Cognitive‑Load Monitoring

Concord Tool Learns Network Configuration Contracts to Detect Misconfigurations

EgoBrain Dataset Launches Multimodal Human‑Action Research Platform

New Methods Reduce Low‑Probability Token Dominance in RL Training of LLMs

MetaMuse Framework Boosts Cloud Algorithm Performance

VidGuard‑R1 Sets New Benchmark in AI‑Generated Video Detection

Adversarial RL Framework Boosts LLM Unit Test Generation

Generative UI Workshop at CHI 2026 to Explore AI‑Driven Interface Design

Engaging Disability Communities in AI Image Representation

VeriStruct Extends AI‑Assisted Formal Verification to Rust Data‑Structure Modules

Niyama Boosts LLM Inference Capacity by 32% with Fine‑Grained QoS

MSCCL++ Proposes New GPU Communication Abstractions for AI Inference

SUTRADHARA Identifies Latency Bottlenecks in Tool‑Based LLM Agents

OrbitalBrain Demonstrates Up to 12.4× Faster In‑Space ML Training

Researchers Propose SRv6 for Fully Controllable AI Backend Traffic

Predictive Inverse Dynamics Models Show Sample‑Efficiency Gains Over Behavior Cloning

UniRG Framework Sets New State‑of‑the‑Art in Chest X‑Ray Report Generation

Benchmarking Affordance Generalization with BusyBox

New Algorithms Produce Tight Composable Coresets for Constrained Determinant Maximization

SALAD‑VAE Achieves State‑of‑the‑Art Semantic Audio Compression

Sublinear‑Time Approximation for Metric Steiner Forest and MIS Size Estimation

Terabyte‑Scale Analytics Achieve 60× Speedup on GPU Clusters

New Retrieval Method Boosts Long‑Document QA Performance

Real-Time Generative Speech Restoration Achieves 20 ms Latency

Sci‑Phi: First Audio LLM for Full Spatial‑Scene Description

Predictive Models for Kidney Offer Acceptance Face Data and Decision Gaps