Expanse is a YC P26 launch for improving effective utilization in SLURM and Kubernetes GPU/HPC clusters. It analyzes source code, job scripts, hardware topology, and telemetry before submission to recommend GPU VRAM, CPU, memory, utilization, and walltime. The team says it also detects likely failures, offers line-level optimization hints, and fine-tunes cluster-specific models over time.
Hugging Face recently announced a major upgrade to its hosted model deployment service, "Inference Endpoints," introducing a brand-new and far more modern…
The recently launched `smolagents` from Hugging Face is an AI agent framework that emphasizes being lightweight and code-centric (Code Agent). However, as the…