NVIDIA argues that robotaxi safety requires more than perception and driving decisions. The post presents Halos OS as a production safety foundation covering a certifiable OS, standardized interfaces, AI guardrails and large-scale validation. It also highlights global robotaxi collaborations using DRIVE Hyperion and the broader Halos stack across training, simulation and in-vehicle inference.
Based only on the headline, astronauts sheltered while air leak repairs were taking place and were later told to return to the ISS. The available text does not specify the leak location, severity, agencies involved, repair status, or operational impact. This should be treated as a limited incident update rather than an AI-related development.
AI security is shifting from technical jailbreaks to "Vibe Hacking," where attackers use social engineering and psychological tactics to manipulate an LLM's simulated persona. By exploiting the model's behavioral tendencies rather than code vulnerabilities, this trend establishes "psychocybersecurity" as a critical new frontier for AI alignment and safety.
In Import AI 438, Jack Clark examines two questions concerning AI security and privacy: **1. You Are Your LLM History** As large language models…
Vercel announced in its Changelog that it is officially adding support for OpenAI's new safety guardrail model, **GPT-OSS-Safeguard-20B**, within the Vercel AI…
Google DeepMind has recently announced the strengthening of its Frontier Safety Framework (FSF) — a systematic mechanism designed to proactively identify…
Meta's Llama Guard family of safety guardrail models has a new member, Llama Guard 4, which is now available on the Hugging Face Hub. As a…
With the explosion of AI Agent technology, AI is no longer just a passive chatbot that answers questions — it has become an entity capable of autonomously…
Google released a major update to the Gemma 2 family in late July 2024, comprising three core components: 1. **Gemma 2 2B**: A lightweight model with just 2.6B…
### Introduction: Capability Is Not Safety — A New Benchmark for LLM Safety Evaluation

As large language models (LLMs) are adopted more deeply across…
With the explosion of generative AI models like Stable Diffusion, Hugging Face's Diffusers library has become the go-to tool for developers deploying and…