Latest in AI




Silia: A Tiny Transformer Architecture for Sub-10M Parameter Models
r/LocalLLaMA top day · 3 days ago · Paper
A student from India shared their first paper on r/LocalLLaMA, proposing Silia, a Transformer architecture for extremely small models. The idea is to merge attention-style dynamic mixing with SwiGLU-like nonlinear transformation, aiming to save parameters in models under roughly 10M parameters. The author frames the work as an early, small-scale exploration, limited by old hardware and restricted access to larger compute.
ChatGPT vs Doubao on Gaokao Math
量子位 QbitAI · 6 days ago · Benchmark
The article appears to test ChatGPT and Doubao on Chinese Gaokao math problems. Since the original text is unavailable, the exact questions, prompts, scores, and winner cannot be verified. It should be treated as a media-style AI capability comparison rather than a rigorous, reproducible benchmark.
Sponsor OpenAI Codex Voucher Usage for the OpenAI Challenge
Hugging Face Blog · 7 days ago · Tutorial
This Hugging Face Blog entry appears to relate to sponsor vouchers for the Build Small Hackathon, specifically OpenAI Codex voucher usage. Because the original body text is unavailable, details such as eligibility, value, deadlines, and supported tools cannot be confirmed. It is best treated as a likely participant guide rather than a major product announcement.
Show HN: Lathe - Use LLMs to learn a new domain, not skip past it
Hacker News (AI keywords) · 7 days ago · New Tool
Lathe is an open-source tool for generating hands-on technical tutorials with LLM skills. It combines a Go CLI, local reading UI, and commands for asking questions, extending tutorials, and verifying outputs. The project supports Claude Code, Cursor, and Codex workflows, with an emphasis on learning by typing and reasoning through the material yourself.
Tiny hackable CUDA language model implementation
Hacker News (AI keywords) · 9 days ago · New Tool
This GitHub project implements a compact generative pretrained transformer as an autoregressive byte-level sequence model. Its README describes causal self-attention, RoPE, feed-forward layers, AdamW, cross-entropy training, and BLAS/OpenBLAS-backed matrix operations, with the CUDA toolkit listed in the setup steps. It is most useful as an educational and experimental codebase, not as a production-grade replacement for large commercial LLMs.
Ask HN: What is your (AI) dev tech stack / workflow?
Hacker News (AI keywords) · 9 days ago · Commentary
An Ask HN thread asks developers to share their current AI-assisted development setup for upcoming in-person workshops. The author wants guidance for beginners and working developers, with use cases ranging from static sites to FastAPI tools and Linux home automation. Replies cover Claude Code, Cursor, GitHub Copilot, VSCode, spec-driven development, TDD, multi-agent workflows, reviews, and quality control.
Show HN: Boxes.dev: ditch localhost; run Claude Code and Codex in the cloud
Hacker News (AI keywords) · 10 days ago · New Tool
Boxes.dev appeared on Hacker News as a Show HN post, positioning itself as a way to move Claude Code and Codex workflows from localhost to the cloud. Based only on the title, it seems aimed at cloud development or remote agent execution. The provided source does not include details on architecture, pricing, security, integrations, or limitations.
How LLMs Actually Work
Hacker News (AI keywords) · 10 days ago · Tutorial
The article explains how modern LLMs convert text into token IDs, embeddings, and position-aware vectors before passing them through stacked transformer blocks. It covers attention, multi-head attention, KV cache, GQA, feed-forward networks, MoE, residual streams, normalization, and decoding. Its goal is educational: helping readers understand the common architecture behind many current model families and read model cards or papers more confidently.
AI Agent Guidelines for CS336 at Stanford
Hacker News (AI keywords) · 13 days ago · Ethics
Stanford CS336’s CLAUDE.md sets boundaries for AI coding assistants such as ChatGPT, Claude Code, GitHub Copilot, and Cursor. Agents may explain concepts, review student-written code, suggest debugging checks, and point to course materials. They should not write code, complete TODOs, edit repositories, run shell commands, or implement core assignment components for students.
A Sign of the Future: GPT-5.5 and the Next Step in AI's Exponential Growth ★ 85
One Useful Thing (Mollick) · 51 days ago · Commentary
Wharton School professor Ethan Mollick, writing in his newsletter "One Useful Thing," has published an analysis of GPT-5.5. He describes…
The Shape of the Thing: The Current Stage of AI and the Outlook Ahead ★ 85
One Useful Thing (Mollick) · 94 days ago · Opinion
Wharton School professor Ethan Mollick, in his latest article "The Shape of the Thing," sketches out a clear picture of the current state of AI technological…
A Guide to Choosing an AI in the Age of Agents: No Longer Just Chatbots ★ 85
One Useful Thing (Mollick) · 116 days ago · Tutorial
Ethan Mollick, in his latest article, points out that we have moved beyond the era of simple "Chatbots" and entered what he…
A Practical AI Guide for Late 2025: Ethan Mollick's Subjective Usage Recommendations ★ 85
One Useful Thing (Mollick) · 237 days ago · Tutorial
Wharton School professor Ethan Mollick has put together a highly personal and practical operating guide for the AI landscape of late 2025. He emphasizes that…
Working with Wizards: Verifying AI's Magic on the Jagged Technological Frontier ★ 85
One Useful Thing (Mollick) · 275 days ago · Opinion
University of Pennsylvania Wharton School professor Ethan Mollick, in his latest article, compares the experience of collaborating with generative AI (such as…
Mass Intelligence: From GPT-5 to Small Edge Models, Powerful AI Is Going Mainstream ★ 85
One Useful Thing (Mollick) · 289 days ago · Opinion
In this article exploring "Mass Intelligence," University of Pennsylvania Wharton School professor Ethan Mollick reveals an imminent future: high-level…
Using AI Right Now: A Practical Quick Guide (by Ethan Mollick) ★ 85
One Useful Thing (Mollick) · 356 days ago · Tutorial
University of Pennsylvania Wharton School professor Ethan Mollick recently published an extremely practical AI quick guide, "Using AI Right Now: A Quick…
The Recent History of AI in 32 Otters: Three Years of Visual Evolution and Technical Leaps
One Useful Thing (Mollick) · 377 days ago · Commentary
University of Pennsylvania Wharton School professor Ethan Mollick, in his well-known blog "One Useful Thing," published a visually striking and thoroughly…
How to Build Your Own AI Life Narrator (Using David Attenborough as an Example)
Replicate Blog · 921 days ago · Tutorial
This technical tutorial from Replicate was inspired by a viral project from developer Charlie Holtz. The project demonstrates how to use a computer's webcam to…
Illustrating Reinforcement Learning from Human Feedback (RLHF): The Key Alignment Technique Behind ChatGPT ★ 85
Hugging Face Blog · 1,283 days ago · Tutorial
The release of ChatGPT in late 2022 triggered an explosion in generative AI, and the most critical technology behind it is Reinforcement Learning from Human…
