Tag: Large Language Models (12 articles)

ChatGPT voice mode is a weaker model

Simon Willison points out that ChatGPT's voice mode runs on an older, weaker GPT-4o-era model, so its answers can fall well short of what users expect from ChatGPT.

Simon Willison · Fri, 10 Apr 2026 15:56:02 +0000

Any Custom Frontend with Gradio's Backend

Gradio.Server lets developers build the frontend in any framework they choose while keeping Gradio's backend, giving them more flexibility in how they build applications.

Hugging Face Blog · Wed, 01 Apr 2026 00:00:00 GMT

Mixture of Experts (MoEs) in Transformers

Mixture of Experts (MoE) layers are becoming common in Transformers because they improve computational efficiency and lend themselves to parallel processing, and they are shaping the evolution of large language models.

Hugging Face Blog · Thu, 26 Feb 2026 00:00:00 GMT

microgpt

Andrej Karpathy's microgpt project implements a simplified GPT model from scratch in just 200 lines of Python, an example of the trend towards minimalism in AI development.

Andrej Karpathy · Thu, 12 Feb 2026 07:00:00 +0000

Extrinsic Hallucinations in LLMs

This article examines extrinsic hallucinations in large language models: what causes them, how to detect them, and strategies for reducing them. It also highlights the risks that come with updating a model's knowledge.

Lilian Weng · Sun, 07 Jul 2024 00:00:00 +0000

Adversarial Attacks on LLMs

This article surveys adversarial attacks on large language models (LLMs), covering attack types, threat models, and their effect on the safety of generated text, and shows how much of a challenge they pose for AI safety.

Lilian Weng · Wed, 25 Oct 2023 00:00:00 +0000

LLM Powered Autonomous Agents

LLM-powered autonomous agents combine planning, memory, and tool use to handle complex tasks, which points to a significant shift in how such work gets done.

Lilian Weng · Fri, 23 Jun 2023 00:00:00 +0000

Prompt Engineering

This article covers the basics and techniques of prompt engineering, emphasizing effective communication with large language models and how the selection and ordering of examples affect model performance.

Lilian Weng · Wed, 15 Mar 2023 00:00:00 +0000

The Transformer Family Version 2.0

In this article, Lilian Weng explores in depth the evolution of Transformers and their newer features, showing their continued influence on natural language processing.

Lilian Weng · Fri, 27 Jan 2023 00:00:00 +0000
BitByAI — AI-powered, AI-evolved AI News