Tag: Deep Learning (5 articles)

Welcome Gemma 4: Frontier multimodal intelligence on device

Gemma 4 introduces enhanced multimodal capabilities, supporting image, text, and audio inputs, significantly improving model intelligence and deployment flexibility across devices.

Hugging Face Blog · Thu, 02 Apr 2026 00:00:00 GMT

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Ulysses Sequence Parallelism addresses the challenges of training large language models with long sequences, significantly enhancing the capability to process million-token contexts.

Hugging Face Blog · Mon, 09 Mar 2026 00:00:00 GMT

Diffusion Models for Video Generation

The application of diffusion models in video generation reveals challenges in temporal consistency and data requirements.

Lilian Weng · Fri, 12 Apr 2024 00:00:00 +0000

Thinking about High-Quality Human Data

High-quality human data is crucial for modern deep learning model training, and this article explores the factors influencing data quality and methods for optimization.

Lilian Weng · Mon, 05 Feb 2024 00:00:00 +0000

Deep Neural Nets: 33 years ago and 33 years from now

Karpathy reproduces the 1989 LeCun paper on deep learning, revealing the evolution of deep learning technology and potential future directions.

Andrej Karpathy · Mon, 14 Mar 2022 07:00:00 +0000