Tagged articles

17 articles

Page 1 of 1

Feb 11, 2026 · Artificial Intelligence

How Ximalaya Used Generative AI to Revolutionize Audio Recommendations

This article details Ximalaya's journey from traditional multi‑stage recommendation pipelines to generative AI‑driven models, covering business challenges, architectural and model differences, phased deployments, knowledge distillation, semantic ID encoding, decoder‑only strategies, extensive offline and online evaluations, and future research directions.

Encoder-DecoderKnowledge DistillationRecommendation Systems

0 likes · 24 min read

How Ximalaya Used Generative AI to Revolutionize Audio Recommendations

AI Cyberspace

Feb 11, 2026 · Artificial Intelligence

From RNNs to LSTMs and GRUs: A Hands‑On Guide to Sequence Modeling in PyTorch

This tutorial explains the nature of sequential data, why traditional feed‑forward networks struggle with it, and how recurrent architectures such as RNN, LSTM, and GRU capture temporal dependencies, complete with mathematical foundations, training algorithms, and full PyTorch implementations for sentiment analysis, text generation, and encoder‑decoder models.

Encoder-DecoderGRULSTM

0 likes · 57 min read

From RNNs to LSTMs and GRUs: A Hands‑On Guide to Sequence Modeling in PyTorch

JD Tech Talk

Oct 27, 2025 · Artificial Intelligence

How Large Language Models Are Revolutionizing Generative Recommendation Systems

Over the past year, generative recommendation has made substantial progress by leveraging large language models' powerful sequence modeling and reasoning abilities, introducing a new paradigm that replaces complex handcrafted features, addresses traditional recommendation bottlenecks, and outlines the evolution, core technologies, engineering challenges, and future directions of LLM‑based recommendation systems.

AI EngineeringEncoder-DecoderLLM

0 likes · 29 min read

How Large Language Models Are Revolutionizing Generative Recommendation Systems

Wu Shixiong's Large Model Academy

Oct 23, 2025 · Artificial Intelligence

Why the Transformer Core Structure Is the Key to AI Interview Success

This article explains the fundamental purpose, architecture, and variants of the Transformer model—including Encoder‑Decoder, Encoder‑only, and Decoder‑only designs—while detailing how attention mechanisms work and why modern large‑language models favor the Decoder‑only approach, providing a concise framework for answering interview questions.

AI InterviewEncoder-DecoderLarge Language Model

0 likes · 10 min read

Why the Transformer Core Structure Is the Key to AI Interview Success

Cognitive Technology Team

Jun 29, 2025 · Artificial Intelligence

Understanding Transformers: Core Mechanics Behind Modern AI Models

This article demystifies the Transformer architecture for beginners, explaining its relationship to large models, the self‑attention and multi‑head attention mechanisms, positional encoding, and the roles of Encoder and Decoder components, using clear analogies and visual diagrams to aid comprehension.

Artificial IntelligenceEncoder-DecoderPositional Encoding

0 likes · 20 min read

Understanding Transformers: Core Mechanics Behind Modern AI Models

AI Algorithm Path

Apr 5, 2024 · Artificial Intelligence

Master CNN, RNN, GAN, and Transformer Architectures in One Guide

This article provides a friendly, step‑by‑step overview of five core deep‑learning architectures—CNN, RNN, GAN, Transformers, and encoder‑decoder—explaining their structures, key components, and typical use cases in image and natural‑language processing.

CNNEncoder-DecoderGaN

0 likes · 12 min read

Architect

Mar 19, 2024 · Artificial Intelligence

How Transformers Power Modern NLP: A Deep Dive into Encoder‑Decoder Mechanics

This article explains the core principles of Transformer models—covering input embeddings, self‑attention, multi‑head attention, positional encoding, feed‑forward networks, and decoder strategies—using concrete examples like "The cat sat on the mat" and "The quick brown fox jumps over the lazy dog" to illustrate each step.

Encoder-DecoderFeed-Forward NetworkNLP

0 likes · 13 min read

How Transformers Power Modern NLP: A Deep Dive into Encoder‑Decoder Mechanics

Rare Earth Juejin Tech Community

Nov 15, 2023 · Artificial Intelligence

Understanding the Transformer Architecture: Encoder, Decoder, and Attention Mechanisms

This article explains the Transformer model, comparing it with RNNs, detailing its encoder‑decoder structure, multi‑head and scaled dot‑product attention, embedding layers, feed‑forward networks, and the final linear‑softmax output, supplemented with diagrams and code examples.

Artificial IntelligenceEncoder-DecoderTransformer

0 likes · 10 min read

Understanding the Transformer Architecture: Encoder, Decoder, and Attention Mechanisms

Sohu Tech Products

Jul 26, 2023 · Artificial Intelligence

Attention Mechanism, Transformer Architecture, and BERT: An In-Depth Overview

This article provides a comprehensive overview of the attention mechanism, its mathematical foundations, the transformer model architecture—including encoder and decoder components—and the BERT pre‑training model, detailing their principles, implementations, and applications in natural language processing.

BERTEncoder-DecoderNLP

0 likes · 13 min read

Attention Mechanism, Transformer Architecture, and BERT: An In-Depth Overview

Rare Earth Juejin Tech Community

Oct 11, 2022 · Artificial Intelligence

GANomaly: Theory and Source Code Analysis

This article explains the GANomaly model for semi‑supervised anomaly detection, detailing its generator‑encoder‑discriminator architecture, loss functions, testing phase scoring, and provides annotated PyTorch source code to help readers implement and understand the approach.

Anomaly DetectionEncoder-DecoderGaN

0 likes · 15 min read

GANomaly: Theory and Source Code Analysis

Alimama Tech

Jun 22, 2022 · Artificial Intelligence

Learning Pixel-Level Distinctions for Video Highlight Detection

The Alibaba Mom Creative & Video Platform team introduces PLD‑VHD, a pixel‑level distinction learning framework that uses a 3D CNN encoder‑decoder with temporal and saliency modules to detect highlights, achieving state‑of‑the‑art results on public benchmarks and a 4,724‑video e‑commerce dataset, and boosting ad revenue through precise clipping and cropping.

Encoder-Decodereye trackingpixel-level distinction

0 likes · 11 min read

Learning Pixel-Level Distinctions for Video Highlight Detection

Youku Technology

Apr 2, 2022 · Artificial Intelligence

Constrained Sequence-to-Tree Generation for Hierarchical Text Classification

At SIGIR 2022, the authors present a constrained Seq2Tree model that transforms hierarchical label taxonomies into preorder sequences and applies dynamic‑dictionary decoding to ensure label consistency, achieving superior hierarchical text classification performance on benchmark datasets and real‑world deployment within Alibaba Entertainment’s AI Brain.

Artificial IntelligenceEncoder-DecoderHierarchical Text Classification

0 likes · 5 min read

Constrained Sequence-to-Tree Generation for Hierarchical Text Classification

Code DAO

Dec 25, 2021 · Artificial Intelligence

Image Captioning with Attention in TensorFlow 2.0: An End-to-End Encoder-Decoder Tutorial

This article walks through building an image‑captioning system using a TensorFlow 2.0 encoder‑decoder with Bahdanau attention, covering dataset preparation, feature extraction with InceptionV3, model architecture, training with teacher forcing, and inference on the Flickr8K dataset.

Encoder-DecoderFlickr8kImage Captioning

0 likes · 20 min read

Image Captioning with Attention in TensorFlow 2.0: An End-to-End Encoder-Decoder Tutorial

DataFunTalk

Jun 22, 2021 · Artificial Intelligence

Survey of Graph Neural Networks for Natural Language Processing

This comprehensive survey reviews the latest research on graph neural networks applied to natural language processing, covering graph construction methods, graph representation learning techniques, encoder‑decoder models, static and dynamic graph building, and discusses challenges, benchmarks, and future directions in the field.

Encoder-DecoderGraph ConstructionNLP

0 likes · 57 min read

Survey of Graph Neural Networks for Natural Language Processing

Meituan Technology Team

May 27, 2021 · Artificial Intelligence

Technical Solutions for Meal Combo Recommendation in Food Delivery

Meituan Waimai tackles long ordering decisions and low merchant combo creation by deploying an offline‑real‑time hybrid system that generates high‑quality meal combos using graph‑label induction, encoder‑decoder, and attention models, reinforced with quality classification and constraint pruning, boosting combo coverage and user experience.

AIEncoder-Decoderknowledge graph

0 likes · 20 min read

Technical Solutions for Meal Combo Recommendation in Food Delivery

Cyber Elephant Tech Team

Apr 28, 2021 · Artificial Intelligence

Understanding BERT: From Encoder-Decoder to Transformer and Attention

This article explains the BERT model by first reviewing the Encoder-Decoder framework, then detailing the attention mechanism—including self-attention and multi-head attention—before describing the full Transformer architecture and finally outlining BERT’s encoder-only design, training stages, and fine-tuning applications.

BERTEncoder-DecoderNLP

0 likes · 15 min read

Understanding BERT: From Encoder-Decoder to Transformer and Attention

Hulu Beijing

Dec 14, 2017 · Artificial Intelligence

Understanding Seq2Seq: Framework, Advantages, and Decoding Techniques

This article explains the Seq2Seq encoder‑decoder framework, its benefits for various sequence modeling tasks, and compares common decoding strategies such as greedy search and beam search, while also introducing attention and other enhancements for improved performance.

Beam SearchEncoder-Decoderattention

0 likes · 9 min read

Understanding Seq2Seq: Framework, Advantages, and Decoding Techniques