Tagged articles

7 articles

Page 1 of 1

Machine Learning Algorithms & Natural Language Processing

Apr 25, 2026 · Artificial Intelligence

ICLR 2026 Award Winners: Outstanding Papers and Alec Radford’s Test‑of‑Time Honor

ICLR 2026 announced two Outstanding Paper awards, a Honorable Mention, and two Test‑of‑Time awards—including the seminal DCGAN and DDPG papers—highlighting a 19,000‑paper submission pool with a 28% acceptance rate and showcasing new theoretical insights on Transformers and multi‑turn LLM evaluation.

DCGANDDPGICLR

0 likes · 8 min read

ICLR 2026 Award Winners: Outstanding Papers and Alec Radford’s Test‑of‑Time Honor

Bighead's Algorithm Notes

Mar 24, 2026 · Artificial Intelligence

How an Interactive Imitation‑Learning Agent Framework Trains Robust Trading Strategies

The article analyzes the simulation‑reality gap in algorithmic trading and proposes an interactive market simulator that combines a pool of imitation‑learning agents, an action‑synthesis network, and a DDPG‑based reinforcement‑learning trader, showing superior robustness and downside protection on QQQ data.

Agent-Based ModelingDDPGFinancial AI

0 likes · 16 min read

How an Interactive Imitation‑Learning Agent Framework Trains Robust Trading Strategies

Network Intelligence Research Center (NIRC)

Feb 3, 2026 · Artificial Intelligence

INCS: A DRL‑Based Intent‑Driven Network‑Wide Configuration Synthesis Framework

The article presents INCS, a novel framework that combines graph neural networks and deep reinforcement learning to achieve protocol‑agnostic, millisecond‑level, globally optimized network configuration synthesis, addressing scalability, protocol dependence, and lack of optimization in traditional SMT‑based methods, and demonstrates its superior performance on large‑scale topologies.

DDPGGraph Neural NetworkNetwork Synthesis

0 likes · 8 min read

INCS: A DRL‑Based Intent‑Driven Network‑Wide Configuration Synthesis Framework

DataFunSummit

May 16, 2022 · Artificial Intelligence

Reinforcement Learning for E‑commerce Search Ranking: RNN User State Modeling and DDPG Long‑Term Value Optimization

This presentation details how JD applied reinforcement learning—using RNN‑based user state modeling and a DDPG framework—to improve e‑commerce search ranking by optimizing long‑term cumulative value, showing significant offline and online gains in conversion and GMV.

DDPGRNNe-commerce

0 likes · 20 min read

Reinforcement Learning for E‑commerce Search Ranking: RNN User State Modeling and DDPG Long‑Term Value Optimization

Meituan Technology Team

Nov 15, 2018 · Artificial Intelligence

Reinforcement Learning for Meituan's "Guess You Like" Recommendation Ranking

Meituan enhanced its homepage “Guess You Like” recommendation slot by modeling user‑item interactions as a Markov Decision Process and applying an improved DDPG reinforcement‑learning agent that adjusts the ranking trade‑off parameter, uses advantage‑based Q decomposition, shares actor‑critic weights, and runs in a real‑time TensorFlow pipeline, delivering consistent lifts in click‑through, dwell time, and depth.

DDPGMDP ModelingOnline Learning

0 likes · 21 min read

Reinforcement Learning for Meituan's "Guess You Like" Recommendation Ranking

Sohu Tech Products

Oct 10, 2018 · Artificial Intelligence

Optimizing News Recall with DDPG Reinforcement Learning and Transformer Architecture

This article explains how reinforcement learning, specifically the DDPG algorithm combined with Transformer-based networks, is applied to improve large‑scale news recall systems, detailing the business scenario, algorithm selection, model architecture, speed optimizations, training challenges, and observed online performance gains.

AIDDPGTransformer

0 likes · 13 min read

Optimizing News Recall with DDPG Reinforcement Learning and Transformer Architecture

Sohu Tech Products

Sep 5, 2018 · Artificial Intelligence

Reinforcement Learning Theory Overview and Its Application to News Recommendation

This article reviews reinforcement learning fundamentals, contrasts it with supervised learning, surveys major RL algorithms such as DDPG and DQN, and details how these methods can be modeled for sequential news recommendation, including system architecture, state‑action definitions, and practical challenges.

AIDDPGDQN

0 likes · 15 min read

Reinforcement Learning Theory Overview and Its Application to News Recommendation