Overview of Graph Embedding Techniques: DeepWalk, LINE, node2vec, and EGES
This article provides a comprehensive overview of graph embedding methods—including DeepWalk, LINE, node2vec, and EGES—explaining their algorithms, random‑walk strategies, proximity definitions, incorporation of side information, and their applications in large‑scale recommendation systems.
1. Introduction to Graph Embedding
Graph Embedding extends Word2Vec‑style sequence embedding to graph‑structured data, generating low‑dimensional dense vectors for nodes that capture structural and local similarity.
2. DeepWalk
DeepWalk performs random walks on an item graph to produce node sequences, which are fed into Word2Vec to learn embeddings. The original paper “Billion‑scale Commodity Embedding for E‑commerce Recommendation in Alibaba” describes the workflow, including graph construction from user behavior and transition probability definitions for directed weighted graphs.
3. LINE – Improving DeepWalk
LINE replaces DFS‑based walks with BFS‑style neighborhood construction and defines first‑order and second‑order proximity. First‑order proximity models direct edge weights, while second‑order proximity models shared neighborhoods, using a weighted KL‑divergence objective and negative sampling.
4. node2vec – Balancing Homophily and Structural Equivalence
node2vec introduces biased second‑order random walks controlled by return parameter p and in‑out parameter q , allowing interpolation between breadth‑first (BFS) and depth‑first (DFS) strategies to capture both homophily and structural equivalence in the learned embeddings.
5. EGES – Enhanced Graph Embedding with Side Information
EGES extends the Base Graph Embedding (BGE) pipeline by incorporating side information (category, brand, price) with learnable weights, using a weighted skip‑gram model to alleviate cold‑start and sparsity issues in large‑scale e‑commerce recommendation.
The recommendation pipeline typically consists of matching and ranking stages, where graph embeddings are used to compute item similarity in the matching phase.
6. Summary
Graph Embedding remains a hot research and industry topic; besides DeepWalk, LINE, node2vec, and EGES, methods such as SDNE and struct2vec also merit further study.
Sohu Tech Products
A knowledge-sharing platform for Sohu's technology products. As a leading Chinese internet brand with media, video, search, and gaming services and over 700 million users, Sohu continuously drives tech innovation and practice. We’ll share practical insights and tech news here.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.