Blockchain 12 min read

Effectively Generating Vulnerable Transaction Sequences in Smart Contracts with Reinforcement Learning‑Guided Fuzzing

This paper presents a reinforcement‑learning‑based fuzzer (RLF) that generates transaction sequences likely to trigger smart‑contract vulnerabilities, combining vulnerability‑driven and coverage‑driven rewards to improve detection efficiency and outperform existing state‑of‑the‑art tools.

AntTech

Nov 7, 2022

Effectively Generating Vulnerable Transaction Sequences in Smart Contracts with Reinforcement Learning‑Guided Fuzzing

Smart contracts are increasingly used in decentralized applications, but their growing complexity introduces hard‑to‑detect security vulnerabilities that often require specific transaction sequences to be triggered.

To address this, the authors propose RLF, a reinforcement‑learning‑guided fuzzing framework that treats fuzzing as a Markov Decision Process. An agent selects actions (function groups) based on states derived from contract execution traces, and receives a composite reward that blends vulnerability detection (binary reward) with average block coverage of function groups.

The agent’s policy is implemented with a Deep Recurrent Q‑Network (DRQN) and trained using an ε‑greedy strategy and experience replay. Actions correspond to selecting function groups categorized by state‑changing operations (Payable, Call, Store, Selfdestruct), enabling the generation of multi‑function transaction sequences that can expose complex bugs.

Experiments on a dataset of 85 vulnerable contracts (including Ether‑leaking and suicidal contracts) show that RLF discovers significantly more vulnerabilities than several state‑of‑the‑art tools, achieving 8%–69% higher detection rates within the same time budget.

The study concludes that integrating vulnerability‑oriented and coverage‑oriented rewards in a reinforcement‑learning framework effectively guides fuzzing toward transaction sequences that expose both simple and complex smart‑contract bugs.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

reinforcement learning RL-based fuzzer

Written by

AntTech

Technology is the core driver of Ant's future creation.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.