
Testing MySQL Pagination Performance on Large Datasets

This article demonstrates how to create a 10‑million‑row MySQL table, batch‑insert the data with a stored procedure, and measure ordinary LIMIT pagination against offset‑optimized queries, along with practical tips for improving query speed on massive tables.

Architect's Guide

The author opens with an interview question about querying ten million rows, notes that populating such a table by hand is impractical, and generates the data with a fast database script instead.

Data preparation: a CREATE TABLE `user_operation_log` (...) statement defines a table with many VARCHAR columns and an auto‑increment primary key.
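The full DDL is not reproduced in the summary; a minimal sketch consistent with the column names used by the insert script below might look like this (the column types, sizes, engine, and charset are assumptions, not the article's exact schema):

```sql
-- Hypothetical reconstruction: column names come from the INSERT
-- statement in the batch script; types and sizes are assumed.
CREATE TABLE `user_operation_log` (
  `id`      INT NOT NULL AUTO_INCREMENT,
  `user_id` VARCHAR(20)  DEFAULT NULL,
  `ip`      VARCHAR(20)  DEFAULT NULL,
  `op_data` VARCHAR(255) DEFAULT NULL,
  `attr1`   VARCHAR(255) DEFAULT NULL,
  `attr2`   VARCHAR(255) DEFAULT NULL,
  `attr3`   VARCHAR(255) DEFAULT NULL,
  `attr4`   VARCHAR(255) DEFAULT NULL,
  `attr5`   VARCHAR(255) DEFAULT NULL,
  `attr6`   VARCHAR(255) DEFAULT NULL,
  `attr7`   VARCHAR(255) DEFAULT NULL,
  `attr8`   VARCHAR(255) DEFAULT NULL,
  `attr9`   VARCHAR(255) DEFAULT NULL,
  `attr10`  VARCHAR(255) DEFAULT NULL,
  `attr11`  VARCHAR(255) DEFAULT NULL,
  `attr12`  VARCHAR(255) DEFAULT NULL,
  PRIMARY KEY (`id`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8mb4;
```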

The batch insert script (a MySQL stored procedure) inserts ten million rows in batches of 1,000, committing after each batch:

DELIMITER ;;
CREATE PROCEDURE batch_insert_log()
BEGIN
    DECLARE i INT DEFAULT 1;
    DECLARE userId INT DEFAULT 10000000;
    -- Disable autocommit so the COMMIT below actually groups each
    -- 1,000-row batch into a single transaction.
    SET autocommit = 0;
    SET @execSql = 'INSERT INTO `test`.`user_operation_log`(`user_id`, `ip`, `op_data`, `attr1`, `attr2`, `attr3`, `attr4`, `attr5`, `attr6`, `attr7`, `attr8`, `attr9`, `attr10`, `attr11`, `attr12`) VALUES';
    SET @execData = '';
    WHILE i <= 10000000 DO
        -- "测试很长很长...的属性" means "a very long ... test attribute".
        SET @attr = "'测试很长很长...的属性'";
        -- '用户登录操作' means 'user login operation'. Twelve @attr values
        -- are needed to match the attr1..attr12 columns above.
        SET @execData = concat(@execData, "(", userId + i, ", '10.0.69.175', '用户登录操作'", ",", @attr, ",", @attr, ",", @attr, ",", @attr, ",", @attr, ",", @attr, ",", @attr, ",", @attr, ",", @attr, ",", @attr, ",", @attr, ",", @attr, ")");
        IF i % 1000 = 0 THEN
            -- Every 1,000 rows: build, prepare, and run one multi-row INSERT.
            SET @stmtSql = concat(@execSql, @execData, ";");
            PREPARE stmt FROM @stmtSql;
            EXECUTE stmt;
            DEALLOCATE PREPARE stmt;
            COMMIT;
            SET @execData = "";
        ELSE
            SET @execData = concat(@execData, ",");
        END IF;
        SET i = i + 1;
    END WHILE;
END;;
DELIMITER ;
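Once defined, the procedure is invoked with a plain CALL; the `test` database name matches the fully qualified table in the INSERT statement above:

```sql
-- Run the batch insert; with 1,000-row batches this issues
-- 10,000 multi-row INSERT statements in total.
USE test;
CALL batch_insert_log();
-- Optionally drop the procedure once the data is loaded.
DROP PROCEDURE IF EXISTS batch_insert_log;
```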

After inserting about 3.148 million rows (hardware limits prevented the full ten million), the author records three timings for counting the rows (≈38 minutes) and three timings for a simple LIMIT query (≈50 ms each).

Ordinary pagination tests use MySQL's SELECT * FROM `user_operation_log` LIMIT offset, rows with varying offsets and page sizes. The results show that latency grows with both the offset and the number of rows returned.
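The test queries take this shape; the specific offsets and page sizes here are illustrative (the article sweeps a range of both):

```sql
-- Small offset: fast, MySQL only skips 100 rows before returning 100.
SELECT * FROM `user_operation_log` LIMIT 100, 100;

-- Large offset: slow, MySQL must read and discard the first
-- 1,000,000 rows before it can return the requested 100.
SELECT * FROM `user_operation_log` LIMIT 1000000, 100;
```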

Offset optimization: the author demonstrates a subquery technique that first locates the starting id, then fetches rows with WHERE id >= (SELECT id FROM ... LIMIT offset, 1) LIMIT rows, which leverages the primary‑key index and sharply reduces time for large offsets. An alternative using IN (SELECT id FROM ... LIMIT ...) is provided for non‑incremental ids.
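With an illustrative offset of 1,000,000 and a page size of 100, the two variants can be sketched as follows; the first assumes id is continuously ascending, and the second wraps the LIMIT in a derived table because MySQL does not accept LIMIT directly inside an IN subquery:

```sql
-- Variant 1: seek the starting id via the primary-key index,
-- then read forward. Assumes ids are continuous and ascending.
SELECT * FROM `user_operation_log`
WHERE id >= (SELECT id FROM `user_operation_log` LIMIT 1000000, 1)
LIMIT 100;

-- Variant 2: for non-incremental ids, select the page of ids first.
-- The derived table t works around MySQL's restriction on
-- "LIMIT & IN/ALL/ANY/SOME subquery".
SELECT * FROM `user_operation_log`
WHERE id IN (
    SELECT t.id
    FROM (SELECT id FROM `user_operation_log` LIMIT 1000000, 100) AS t
);
```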

Another method restricts the query to an id range with WHERE id BETWEEN start AND end or WHERE id >= start, which is extremely fast because it avoids scanning past the offset entirely.
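The id-range forms might look like this for the same illustrative page (the boundary values are assumptions and only return a clean page when ids are contiguous):

```sql
-- Jump straight to the page by primary-key range; no offset scan.
SELECT * FROM `user_operation_log` WHERE id BETWEEN 1000001 AND 1000100;

-- Equivalent "seek" form, convenient for next-page navigation when
-- the last id of the previous page is known.
SELECT * FROM `user_operation_log` WHERE id >= 1000001 LIMIT 100;
```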

Large‑dataset optimization: the author shows that selecting fewer columns dramatically improves performance, comparing SELECT * with an explicit column list over one million rows. The takeaway is to avoid SELECT * and retrieve only the fields you need.
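The comparison can be sketched as below; the exact column list in the article is not reproduced here, so the narrow query's fields are illustrative:

```sql
-- Full-row fetch: every wide VARCHAR column is read and shipped
-- to the client for all one million rows.
SELECT * FROM `user_operation_log` LIMIT 1, 1000000;

-- Needed columns only: far less data read and transferred.
SELECT id, user_id, ip FROM `user_operation_log` LIMIT 1, 1000000;
```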

Finally, the article wraps up with a reminder to practice these techniques, a call to share the post, and attribution to the original source.

Tags: SQL, Performance Testing, MySQL, Pagination, Index Optimization, Large Data
Written by Architect's Guide

Dedicated to sharing programmer-architect skills—Java backend, system, microservice, and distributed architectures—to help you become a senior architect.
