Pagination in APIs: Efficient Data Retrieval for Developers

What is Pagination in APIs?

Pagination is the process of dividing large sets of data into smaller, discrete chunks (called “pages”) to make it easier to manage and retrieve. When a client makes a request to an API for a large dataset, instead of retrieving all the data at once, the server breaks the data into pages. Each page contains a subset of the data, and the client can request specific pages as needed.

For example, an API might return a list of products, but instead of returning all the products at once (which could be thousands), the API returns only a specific number of products per page (e.g., 10 products per page). The client can then request the next page of products by providing page numbers or other identifiers.

Why is Pagination Important?

Improved Performance: Retrieving large datasets in a single request can overload the system, slow down response times, and increase memory usage. Pagination reduces the strain on the server by limiting the amount of data returned in each request.
Reduced Bandwidth Usage: By transferring smaller chunks of data, pagination reduces the bandwidth required, which is particularly important for mobile devices or slow network connections.
Better User Experience: Pagination improves the user experience by displaying data in a structured way. Users can navigate through data efficiently without having to wait for long response times.
API Scalability: As the amount of data grows, pagination helps APIs scale more effectively by ensuring that the server isn’t overwhelmed with requests for huge datasets.

Types of Pagination

There are several approaches to implementing pagination in APIs. The most common ones are:

1. Offset-Based Pagination

Offset-based pagination uses two parameters: limit (or page_size) and offset (or page). The limit defines how many items to return per page, while the offset indicates the starting point for the data.

Example:

GET /api/products?limit=10&offset=20

Here:

limit=10 specifies that only 10 products should be returned.
offset=20 means that the data should start from the 21st product (the first 20 are skipped).

Pros:

Simple to implement and widely used.
Easy to understand for developers.

Cons:

Offset values can become inefficient with large datasets. The server needs to process all the items before the offset value, which can lead to performance issues as the dataset grows.

2. Cursor-Based Pagination

Cursor-based pagination uses a unique identifier (the “cursor”) to mark the starting point of the next page. Instead of using a numeric offset, the cursor points to a specific item, and the next page retrieves data after that point.

Example:

GET /api/products?cursor=abc123&limit=10

In this case, the cursor represents the unique identifier of the last item on the previous page. The next page will start from that item and return the next 10 products.

Pros:

More efficient than offset-based pagination for large datasets.
The cursor provides better performance because the server doesn’t need to skip items, just retrieve data after the cursor.

Cons:

Slightly more complex to implement, as you need to manage and return unique cursors with each response.

3. Keyset Pagination

Keyset pagination is similar to cursor-based pagination but uses a key (e.g., an ID or timestamp) to define the order and starting point for the next page. It’s ideal for sorted datasets where data is continuously being updated (e.g., new records added).

Example:

GET /api/products?last_id=25&limit=10

Here, last_id=25 means the page will start from the product with ID 25, and the next 10 products after that will be returned.

Pros:

Very efficient for large datasets and frequently updated data.
Doesn’t require tracking large offsets or cursors.

Cons:

Can only be used if the data is sorted by a key (e.g., ID, date).

How to Implement Pagination in an API

Let’s explore how to implement pagination in an API using Node.js with the Express framework. We will focus on offset-based and cursor-based pagination.

1. Offset-Based Pagination Example (Node.js & Express)

const express = require('express');
const app = express();

// Mock data (list of products)
const products = Array.from({ length: 100 }).map((_, i) => ({
  id: i + 1,
  name: `Product ${i + 1}`,
}));

app.get('/api/products', (req, res) => {
  const limit = parseInt(req.query.limit) || 10;
  const offset = parseInt(req.query.offset) || 0;

  const paginatedProducts = products.slice(offset, offset + limit);
  res.json({
    data: paginatedProducts,
    meta: {
      total: products.length,
      limit,
      offset,
    },
  });
});

app.listen(3000, () => {
  console.log('Server running on port 3000');
});

Explanation:

limit controls how many products to return per page.
offset determines where to start in the dataset.
The server responds with a subset of products based on the offset and limit parameters.

2. Cursor-Based Pagination Example (Node.js & Express)

const express = require('express');
const app = express();

// Mock data (list of products)
const products = Array.from({ length: 100 }).map((_, i) => ({
  id: i + 1,
  name: `Product ${i + 1}`,
}));

app.get('/api/products', (req, res) => {
  const limit = parseInt(req.query.limit) || 10;
  const cursor = parseInt(req.query.cursor) || 0;

  const startIndex = products.findIndex((product) => product.id === cursor);
  const paginatedProducts = products.slice(startIndex + 1, startIndex + 1 + limit);
  
  const nextCursor = paginatedProducts.length > 0 ? paginatedProducts[paginatedProducts.length - 1].id : null;

  res.json({
    data: paginatedProducts,
    meta: {
      next_cursor: nextCursor,
    },
  });
});

app.listen(3000, () => {
  console.log('Server running on port 3000');
});

Explanation:

cursor represents the ID of the last item in the previous page.
The server returns products starting from the product with the given cursor.
The next_cursor is returned, which is the ID of the last item in the current page. This cursor is used to request the next page.

Parthik Dahima

M.Sc. (Information Technology). I explain AI, AGI, Programming and future technologies in simple language. Founder of BoxOfLearn.com.