Streaming for Speed: Unlocking Instant UX with Next.js App Router and Server Components
Unlock blazing-fast UX with Next.js App Router & Server Components. Learn how streaming makes your app feel instant—even with slow APIs or AI-powered features.

Originally published on GeekyAnts by Ajinkya Vinayak Palaskar, Software Engineer III.
We're officially in the era of impatience. Users expect apps to feel fast, responsive, and alive — especially when they're interacting with dynamic dashboards, AI assistants, or anything that looks remotely "real-time." And here's the twist: it's not just about raw performance anymore. It's about perceived speed — how quickly the user sees something useful on screen.
This is where traditional rendering approaches like Client-Side Rendering (CSR) or even full-page Server-Side Rendering (SSR) start to feel sluggish. They either wait too long before showing anything, or dump too much JavaScript on the client.
That's where streaming comes in.
With the Next.js App Router and React Server Components, streaming isn't just possible — it's surprisingly easy. You can start sending HTML to the browser before your server is even done preparing the full UI. Combine that with React.Suspense, loading.tsx, and edge rendering, and you've got a recipe for snappy, interactive apps that feel instant, even when they're doing some heavy lifting under the hood.
If you've been struggling with slow APIs, spinner fatigue, or the complexity of building fast-feeling interfaces, this article is for you.
What is Streaming in Frontend?
Streaming in frontend is the idea of sending parts of your UI to the browser as soon as they're ready, instead of waiting for the whole page to be prepared on the server first. This means users can start seeing and interacting with content sooner — especially helpful when some parts of the page take longer to fetch or compute.
With the Next.js App Router and React Server Components, streaming becomes a lot easier to set up. The App Router is built with streaming in mind and supports layouts, loading states, and nested server components that stream in progressively.
A few key pieces make this work:
React Server Components run on the server, can fetch data directly (no useEffect), and don't add to the client-side bundle. They let you build fast, data-rich UI without sending a ton of JavaScript to the browser.
React.Suspense + loading.tsx — When a part of your page is still loading, you can wrap it in a Suspense boundary and show a fallback UI (loading.tsx) while the server fetches the actual content. As each boundary resolves, React streams it to the browser.
Streaming HTML over the network — Instead of waiting for all the data to come back, the server sends down chunks of HTML as each part of the UI finishes. The browser progressively renders those chunks, improving perceived performance.
Next.js handles most of the heavy lifting here. If you're using the App Router and the default rendering setup, you're already benefiting from streaming to some extent — especially if you're using loading.tsx in nested routes.
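To build intuition for what "chunks of HTML over the network" means, here's a minimal, framework-free sketch using the Web Streams API (available in Node 18+ and modern browsers). The producer stands in for the server, emitting the page shell first and a slower section later; the consumer stands in for the browser, handling each chunk as it lands. All names here are illustrative, not Next.js internals.

```typescript
// Framework-free sketch: a "server" stream emits the page shell first,
// then a slower section; a "browser" loop consumes chunks as they arrive.
const encoder = new TextEncoder();

const htmlStream = new ReadableStream({
  async start(controller) {
    controller.enqueue(encoder.encode('<header>App shell</header>'));
    await new Promise((resolve) => setTimeout(resolve, 50)); // simulate a slow data fetch
    controller.enqueue(encoder.encode('<section>Slow content</section>'));
    controller.close();
  },
});

async function consume(stream: ReadableStream): Promise<string[]> {
  const reader = stream.getReader();
  const decoder = new TextDecoder();
  const chunks: string[] = [];
  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    chunks.push(decoder.decode(value)); // each chunk could be painted immediately
  }
  return chunks;
}
```

The key point: the consumer gets the shell well before the slow section exists, which is exactly the gap streaming exploits for perceived speed.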
Why Streaming Matters for UX (and Business)
The main reason streaming is worth caring about is simple: users don't like waiting.
Even if your app is fast behind the scenes, if users are staring at a blank screen or a spinner while everything loads, it feels slow. That's the gap streaming helps close — by sending and rendering parts of the UI as soon as they're ready, you give users something to look at almost instantly. That small shift can make your app feel much snappier and more responsive.
Perceived Speed vs. Actual Speed
From a technical standpoint, your API might be responding in 200ms. But if the page only appears after everything is loaded, it might take a couple of seconds before anything shows up. With streaming, the browser gets the shell of the page immediately and starts filling it in as data becomes available — so it feels faster, even if the backend takes just as long.
Less JavaScript on the Client
Because React Server Components don't ship any client-side JS, you're also reducing the amount of code the browser has to download and parse. That translates to quicker page loads and less CPU usage on the client, which helps on slower devices or networks.
All of this comes back to the same principle: get something useful on screen quickly, even while the rest is still loading in the background.
Real-World Use Cases Where Streaming Shines
Streaming is especially useful in apps where parts of the UI rely on slower or conditional data. It's not about replacing SSR or CSR completely — it's about using the right tool for the right part of the page. Here are a few situations where streaming makes a noticeable difference:
1. Role-based Dashboards
In dashboards where different users see different data, fetching everything upfront can be wasteful and slow. With streaming, you can load the shared layout and sidebar immediately, then stream in role-specific content as it resolves. This avoids delaying the whole page just because one section takes longer.
2. AI-powered Interfaces (e.g. Chat, Recommendations)
AI responses often involve some latency. Instead of waiting for the full response, streaming lets you render tokens or chunks as they arrive. In a chat UI, this mimics real-time typing — improving engagement and helping users feel like something's happening even before the final result is ready.
3. Pages with Slow APIs
Not every API is fast or under your control. If one section of a page fetches from a third-party service (analytics, payments, personalization, etc.), you can stream that section in later without blocking the rest of the page from showing.
4. Geo-personalized Content
With Edge Middleware or location-aware logic, you might be personalizing content based on region, language, or device. Streaming allows the base layout and static content to load immediately, while the personalized block streams in as the logic runs.
5. Multi-section Landing Pages
Some marketing or product pages include testimonials, pricing tiers, feature lists, blog previews, and more. These don't all need to be shown at once. You can load the visible parts quickly and stream the rest in — helping with both performance and SEO.
The core idea is the same: when not everything needs to block the page, don't let it.
Hands-On: Streaming an AI Response with Next.js
To see streaming in action, let's build a minimal chat-like interface where we send a message to an AI backend (e.g., OpenAI or your own LLM) and stream the response to the UI in real time.
We'll use:
Next.js App Router
Server Actions
Readable Streams
React Suspense + loading.tsx
No extra client libraries or state managers — just the streaming logic.
Step 1: Basic Route Setup
Create a page at app/chat/page.tsx. It holds a simple message input and the component that handles the streamed response:
// app/chat/page.tsx
import { Suspense } from 'react';
import ChatStream from './chat-stream';

export default function ChatPage() {
  return (
    <main className="max-w-2xl mx-auto p-6">
      <h1 className="text-2xl font-bold mb-4">AI Chat (Streaming)</h1>
      <Suspense fallback={<p className="text-gray-400">Loading chat...</p>}>
        <ChatStream />
      </Suspense>
    </main>
  );
}
Step 2: Building the Form with a Server Action
Next, create app/chat/chat-stream.tsx. This is a Client Component that calls a server action and reads the ReadableStream response back to the UI:
// app/chat/chat-stream.tsx
'use client';
import { useState, useRef } from 'react';
import { streamAIResponse } from './actions';

export default function ChatStream() {
  const [response, setResponse] = useState('');
  const [loading, setLoading] = useState(false);
  const inputRef = useRef<HTMLInputElement>(null);

  async function handleSubmit(e: React.FormEvent) {
    e.preventDefault();
    const message = inputRef.current?.value?.trim();
    if (!message) return;

    setResponse('');
    setLoading(true);

    const stream = await streamAIResponse(message);
    const reader = stream.getReader();
    const decoder = new TextDecoder();

    while (true) {
      const { done, value } = await reader.read();
      if (done) break;
      // { stream: true } keeps multi-byte characters intact across chunk boundaries
      setResponse(prev => prev + decoder.decode(value, { stream: true }));
    }

    setLoading(false);
  }

  return (
    <div className="space-y-4">
      <form onSubmit={handleSubmit} className="flex gap-2">
        <input
          ref={inputRef}
          type="text"
          placeholder="Ask something..."
          className="flex-1 border rounded px-3 py-2"
        />
        <button
          type="submit"
          disabled={loading}
          className="px-4 py-2 bg-blue-600 text-white rounded disabled:opacity-50"
        >
          {loading ? 'Streaming…' : 'Send'}
        </button>
      </form>
      {response && (
        <div className="p-4 bg-gray-50 rounded border text-gray-800 whitespace-pre-wrap">
          {response}
          {loading && <span className="animate-pulse">▌</span>}
        </div>
      )}
    </div>
  );
}
Step 3: The Server Action with a ReadableStream
Now define the server action at app/chat/actions.ts. This simulates a streamed AI response word-by-word. You can swap this out for a real OpenAI stream when ready:
// app/chat/actions.ts
'use server';

export async function streamAIResponse(message: string): Promise<ReadableStream> {
  // Simulated AI response — replace with OpenAI streaming in production
  const fakeResponse = `You asked: "${message}". Here's a streaming response that arrives word by word, just like a real AI would send it over the wire.`;
  const words = fakeResponse.split(' ');

  const stream = new ReadableStream({
    async start(controller) {
      for (const word of words) {
        controller.enqueue(new TextEncoder().encode(word + ' '));
        // Simulate network/model delay
        await new Promise(resolve => setTimeout(resolve, 80));
      }
      controller.close();
    },
  });

  return stream;
}
Swapping in a real OpenAI stream: Replace the simulated loop with openai.chat.completions.create({ stream: true, ... }) and pipe the delta chunks from each choices[0].delta.content into the controller using the same enqueue pattern.
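That piping step can be sketched in isolation. In the snippet below, fakeDeltas is a hypothetical stand-in for the async iterable that the real OpenAI SDK call returns; only the chunk shape (choices[0].delta.content) is borrowed from the SDK, and the helper names are my own.

```typescript
// Hypothetical shape of an OpenAI streaming chunk -- only the fields we read.
type DeltaChunk = { choices: { delta: { content?: string } }[] };

// Pipe any async iterable of delta chunks into a byte stream,
// using the same enqueue pattern as the simulated server action.
function deltasToStream(deltas: AsyncIterable<DeltaChunk>): ReadableStream {
  const encoder = new TextEncoder();
  return new ReadableStream({
    async start(controller) {
      for await (const chunk of deltas) {
        const token = chunk.choices[0]?.delta?.content;
        if (token) controller.enqueue(encoder.encode(token));
      }
      controller.close();
    },
  });
}

// A fake delta iterable for local testing -- swap in the real SDK call here.
async function* fakeDeltas(): AsyncGenerator<DeltaChunk> {
  for (const token of ['Hello', ', ', 'world']) {
    yield { choices: [{ delta: { content: token } }] };
  }
}
```

Because deltasToStream only depends on the async-iterable protocol, the same function works unchanged whether the chunks come from a fake generator or a live model.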
Step 4: Adding a Loading State with loading.tsx
To show an instant fallback while the route segment renders on the server, add a loading.tsx alongside the page:
// app/chat/loading.tsx
export default function Loading() {
  return (
    <div className="max-w-2xl mx-auto p-6">
      <div className="h-6 w-32 bg-gray-200 rounded animate-pulse mb-4" />
      <div className="h-10 w-full bg-gray-100 rounded animate-pulse" />
    </div>
  );
}
Test It Out
Run next dev, navigate to /chat, type a message, and submit. You'll see the AI response stream in word by word, in real time. No client-side polling, no spinner waiting for the full response — just streamed content from server to UI.
Tips, Gotchas & Performance Notes
Streaming in Next.js works really well, but like anything powerful, it comes with a few trade-offs. Here are things worth keeping in mind before going all-in.
Not Everything Needs Streaming
Just because you can stream something doesn't mean you should. For example:
Static content that loads instantly doesn't benefit much
Simple components without data fetching don't really need Suspense boundaries
Streaming every little component separately can overcomplicate your layout loading logic
Use it where latency is noticeable, or where parts of the page depend on slow or dynamic data. Otherwise, you might be adding complexity without any real UX gain.
Watch Out for Mismatched Client/Server Boundaries
With Server Components, streaming usually involves wrapping parts of the UI in Suspense and using loading.tsx. But once you mix in use client components (like forms, buttons, charts, etc.), make sure your component boundaries are well-defined.
One common mistake is importing a use client component too deep in your tree, which forces everything above it to become a client component — killing the streaming benefit.
Debugging Streaming in Dev Mode
Streaming can feel a bit magical, which makes debugging tricky. Some tips:
In development, streaming can be slower or behave slightly differently than in production. Don't judge performance too early.
In Chrome DevTools → Network tab, look at the HTML response — you'll see chunks arriving progressively.
Use console.log() carefully in server components; their output goes to the server terminal, not the browser console.
Consider Edge Rendering (with Caution)
Streaming also works with Edge Functions in Next.js, which can reduce latency by serving users from closer regions. But Edge runtimes have stricter limits (e.g., no large Node APIs, different streaming behaviour in some cases). Test thoroughly before deploying to production.
Caching and Revalidation Still Matter
Streaming doesn't replace caching. If your streamed components fetch expensive or slow data, you should still use caching strategies like:
const data = await fetch('/api/data', { next: { revalidate: 60 } });
Streaming helps with perceived speed, but good caching helps with actual speed.
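To make the distinction concrete, here's a tiny, illustrative time-based cache — a sketch of the idea behind revalidate: 60, not Next.js internals. It reuses a result until the result is older than a given TTL, so an expensive fetch behind a streamed component runs at most once per window.

```typescript
// Illustrative sketch only -- a minimal TTL cache mimicking the spirit of
// `revalidate`: reuse the cached value until it is older than ttlMs.
function cached<T>(fetcher: () => Promise<T>, ttlMs: number): () => Promise<T> {
  let value: T | undefined;
  let fetchedAt = 0;
  return async () => {
    if (value === undefined || Date.now() - fetchedAt >= ttlMs) {
      value = await fetcher(); // expensive call runs at most once per TTL window
      fetchedAt = Date.now();
    }
    return value;
  };
}
```

In Next.js itself you'd lean on the built-in fetch cache shown above rather than rolling your own, but the mental model — streaming hides latency, caching removes it — is the same.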
Final Thoughts: What's Next for Streaming in React
Streaming isn't just a neat trick — it's quickly becoming a core part of how modern React apps are built. Between React Server Components, the App Router, and server-first patterns like Server Actions, streaming makes it possible to build apps that feel fast without relying heavily on the client.
We're seeing a clear shift toward progressive rendering, where parts of the UI load as they're ready, instead of waiting for everything upfront. That's better for user experience, better for performance, and in many cases, better for business too.
If you're working on apps that deal with AI responses, dashboards, personalization, or slow APIs, this is worth exploring. You don't have to fully rearchitect your app either. Even small changes — like adding a loading.tsx to a layout or streaming a slow component — can have a big impact.
Streaming isn't a silver bullet, but when used right, it's a strong tool to make your UI feel faster, lighter, and more responsive.