Record Your Screen With System and Mic Audio

You hit record, walk through the bug, narrate the fix — and then you watch it back and the video is silent where the app should be talking. The button click made a sound. The video you were reacting to had music. None of it is in the file. Just your voice, floating over a muted screen.

This is the single most common screen-recording problem, and it has a specific cause: most free recorders only open your microphone. They never capture the sound your computer is actually playing. This guide explains the difference between system audio and microphone audio, why capturing system sound is genuinely harder than it should be on Windows, and how to record both at once — cleanly, in one file, without a single extra driver.

System audio vs. microphone: two completely different sources

When people say "record screen with audio," they usually mean two different things at once, and a recorder has to handle each separately:

Microphone (input): your voice. This is the easy one. Every recorder can open a mic input. It's the same stream a voice call uses.
System / desktop audio (output): everything the computer plays out — the app you're demoing, the YouTube video you're reacting to, a Zoom call, a game, a notification chime. This is the stream going to your speakers or headphones.

A microphone is an input device, and recorders are built to read inputs. System audio is an output — sound leaving the computer — and reading an output stream is a fundamentally different operation. That asymmetry is why "record computer sound and screen" trips up so many tools: they were designed around the input and treat the output as an afterthought, or skip it entirely.

Why capturing internal audio is hard on Windows

To record what your computer is playing, software has to do something called loopback: instead of listening to a microphone, it taps the audio device's render stream — the mixed-down signal on its way to the speakers — and copies it. Historically, Windows made this awkward.

For years the only built-in option was Stereo Mix, a virtual recording device some sound cards exposed. It worked, sometimes, but plenty of modern laptops and USB headsets simply don't offer it. When Stereo Mix isn't there, people reach for a virtual audio cable — software like VB-Cable that creates a fake output device, so you route your system sound into it and then record that. It works, but it's an extra install, it adds routing you have to set up correctly, and it's easy to break: change your output device and the recording goes silent again.

The modern, correct answer is WASAPI loopback. WASAPI (Windows Audio Session API) lets an application open the existing output device in loopback mode and read the exact audio already being played — no virtual device, no rerouting, nothing for the user to configure. The plumbing has been in Windows for a long time; the catch is that the recorder has to actually use it. Many don't.

How Maxisnap records both, driver-free

Maxisnap captures system audio through WASAPI loopback and your microphone at the same time, and by default mixes them into a single track so you hear everything — the app, the video, and your narration — together, the way a viewer expects. There's no Stereo Mix to enable, no virtual cable to install, no driver to chase down. You start recording and the sound is just there.

The recording itself is straightforward. Two hotkeys, both remappable:

Ctrl+Alt+4 — record the full screen (the monitor under your cursor).
Ctrl+Alt+3 — record a selected region when you only want part of the screen.

A small recording badge appears so you always know it's rolling — and it's deliberately excluded from its own capture, so it never shows up in the video. To stop, press the hotkey again, click Stop on the badge, or use the system tray.

When you stop, a result window opens with an inline player, plus Save, Copy, Upload, and Drag-out buttons. You can trim the start and end, and export to MP4, AI-optimized MP4, GIF, or WebM with quality presets. Encoding can use hardware acceleration (NVENC, Intel QuickSync, or AMD AMF) when your machine supports it, with a software fallback otherwise — and ffmpeg ships inside the installer, so there's nothing else to download.

If you'd rather edit the voice and the system sound apart, you can keep them on separate tracks instead of mixing. That's the cleanest fix for echo and for cases where you want to duck the app audio under your narration later.

One track or two: how to choose

The default — system audio and microphone mixed into one track — is right for most people. It plays correctly everywhere with zero editing, and it's what you want for a quick demo, a tutorial, or a bug report.

Choose separate tracks when you plan to take the recording into an editor and want independent control: lowering the app's volume while you talk, removing a notification sound, or fixing a sync issue. Separate tracks cost you one extra step (you mix them down later) in exchange for full control.

Common failure modes — and how to fix them

Symptom	Likely cause	Fix
Only my voice was recorded	Recorder captured the mic but not the system loopback	Use a recorder with WASAPI loopback (Maxisnap captures it by default — no setup)
No sound at all	Mic muted/wrong device, or output device changed mid-recording	Confirm the correct mic, and record the live output device directly via loopback rather than a virtual cable
System sound but no voice	Wrong microphone selected, or mic permission denied	Pick the right input device and grant microphone access in Windows privacy settings
Echo / doubled audio	Speakers play sound and the mic re-records it	Wear headphones, or record system audio and mic on separate tracks
Audio drifts out of sync	Frame rate or A/V timing mismatch during capture	Capture at a steady frame rate; trim and re-export from the result window if needed
Robotic / crackling sound	Sample-rate mismatch or an overloaded virtual-cable chain	Drop the virtual cable; loopback capture reads the device's native stream cleanly

The thread running through almost all of these: virtual cables and Stereo Mix add fragile moving parts. Reading the output device directly through loopback removes most of the ways "record screen with internal audio and mic" can go wrong.

A quick note on Mac

macOS is the opposite story for system sound. The built-in tools — Cmd+Shift+5 and QuickTime Player — record the screen and your microphone fine, but they cannot capture internal audio on their own. To record what the Mac is playing, you still need a driver such as BlackHole or Soundflower routed through an aggregate or multi-output device. If you're on a Mac, the free Mac screen recording guide walks through exactly how to set that up. Maxisnap's recorder is available on Windows today; a native macOS build is in active development, and you can ask to be notified on the contact page.

The fastest path to a recording that actually has sound

If you just want a clip with both the app audio and your voice in it, here's the whole flow on Windows:

Press Ctrl+Alt+4 (full screen) or Ctrl+Alt+3 (region).
Talk over what you're demoing — system sound and your mic are both captured, mixed by default.
Press the hotkey again to stop. Trim the dead air off the start and end.
Export to MP4 — or pick AI-optimized MP4 (~1280px, 24fps, mono) to get a tiny file you can drop straight into ChatGPT, Claude, or Gemini.

No watermark, no time limit, and the recording is free. That's the difference between a silent video you have to re-shoot and a clip that says everything the first time.

Frequently asked questions

How do I record my screen with system audio and microphone at the same time?
Use a recorder that captures the system loopback as well as the mic. In Maxisnap, press Ctrl+Alt+4 (full screen) or Ctrl+Alt+3 (region); desktop audio via WASAPI loopback and your microphone are both captured and mixed into one track by default — no virtual cable or extra driver needed.

Why does my recording only capture the microphone and not the computer sound?
Because the recorder only opened the mic input. Capturing what the computer plays requires loopback recording of the audio device's output stream. Maxisnap does this through WASAPI, so desktop sound is captured driver-free alongside the mic.

Do I need Stereo Mix or a virtual audio cable on Windows?
No. Stereo Mix is inconsistent and missing on many modern devices, and virtual cables are an extra install with routing to maintain. Maxisnap reads the desktop loopback through WASAPI natively, so neither is required.

Why is there an echo in my recording?
The same sound is being captured twice — your speakers play it and your mic picks it up again. Wear headphones, or record system audio and the mic on separate tracks so you can mute or align one in editing.

Is recording with audio free, and is there a watermark or time limit?
Yes — recording with system audio and mic is free on Windows, with no watermark and no time limit. Paid tiers only come into play if you share recordings through Maxisnap Cloud beyond the free allowance; you can always save locally or self-host.

Want the full walkthrough of recording without audio nuance? See how to record your screen on Windows for free, or learn how the combined screen recorder and screenshot tool fits into one lightweight app.