4.4 million podcast episodes are published every month on Apple Podcasts alone, yet listener surveys consistently show that audio quality is the number-one reason people abandon a new podcast within the first 90 seconds — ahead of content, hosting style, and production value. The most common audio quality complaint is "sounds echoey" or "sounds like a bathroom." This is not a microphone problem. It is a room problem. The typical untreated spare bedroom used for podcasting has an RT60 of 0.6 to 1.0 seconds. Professional podcast studios target 0.2 to 0.35 seconds. Your listeners hear that difference as the gap between "amateur" and "professional," and they make the judgement within seconds.
The fix is not expensive. For a typical 3m × 2.5m spare bedroom, £250 in materials and one afternoon of work transforms the acoustic environment from bathroom-reverberant to broadcast-ready. This article provides the exact treatment plan: the physics of why podcast rooms need specific RT60 targets, a worked calculation with real absorption coefficients, the priority order for panel placement, and a complete shopping list.
Why Podcast Audio Demands Lower RT60 Than Other Rooms
Speech intelligibility in a meeting room or classroom requires RT60 ≤ 0.5–0.6 seconds. Podcast recording requires RT60 ≤ 0.35 seconds — roughly half. The reason is how the audio is consumed.
In a meeting room, the listener and speaker share the same acoustic space. The listener's auditory system uses spatial cues, lip reading, and the precedence effect to suppress room reflections and focus on the direct speech. The room's reverberation is processed as spatial context, not as degradation.
A podcast listener wears headphones. The room's reverberation is reproduced directly in the listener's ear canal, without spatial context, without visual cues, and at a listening level the listener has chosen for maximum speech detail. In this intimate listening condition, room reflections that are imperceptible in person become obvious artifacts. A 50ms reflection from a rear wall that would go unnoticed in face-to-face conversation sounds like a distinct, distracting echo through headphones.
Furthermore, podcast audio undergoes compression, both dynamic (podcast loudness normalisation to -16 LUFS per AES TD1004.1.15-10) and data (MP3/AAC encoding at 128–192 kbps). Compression algorithms preserve the direct speech signal efficiently but handle reverberant tails unpredictably — the tail can "pump" during quiet syllable gaps, creating audible modulation artifacts. Starting with a clean, dry signal gives the compression chain the best possible input.
The targets, referenced to professional broadcast standards:
| Standard | RT60 Target (500 Hz) | Application |
|---|---|---|
| EBU Tech 3276 | 0.2–0.4 s | Broadcast studios |
| ITU-R BS.1116-3 | 0.2–0.3 s | Critical listening rooms |
| AES Technical Document | < 0.4 s | Recording/mixing rooms |
| Podcast industry practice | 0.2–0.35 s | Voice-focused recording |
| Home office (comparison) | 0.3–0.5 s | Video calls |
Worked Example: A 3m × 2.5m Spare Bedroom
Room Specification
- Length: 3.0 m
- Width: 2.5 m
- Height: 2.5 m
- Volume: 3.0 × 2.5 × 2.5 = 18.75 m³
- Ceiling: plasterboard (α = 0.05 at 500 Hz)
- Floor: carpet (medium pile, on underlay) (α = 0.25 at 500 Hz)
- Walls: painted plasterboard on timber frame (α = 0.05 at 500 Hz)
- Window: 1 double-glazed unit, 1.0m × 0.8m (α = 0.04 at 500 Hz)
- Door: hollow-core timber, 0.8m × 2.0m (α = 0.10 at 500 Hz)
- Furniture: 1 desk (1.2m × 0.6m), 1 office chair, 1 small bookshelf (0.8m × 1.0m)
Surface Areas and Existing Absorption at 500 Hz
| Surface | Area (m²) | α at 500 Hz | Absorption (sabins) |
|---|---|---|---|
| Plasterboard ceiling | 7.50 | 0.05 | 0.38 |
| Carpet floor | 7.50 | 0.25 | 1.88 |
| Plasterboard walls (minus window and door) | 24.40 | 0.05 | 1.22 |
| Window (double glazed) | 0.80 | 0.04 | 0.03 |
| Door (hollow-core) | 1.60 | 0.10 | 0.16 |
| Desk (laminate) | 0.72 | 0.05 | 0.04 |
| Office chair | — | — | 0.15 |
| Bookshelf (partially filled) | 0.80 | 0.25 | 0.20 |
| Total | — | — | 4.06 |
Current RT60
Per the Sabine equation, ISO 3382-2:2008 §A.1:
RT60 = 0.161 × 18.75 / 4.06 = 0.74 seconds
With the podcaster present (adding approximately 0.5 sabins), closed heavy curtains (1.0 × 1.5m draped, adding approximately 0.67 sabins), the RT60 drops to:
RT60 = 0.161 × 18.75 / 5.23 = 0.58 seconds
Still nearly twice the 0.30-second midpoint of the target range. On a condenser microphone at 20cm from the speaker's mouth, this room produces audible reflections that competent listeners and podcast critics will immediately identify as "room sound." It is the acoustic signature of a bedroom recording rather than a studio recording.
Target
RT60 = 0.25 seconds (the midpoint of the 0.2–0.35 second target range)
A_required = 0.161 × 18.75 / 0.25 = 12.08 sabins
Current absorption (with person + curtains): 5.23 sabins. Absorption deficit: 6.85 sabins.
Note: because the target absorption approaches 30% of the total surface area (12.08 / 41.3 = 29.3%), the Sabine equation begins to overestimate and the Eyring equation per ISO 3382-2:2008 §A.2 gives a more accurate prediction. We will calculate both and use the Eyring result as the true target.
Eyring equation: RT60 = 0.161 × V / (−S × ln(1 − ᾱ))
For the target of 0.25s: −S × ln(1 − ᾱ) = 0.161 × 18.75 / 0.25 = 12.08 So: −41.3 × ln(1 − ᾱ) = 12.08 → ln(1 − ᾱ) = −0.2925 → 1 − ᾱ = 0.746 → ᾱ = 0.254
Required average absorption coefficient: ᾱ = 0.254, or total equivalent absorption (Eyring) of approximately 10.5 sabins.
This is lower than the Sabine estimate of 12.08, confirming that Sabine overestimates in well-treated rooms. The actual treatment area needed is smaller than Sabine suggests — a cost saving of approximately 15%.
Using the Eyring target of 10.5 sabins: Absorption deficit (Eyring): 10.5 − 5.23 = 5.27 sabins.
Treatment Plan: Priority Order
The priority order for podcast room treatment differs from meeting rooms and offices because the microphone position is fixed and close to the speaker. Treatment should focus on reflections that arrive at the microphone with the highest energy and shortest delay.
Priority 1: Ceiling Cloud (2 panels — 1.44 m²)
The ceiling above the recording position is the strongest first reflection point. Sound from the speaker's mouth radiates upward, reflects off the plasterboard ceiling, and arrives at the microphone from above. In a 2.5m room with the speaker seated, the ceiling reflection path is approximately 3.0m total (1.3m up + 1.7m down to mic), arriving approximately 8.7ms after the direct signal. This is a strong, early reflection that colours the recorded voice with a "roomy" character.
Specification: 2 × mineral wool panels (600 × 1200 × 50mm, 40 kg/m³ density), fabric-wrapped, mounted flush to ceiling above recording position.
Absorption added: 1.44 × 0.85 = 1.22 sabins
Why ceiling first: Professional studios always treat the ceiling first. The ceiling reflection is the most energetic single reflection in a small room because (a) the ceiling-to-seated-head distance is short (1.2–1.3m), creating a high-energy reflection, and (b) the reflection angle is near-normal (perpendicular), which maximises the reflected energy coefficient for untreated plasterboard.
Priority 2: Corner Bass Traps (4 panels — 2.88 m²)
Room corners (where two walls meet, or where walls meet the ceiling) are where sound pressure is highest — up to 9 dB above the room average at low frequencies. Panels mounted across corners span the high-pressure zone and absorb low-frequency energy more efficiently than the same panel mounted flat on a wall.
Specification: 4 × mineral wool panels (600 × 1200 × 50mm), mounted across the upper corners of the room (where walls meet the ceiling, spanning both walls at 45 degrees). Each panel bridges the corner, creating a triangular air space behind it that dramatically improves low-frequency absorption.
The air space behind a corner-mounted panel acts as an extended air gap. For a 600mm panel mounted diagonally across a 90-degree corner, the air gap ranges from 0 at the panel edges (where it touches the walls) to approximately 210mm at the centre. This effective air gap provides significant absorption improvement at 125 and 250 Hz:
| Frequency | α (flat mount) | α (corner mount) | Improvement |
|---|---|---|---|
| 125 Hz | 0.15 | 0.55 | +267% |
| 250 Hz | 0.55 | 0.85 | +55% |
| 500 Hz | 0.85 | 0.90 | +6% |
| 1000 Hz | 0.95 | 0.95 | 0% |
Absorption added: Using the corner-mounted α values at 500 Hz: 2.88 × 0.90 = 2.59 sabins (at 500 Hz). At 125 Hz, the absorption is 2.88 × 0.55 = 1.58 sabins — critical for controlling the low-frequency "boominess" that plagues small room recordings.
Why corner traps second: After the ceiling cloud, corner traps provide the next-highest acoustic leverage because they simultaneously control low-frequency resonances (room modes) and mid/high-frequency reflections. Small rooms have prominent room modes below 200 Hz that create audible booming and colouration on voice recordings. Corner traps are the only practical treatment for these modes in a room this small.
Priority 3: Side Wall Panels at First Reflection Points (2 panels — 1.44 m²)
The side walls create lateral reflections that arrive at the microphone from the sides. These reflections are particularly problematic for podcast recording because they create a "wide" or "diffuse" quality that conflicts with the close, intimate sound listeners expect.
Specification: 2 × mineral wool panels (600 × 1200 × 50mm), mounted at seated head height on the two side walls, at the first reflection point relative to the recording position.
Finding the first reflection point: sit in your recording position. The first reflection point on each side wall is at the midpoint between your mouth and the microphone — typically directly beside you, or slightly in front of your seated position.
Absorption added: 1.44 × 0.85 = 1.22 sabins
Priority 4: Rear Wall (if budget allows — 1–2 panels)
The wall behind the recording position creates a rear reflection that arrives at the microphone from behind the speaker. This reflection is less energetic than ceiling and side wall reflections (because the speaker's head partially blocks it), but it contributes to the overall reverberant field.
If the bookshelf is already on this wall, it provides moderate treatment (α ≈ 0.25–0.35). A full bookshelf on the rear wall may be sufficient to omit dedicated panel treatment here.
Treatment Summary
| Priority | Treatment | Panels | Area (m²) | Sabins Added |
|---|---|---|---|---|
| 1 | Ceiling cloud | 2 | 1.44 | 1.22 |
| 2 | Corner bass traps | 4 | 2.88 | 2.59 |
| 3 | Side wall panels | 2 | 1.44 | 1.22 |
| — | Total | 8 | 5.76 | 5.03 |
After Treatment: Verification
Total absorption (with person + curtains + treatment): 5.23 + 5.03 = 10.26 sabins
Sabine RT60 = 0.161 × 18.75 / 10.26 = 0.29 seconds
Average absorption coefficient: ᾱ = 10.26 / 41.3 = 0.248
Eyring RT60 = 0.161 × 18.75 / (−41.3 × ln(1 − 0.248)) = 0.161 × 18.75 / 11.76 = 0.26 seconds
Both estimates fall within the 0.2–0.35 second target range. The Eyring prediction of 0.26 seconds is more accurate for this level of absorption and indicates a room that will sound professionally dry without being oppressively dead.
Shopping List and Cost
| Item | Specification | Quantity | Unit Cost | Total |
|---|---|---|---|---|
| Mineral wool boards | 50mm, 40 kg/m³, 600×1200mm | 8 boards (5.76 m²) | £8 each | £64 |
| Mineral wool boards (corner traps) | 50mm, 40 kg/m³, 600×1200mm | (included above) | — | — |
| Breathable acoustic fabric | 1.5m wide, polyester or cotton blend | 7 m | £7/m | £49 |
| Timber battens | 25×25mm PAR softwood | 20 m | £1.50/m | £30 |
| Spray adhesive | High-tack | 2 cans | £8 each | £16 |
| Staple gun + staples | Arrow T50 or equivalent, 10mm staples | 1 | £15 | £15 |
| Corner mounting brackets | L-brackets, stainless steel, 50mm | 16 (4 per corner trap) | £1.50 each | £24 |
| Picture hooks (heavy duty) | 10 kg rated | 8 | £1.50 each | £12 |
| Command strips (large) | 3M, ceiling-rated, 7.2 kg | 8 pairs | £3 per pair | £24 |
| Miscellaneous (screws, rawl plugs, wire) | — | — | — | £15 |
| Total | — | — | — | £249 |
Microphone Technique in a Treated Room
The room treatment maximises its benefit when combined with proper microphone technique. In a well-treated room (RT60 ≤ 0.35s), the microphone technique determines the final direct-to-reverberant ratio.
Distance Matters
The critical distance — the distance from a sound source at which the direct and reverberant sound levels are equal — is determined by room absorption. In the untreated room (A = 5.23 sabins), the critical distance is:
r_c = 0.057 × √(A) = 0.057 × √5.23 = 0.13 m (13 cm)
In the treated room (A = 10.26 sabins):
r_c = 0.057 × √(10.26) = 0.18 m (18 cm)
At a microphone distance of 15cm (standard podcast technique), the direct-to-reverberant ratio in the treated room is approximately +6 dB — the direct speech is four times louder than the room reflections at the microphone position. In the untreated room at the same distance, the D/R ratio is approximately +1 dB — the room reflections are nearly as loud as the direct signal.
Recommended Setup
- Microphone type: Dynamic cardioid (e.g., Shure SM7B, ElectroVoice RE20, Rode PodMic) or large-diaphragm condenser in cardioid pattern (e.g., Audio-Technica AT2020, Rode NT1)
- Distance: 10–20 cm from mouth (closer for dynamic, further for condenser)
- Orientation: Microphone aimed at mouth, with the null point of the cardioid pattern aimed at the most reflective remaining surface (typically the untreated wall or window)
- Pop filter: Essential at 10–20cm distance to prevent plosive distortion
- Shock mount: Prevents desk vibrations from reaching the microphone diaphragm
What Professional Studios Do Differently
Professional podcast studios achieve RT60 of 0.15–0.25 seconds through:
- Thicker absorption: 100–150mm mineral wool or fibreglass panels instead of 50mm, providing effective absorption down to 80 Hz
- Floor-to-ceiling corner traps: Triangular columns of mineral wool spanning the full room height in two or four corners, providing massive low-frequency absorption
- Floating floor: Carpet over an acoustic underlay over a resilient layer, decoupling the floor from the building structure to eliminate structure-borne noise
- Isolated construction: Room-within-a-room construction with air gaps between inner and outer walls, preventing external noise from entering the recording space
- HVAC silencing: Ductwork lined with acoustic insulation and fitted with silencers to reduce ventilation noise below 25 dBA (NR 20)
- Diffusion: Rear wall treated with quadratic residue diffusers (QRDs) instead of absorption, preserving a sense of space without adding reflections at the microphone position
Frequency-by-Frequency Analysis
Speech has energy across a wide frequency range, but different frequencies affect the recording in different ways:
| Frequency Range | Effect on Recording | Treatment Priority |
|---|---|---|
| 80–200 Hz (bass) | "Boominess," proximity effect exaggeration, room modes | Corner bass traps (Priority 2) |
| 200–500 Hz (low-mid) | "Muddiness," nasal quality, chest resonance reflections | Thick wall panels + corner traps |
| 500–2000 Hz (mid) | "Hollowness," the primary "room sound" listeners identify | Ceiling cloud + side walls (Priority 1, 3) |
| 2000–8000 Hz (high) | "Sibilance splash," metallic quality from hard surfaces | Any porous absorber controls this range |
The treatment plan addresses all ranges: corner bass traps handle 80–250 Hz, the ceiling cloud and side wall panels handle 250–8000 Hz. The combination provides broadband control that produces the clean, neutral recording environment a podcast microphone needs.
Common Podcast Room Mistakes
Mistake 1: Foam on All Walls
Covering every wall with 25mm acoustic foam creates a room that absorbs high frequencies aggressively (α ≈ 0.90 above 2000 Hz) but barely touches low frequencies (α ≈ 0.10 at 125 Hz). The result is a recording that sounds "dead" on sibilants but "boomy" on vowels — an unnatural, unpleasant quality that no amount of EQ can fix in post-production. The correct approach is balanced, broadband absorption using 50mm+ mineral wool, prioritising the reflection points and corners described above.
Mistake 2: Egg Crate Mattress Toppers
These have no meaningful acoustic absorption. They are thin, non-porous, and provide negligible friction to air molecule oscillation. Their textured surface creates visual complexity that some people associate with "studio" aesthetics, but the acoustic effect is essentially zero — α ≤ 0.05 across all frequencies.
Mistake 3: Recording in a Closet
A walk-in wardrobe full of clothes does provide significant absorption (thick clothing has α ≈ 0.50–0.70) and can achieve very low RT60. However, closets are typically very small (under 3 m³), which creates severe acoustic problems: pronounced room modes below 300 Hz, an oppressive "closet sound" from the extreme absorption ratio, and poor ventilation that causes discomfort during long recording sessions. A treated bedroom is acoustically superior to an untreated closet.
Mistake 4: Ignoring the Floor
In a room with carpet, the floor is already the second-best absorber after the treated ceiling. If the podcast room has a hard floor (laminate, timber, vinyl), adding a thick rug (minimum 10mm pile on an underlay) beneath and around the recording position adds 0.6–1.5 sabins — a significant and cost-effective contribution.
Related Reading
- Home Office Echo on Zoom Calls — The £180 DIY Fix — similar treatment approach for video call rooms with a lower RT60 target
- Why Does My Room Echo? The Physics, the Diagnosis, and the Fix — the fundamental physics of room echo
- How Do Acoustic Panels Work? The Physics of Sound Absorption — deep dive into porous absorbers, membrane absorbers, and Helmholtz resonators