Optimizing File Delivery: The Technical Specs Voice Actors Must Meet
WAV vs. MP3, LUFS targets, sample rates, file naming: the exact technical specs voice actors need to nail every client delivery.

One Bad File Can Cost You the Gig
You nailed the audition. Your tone was perfect, the pacing was spot-on, and the client loved your read. Then you delivered the final files, and everything fell apart. The audio was the wrong format, the sample rate didn't match the specs, and the file naming looked like it came from a random number generator. The client moved on to someone else.
This scenario plays out more often than most voice actors want to admit. Technical file delivery separates working professionals from talented amateurs. Clients, especially those in broadcast, e-learning, and gaming, have rigid technical requirements, and meeting them without being asked is one of the fastest ways to build a reputation for reliability.
Audio Formats: Knowing What to Deliver and When
Not every project calls for the same file type, and sending the wrong format signals inexperience. Here are the formats you'll encounter most often:
WAV (Broadcast WAV / .wav): The industry standard for most professional deliveries. Uncompressed, lossless, and universally accepted by editors and post-production houses.
MP3 (.mp3): Used for web content, podcasts, and quick review files. Always deliver at 192 kbps or higher unless told otherwise.
When a client doesn't specify a format, deliver WAV files. It's the safest default, and most production pipelines are built around it. If file size is a concern (say, for a large e-learning project with hundreds of files), offer both a WAV master and an MP3 delivery version.
Bit Depth and Sample Rate: The Numbers That Matter
These two specifications define the resolution of your audio, and getting them wrong can create real problems downstream.
Sample rate refers to how many times per second the audio is captured. The most common specs you'll see are:
44.1 kHz: CD quality, common for music and some podcast workflows.
48 kHz: The broadcast and video standard. If you record for commercials, explainer videos, or anything synced to picture, this is your default.
96 kHz: Occasionally requested for high-end audiobook or cinematic projects.
Bit depth determines the dynamic range of your recording. Record at 24-bit unless explicitly told to deliver 16-bit. Recording at 24-bit gives you more headroom and a lower noise floor, which makes editing and processing far easier for the client's engineer.
A critical rule: never upsample. If you recorded at 44.1 kHz and the client needs 48 kHz, re-record. Upsampling doesn't add information. It inflates the file and can introduce artifacts.
Loudness Standards and Level Targets
Delivering audio at the wrong loudness level is one of the most common technical mistakes voice actors make. Different platforms and use cases have specific loudness targets, measured in LUFS (Loudness Units relative to Full Scale).
Broadcast (TV/radio): -24 LUFS (EBU R128) in Europe, -24 LKFS (ATSC A/85) in North America
Online video and e-learning: -16 to -20 LUFS, depending on the platform
Podcasts: -16 LUFS (mono) or -19 LUFS (stereo) per Apple's recommendations
Audiobooks (ACX/Findaway): -18 to -23 dB RMS, with peaks no higher than -3 dBFS
Invest time learning a loudness meter plugin. Tools like Youlean Loudness Meter (free) or iZotope Insight give you real-time LUFS readings so you can hit the target before export. Delivering audio that's too hot or too quiet creates extra work for the client, and extra work means fewer repeat bookings.
File Naming and Organization: The Silent Professional Move
Clean file naming seems trivial until you're a producer sorting through 200 files from eight different voice actors. The voice actor who labeled everything clearly and consistently gets hired again.
Follow these principles unless the client provides a naming convention:
Use underscores or hyphens instead of spaces. Spaces cause problems in many production systems.
Include the project name, script reference, and take number: BrandX_Radio30_Take2.wav
Never use generic names like final_audio.wav or voiceover.mp3.
If delivering multiple files, include a consistent numbering scheme: Module01_Lesson03_Page07.wav
When a client provides a naming convention in their brief, follow it exactly. Don't "improve" it. Automated ingest systems used by studios and agencies often rely on specific naming patterns to route files correctly.
Folder Structure for Large Projects
For projects with dozens or hundreds of files, organize deliveries into clearly labeled folders. A typical structure might look like this: a top-level folder named with the project and date, sub-folders for each chapter, module, or section, and a separate folder for any alternate takes or pickups. Compress the entire package into a single .zip file before uploading. It keeps everything together and reduces transfer errors.
Noise Floor, Processing, and Delivery-Ready Audio
Most clients expect delivery-ready audio unless they've specifically asked for raw or dry files. Delivery-ready typically means:
A noise floor at or below -60 dBFS
No mouth clicks, plosives, or room reflections audible in the recording
Breaths either removed or reduced to a natural level
Consistent levels throughout the file with no sudden volume jumps
Appropriate head and tail silence, usually 0.5 to 1 second of room tone at the start and end
Be cautious with noise reduction tools. Over-processing creates that hollow, underwater sound that immediately signals a subpar recording environment. If your raw recordings need heavy noise reduction to sound acceptable, address the room acoustics first. No plugin fully compensates for a bad recording space.
To Process or Not to Process
Some clients, particularly those with in-house audio engineers, want completely dry, unprocessed files. Others want fully mastered audio ready for broadcast. When in doubt, ask during the booking stage. Delivering heavily compressed and EQ'd audio to a team that planned to do their own processing is as problematic as sending raw files to a client who expected a polished product.
Transfer Methods and Delivery Speed
How you deliver files matters almost as much as the files themselves. Email attachments cap out around 25 MB, which rules them out for most professional deliveries. Instead, use reliable cloud-based transfer methods:
Google Drive or Dropbox: Simple, widely used, and most clients already have accounts.
WeTransfer: Good for one-off deliveries up to 2 GB on the free tier.
Client-provided portals: Many studios and agencies use platforms like Source-Connect, ISDN replacements, or proprietary upload systems.
Turnaround time expectations vary by project type. A 30-second commercial spot might need same-day delivery, while a full audiobook chapter could have a weekly cadence. Whatever the agreed timeline, deliver early when you can. Consistently beating deadlines builds trust and makes you the first call for rush jobs, which often come with premium rates.
Build Your Reputation One File at a Time
Technical excellence in file delivery will win you a second booking. Clients remember the voice actor who delivered clean, correctly formatted files with sensible naming. No follow-up emails needed, no re-exports requested. That reliability compounds over time into a career built on repeat business and referrals.
If you're a voice actor ready to connect with clients who value professionalism and real human talent, RealVOTalent.com is built for exactly that. Create your profile, showcase your skills, and start landing projects where your technical standards match your vocal talent.

Written by
Trevor O'Hare
Founder, RealVOTalent
Trevor is a professional voice actor who has worked in audio for over two decades and been in the voiceover industry since 2019, completing thousands of projects for Fortune 500 companies and small businesses alike. He also coaches voice talent at VOTrainer.com.
Ready to Find Your Perfect Voice?
Browse our roster of vetted, professional voice talent — 100% human, no AI.
Browse Voice Talent

