Poorly prepared source files add 20-40% to localization project timelines and cost. This checklist walks through the 12 essential file preparation steps before sending content to a translation or DTP vendor: from font embedding and link management through text extraction and CAT tool optimization. Based on our analysis of 10,000+ client projects, with specific guidance for InDesign, FrameMaker, Word, PowerPoint, and PDF workflows. Save hours of rework by preparing files correctly the first time.
TL;DR
Source file preparation is the most commonly under-invested step in localization workflows, yet it has an outsized impact on project outcomes. Poorly prepared source files create downstream problems that compound through translation, DTP, and QA: text trapped in non-editable image layers cannot be translated without reconstruction, inconsistent styling forces DTP operators to manually re-style every paragraph per language, broken links cause missing images in translated output, missing fonts produce typographic substitutions that fail brand standards, and hidden or deleted content that re-appears during format conversions creates confusing artifacts in final output.
Quantifying the impact, our analysis of 10,000+ client projects shows that well-prepared source files localize 20-40% faster and cost 15-30% less than poorly prepared files, with additional benefits in final quality and lower revision cycles. For a typical US$50,000 multilingual DTP project, this represents US$7,500-15,000 in direct cost savings plus several days of timeline reduction. At enterprise scale with multi-million-dollar annual localization spend, disciplined file preparation processes produce cumulative savings well worth the upfront investment in preparation standards, training, and quality gates before files leave the source creation team.
Following are the 12 essential steps that should be completed before any source file is handed to a translation or DTP vendor. Each step takes minutes but prevents hours of downstream rework.
Adobe InDesign files benefit most from consistent paragraph and character style application. Avoid manual overrides — styles should define all typography decisions, not direct formatting on individual text runs. Use the Preflight panel to identify missing fonts, broken links, and overset text before handoff. Package files using File > Package to gather all fonts, links, and metadata into a single folder for vendor delivery. Verify that no text is accidentally placed on hidden layers or outside page boundaries, since hidden content can surface during format conversions.
Adobe FrameMaker documents, particularly structured FrameMaker using DITA or custom DTDs, benefit from validation before handoff. Run the Structure View to confirm all elements are valid per the applicable schema. Verify cross-references resolve correctly. Check conditional text settings and ensure the correct conditions are active for your translation scope. For books and large documents, regenerate all tables of contents, lists of figures, and indexes to verify they're current. Share the book file plus all referenced chapter files, graphic files, and the DTD or EDD.
Microsoft Word documents benefit from disciplined style use rather than direct formatting. Avoid tables used purely for layout — they often break badly when text expands. Use Word's Heading 1-6 styles for headings rather than manual bold and sizing. Remove track changes (accept or reject all) before handoff. Clean hidden text and comments that aren't intended for final output. For complex Word documents with lots of tracked revisions, consider producing a clean version as the translation source while keeping the tracked version for reference.
PDF files present special challenges. Always provide editable source files (Word, InDesign, FrameMaker) rather than PDFs when possible — PDFs require reconstruction into editable format before translation can begin, adding cost and time. If only PDF is available, clearly indicate whether it's editable PDF (generated from source) or scanned PDF (requires OCR). For scanned PDFs, identify the original source software if known to inform reconstruction approach. Flag any PDFs with security restrictions, since these complicate OCR and text extraction.
Step-by-Step Checklist
Create a clean copy of source files separate from your working files, labeled clearly with version and date
Embed all fonts into document files and package linked images/graphics using native tools (InDesign Package, FrameMaker Save As Archive)
Remove hidden text, tracked changes, comments, and deleted content that won't appear in final output
Use paragraph and character styles consistently — avoid manual formatting overrides
Remove line breaks within paragraphs; use only paragraph returns at end of paragraphs
Clearly identify code, file paths, product names, and other content that should not be translated
Where possible, convert text-in-images to live editable text for translation
Ensure files use UTF-8 encoding; test by opening in a text editor to verify character support
Verify footnotes, endnotes, cross-references, and index entries are properly tagged and resolve correctly
Write a brief note documenting any intentional layout constraints (fixed widths, pagination requirements, etc.)
Bundle source files, reference materials, style guides, glossaries, and any special instructions for vendor
Provide clear scope statement, target languages, specific deliverables, and realistic deadline expectations
Related Services
Accurate, culturally relevant, and scalable multilingual content — human translation and MTPE across 100+ languages.
Pixel-perfect layouts across every language — from Latin scripts to RTL, CJK, and Indic.
Prepare your content for efficient, accurate, and scalable localization workflows — zero data loss.
Related Definitions
Multilingual desktop publishing (multilingual DTP) is the process of formatting and laying out translated content across...
Machine Translation Post-Editing (MTPE), also called PEMT, is a hybrid translation workflow where raw machine translatio...
FAQs
Send us one file. We'll DTP it for free — so you can judge our quality before committing. No strings attached.
99.5% on-time delivery · 125,000+ projects · Avg. 2hr response