The Truth / scorecard

Word Copilot: a strong writer with a short attention span

"Excellent at first drafts and rewrites, mediocre at the long, formatted, template-bound documents Word actually exists for."

7/10

What's genuinely good

  • Drafting from referenced files is the headline act: point it at up to three source documents with / and it produces a genuinely usable first draft
  • Rewriting with constraints — 'make this half the length, keep the legal caveats, lose the passive voice' — is reliable and fast
  • Summarizing long documents you've been handed (the 80-page policy, the contract redline) is consistently strong
  • Tone transformation actually works: formal to plain-language, technical to executive, without mangling the meaning
  • Chat-with-your-document for Q&A ('does this contract mention termination fees?') beats Ctrl+F by a mile

What sucks

  • Coherence drops with length: past a dozen pages of generated text, sections repeat, terminology drifts, and the argument loses its thread
  • Formatting and styles handling is clumsy — it writes text, but ask it to respect your heading hierarchy or numbered styles and it shrugs
  • It cites referenced documents it didn't meaningfully use, giving drafts a false aura of being grounded in your sources
  • Templates and complex layouts (tables, text boxes, multi-column, forms) confuse it badly — exactly the documents enterprises live in
  • The three-file reference cap is real and arbitrary: your project has nine source documents, Copilot takes three

The honest assessment

Word Copilot is a very good writing assistant grafted onto an app whose hardest problems aren’t writing. Sentence by sentence and section by section, it’s strong: drafts come out clean, rewrites respect your constraints, and the tone-shifting genuinely works. If your job involves staring at a blank page that needs to become two pages of competent prose, the blank-page problem is solved.

The flagship move — drafting from referenced files — deserves its reputation. Slash in a meeting recap and a requirements doc and ask for a project brief, and what comes back is 70% of the way there, which is exactly what a first draft should be. The other reliably great mode is reading, not writing: summarizing the 80-page document someone sent you at 4:55pm, or interrogating a contract with direct questions, works almost every time.

Then you ask it to behave like Word, and the cracks show. Coherence degrades with length: a long generated document repeats itself, drifts terminology, and forgets in section six what it claimed in section two. Formatting is worse — Copilot produces text, and your carefully built style hierarchy, numbered headings, and template structure are somewhere between ignored and damaged. And there’s a quieter problem that should bother you more: it will cite the referenced files in drafts where it demonstrably didn’t use their content, which makes a draft look grounded when it’s confabulated. In regulated or legal contexts, that’s not a quirk, it’s a hazard.

The workarounds that change the score

Three habits push Word Copilot from a 7 to a personal 8 or 9:

  1. Draft in sections, never in documents. One section per prompt, with explicit instructions for what it should contain. You become the coherence layer — which is the part of the job that was always yours anyway. Long documents built section-by-section don’t drift.
  2. Pick your three references like they’re scarce — because they are. Don’t slash in whatever’s nearby. If you have nine sources, summarize them into one consolidated brief first (Copilot will do this), then reference the brief. Curated input is the whole game.
  3. Verify every citation by spot-check. When a draft claims something came from your referenced file, check one or two claims against the source before the draft leaves your hands. Thirty seconds of checking catches the confabulated-grounding problem before it becomes your problem.

What Microsoft won’t tell you

  • The demo is always a fresh blank document, never the company template. There’s a reason: template fidelity is where the experience falls apart, and that’s most enterprise documents.
  • “Drafts based on your files” oversells the grounding. It’s inspired by your files; it is not constrained to them. The distinction matters enormously when the file is a contract and the draft is going to a client.
  • Length is the silent quality variable. Nobody publishes a number, but every heavy user learns the same lesson: short generations are good, long generations need a human editor, and the crossover comes sooner than you’d hope.

Bottom line

Treat Word Copilot as a brilliant junior writer: give it sources, give it one section at a time, check its citations, and do the formatting yourself. Used that way, it’s the fastest path from nothing to a real draft that exists. Asked to produce the long, templated, styled document your org actually ships — alone, in one shot — it isn’t there yet, and you should plan as if it won’t be soon.

← All scorecards