Pulling a table out of a PDF (without retyping every cell)
When PDF→Excel works, when it doesn't, and the manual cleanup that's usually still faster than starting from scratch.
You need the data in a 40-row table from a PDF. Retyping is 30 minutes and three errors you won't notice until later. Converting and cleaning up is 5 minutes.
Ready to try this yourself?
Open PDF to Word in a new tab and read the rest while you upload.
PDF → Excel is doing two jobs at once: extracting the text and reconstructing the row/column grid. The text part is easy; the grid part is where conversions earn their reputation for being unreliable. Evixpdf's converter handles cleanly drawn tables well and warns you about the messy cases.
What converts well
- Tables with visible borders (gridlines on every cell).
- Single-page tables with consistent column widths.
- Tables from financial reports, invoices, and exported spreadsheets — they were tables to begin with.
What needs cleanup
Tables without borders — the converter infers columns from horizontal whitespace. Two columns very close together may get merged. Inspect, split, save.
Multi-page tables where the header repeats. The output gets the header rows duplicated; delete the dupes. Five seconds per page.
Tables with merged cells. There's no clean way to round-trip a merged cell to Excel; you'll get the data in the top-left cell and blanks in the others. Manual fix.