Legal Document Management with Bates Numbering
A comprehensive guide to Bates numbering for legal professionals, covering numbering schemes, implementation methods, and best practices for litigation support.
What Is Bates Numbering
Bates numbering is a method of indexing legal documents for identification and retrieval. Named after the Bates Manufacturing Company, which produced an early automatic numbering machine, Bates numbering assigns a unique sequential identifier to every page in a collection of documents. These identifiers ensure that every page can be uniquely referenced during legal proceedings, depositions, and document review.
A typical Bates number consists of a prefix (often identifying the producing party or document set), a sequential number (padded with leading zeros for consistent formatting), and optionally a suffix. For example, "ACME-000001" through "ACME-015234" might identify all pages produced by Acme Corporation in response to a discovery request. The numbering is sequential across the entire production, not per document, so that each page in the collection has a globally unique identifier.
Bates numbering serves several critical legal functions. During discovery, it provides a precise way to reference specific pages in pleadings, depositions, and correspondence. "Please refer to ACME-003456" is unambiguous. During trial, exhibits can be identified and tracked by their Bates numbers. For appellate proceedings, the trial record can be referenced with precision. The sequential nature of Bates numbers also proves completeness: if numbers 1 through 1000 are produced with number 547 missing, the gap is immediately apparent.
Numbering Scheme Design
Designing an effective Bates numbering scheme requires planning before any documents are numbered. The scheme must accommodate the expected volume of documents while remaining clear and consistent. Key decisions include the prefix format, the number of digits, and whether suffixes are needed.
The prefix typically identifies the producing party or the document collection. Common approaches include party abbreviations ("ACME," "DEF," "PLT"), case reference numbers, or production batch identifiers. Keep prefixes short (3-6 characters) to avoid making the Bates number unwieldy. Use consistent separator characters (hyphens are standard) between the prefix and number.
The number of digits should be sufficient for the total expected page count with room for supplemental productions. If the initial production contains 50,000 pages, use at least 6 digits (allowing up to 999,999 pages). Starting with too few digits creates problems when supplemental productions push the count past the maximum. It is standard practice to agree on the numbering format with opposing counsel before production begins, especially regarding the prefix, number of digits, and the starting number for supplemental productions. Consistent formatting across all productions in a case makes it easy to identify which production a page came from and ensures that references in legal documents are unambiguous.
Applying Bates Numbers to PDFs
Bates numbers are applied to PDFs as a stamp or overlay on each page, typically in the footer area. The number should be positioned consistently on every page and should not obscure the document content. Adobe Acrobat Pro provides a dedicated Bates Numbering feature that processes a batch of PDFs, applying sequential numbers across all pages of all files in the batch.
The placement of Bates numbers requires careful consideration. The bottom-right corner of the page is the most common position, but bottom-center and bottom-left are also used. The font should be small enough not to be intrusive but large enough to be legible when photocopied or scanned (typically 8-10 point). A consistent font and color (usually black) should be used throughout the production. For documents with existing content near the page borders, the Bates number may need to be placed in a white space area or the page may need a slight margin adjustment to create space.
Beyond the visible stamp, Bates numbers should be added to the document metadata or as searchable text in an OCR layer so that they can be found through text search. This dual approach, visible stamp plus searchable text, ensures that the Bates number functions both when viewing the document and when searching through a document collection electronically. Some litigation support tools automatically create a database mapping Bates number ranges to source documents, filenames, and document metadata.
Handling Multi-Format Document Collections
Real-world legal document productions involve documents in many formats: paper documents that must be scanned, existing PDFs, Microsoft Office files, emails, spreadsheets, images, and more. All must be converted to a consistent format and Bates numbered in sequence. The standard approach is to convert everything to PDF before numbering.
For paper documents, the scanning workflow should produce high-quality scans (at least 300 DPI, black and white for text, color for documents where color is relevant) with OCR applied for text searchability. Each physical document should become a separate PDF file. The scanning should preserve the document's original page sequence and note any physical characteristics (staples indicating pages that belong together, paper clips grouping documents).
For electronic documents, conversion to PDF should preserve the document's visual appearance. Microsoft Office files should be converted using the application's built-in PDF export (rather than a generic print-to-PDF driver) to maintain formatting, table structure, and embedded objects. Emails should be converted including header information (From, To, Date, Subject) and any attachments should be converted separately but numbered in sequence with the parent email. Spreadsheets present a particular challenge because they may have multiple worksheets, wide columns that do not fit on a single page, and print areas that do not capture all relevant data. Define print settings that capture the complete content before PDF conversion.
Quality Control and Verification
After Bates numbering a document production, quality control verifies that the numbering is correct, complete, and consistent. The verification process should check that numbers are sequential without gaps or duplicates, that every page in every document bears a Bates number, that the numbers are legible and consistently positioned, and that the database mapping Bates ranges to documents is accurate.
Automated verification is feasible for large productions. A script can extract the Bates number from each page (using OCR or text extraction), sort the numbers, and check for gaps and duplicates. It can also verify that the first and last Bates numbers of each document match the database entries. For the visual positioning check, rendering each page and verifying that the Bates number appears in the expected region can be automated using image analysis.
Manual spot-checking complements automated verification. Review a sample of pages across the production, including the first and last pages, pages at document boundaries (where one document ends and another begins), and pages from documents of different types (scanned paper, converted Word documents, converted emails). Verify that the Bates number is visible, correctly positioned, and correctly sequenced. For productions under litigation hold or regulatory obligation, document the QC process and results as part of the production record.
Bates Numbering in E-Discovery Workflows
In modern e-discovery, Bates numbering occurs within a broader workflow of document collection, processing, review, and production. E-discovery platforms like Relativity, Nuix, and Concordance include Bates numbering as a production feature. Documents are collected, processed (extracting text and metadata), loaded into the review platform, reviewed by attorneys, and then produced with Bates numbers to opposing counsel.
The sequencing of Bates numbers in e-discovery follows the production order, which may differ from the order in which documents were collected or reviewed. Documents are typically produced in a logical order: grouped by custodian, then by folder or date within each custodian's documents. This organization helps the receiving party navigate the production. The Bates number range for each custodian or document group is recorded in the production log.
E-discovery productions often include a load file alongside the Bates-numbered PDFs. The load file is a structured data file (typically in Concordance .dat, Relativity .opt, or IPRO .lfp format) that maps each Bates number range to its document metadata: original filename, author, date, email subject, custodian, and other fields. This allows the receiving party to load the production into their own review platform with full searchability. The load file also indicates document boundaries (where one document ends and the next begins within the sequentially numbered page collection).
Common Challenges and Solutions
Several challenges arise in Bates numbering practice. Supplemental productions must continue the numbering sequence from the previous production. If the initial production ended at ACME-015234, the supplemental production should start at ACME-015235. Tracking the last number used and communicating it clearly to whoever prepares the supplemental production prevents numbering conflicts.
Redactions and privilege logs interact with Bates numbering. When a page is redacted, it retains its Bates number; the number identifies the redacted page in the privilege log that explains why the redaction was applied. When an entire document is withheld on privilege grounds, the convention varies: some practitioners assign Bates numbers to withheld documents (creating placeholder pages noting "Withheld on privilege"), while others note the gap in numbering in the privilege log.
Re-productions (correcting errors in a previous production) require careful numbering management. The corrected pages should retain their original Bates numbers to avoid confusion with references in existing legal filings that cite the original numbers. If additional pages must be added, they should receive new numbers rather than inserting numbers into the existing sequence (e.g., ACME-005000.1 for an added page, though this is non-standard and should be communicated clearly to all parties). Document and communicate any numbering anomalies to opposing counsel and the court to prevent disputes about production completeness.