Best Practices
Follow these practices to get accurate, consistent results from Transez document extraction.
Document Quality
- Use high-resolution scans or clear digital documents
- Ensure text is readable and not obscured by watermarks or stamps
- Straighten heavily skewed or rotated pages before uploading when possible
- For scanned documents, use OCR-quality scans (300 DPI or higher)
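
The 300 DPI guideline can be checked before upload from a scan's pixel dimensions and the physical page size. The following is a minimal sketch, not part of Transez: the function names are hypothetical, and the default page size assumes US Letter (8.5 x 11 inches).

```python
def effective_dpi(width_px: int, height_px: int,
                  page_width_in: float = 8.5,
                  page_height_in: float = 11.0) -> float:
    """Return the lower of the horizontal and vertical pixel densities.

    Assumption: the scan covers the full page, so pixels / inches
    approximates the scanner's DPI setting.
    """
    if page_width_in <= 0 or page_height_in <= 0:
        raise ValueError("page dimensions must be positive")
    return min(width_px / page_width_in, height_px / page_height_in)


def meets_scan_guideline(width_px: int, height_px: int,
                         min_dpi: float = 300.0) -> bool:
    """True when a Letter-size scan reaches the recommended resolution."""
    return effective_dpi(width_px, height_px) >= min_dpi
```

For example, a Letter page scanned at 2550 x 3300 pixels passes the check, while the same page at 1275 x 1650 pixels (150 DPI) does not.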
Writing Effective Prompts
- Be specific about what data you want to extract
- Include column names and data types in your prompts
- Specify format preferences (e.g., 'Format dates as YYYY-MM-DD')
- Use examples when describing complex extraction requirements
- Break down complex extractions into simpler, step-by-step instructions
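
One way to keep prompts specific is to assemble them from a list of column definitions. The sketch below only builds a text string; the wording of the prompt, the `Column` class, and `build_prompt` are illustrative assumptions, and Transez does not require any particular prompt structure.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Column:
    name: str        # column header wanted in the output
    dtype: str       # plain-language data type, e.g. "date" or "decimal"
    note: str = ""   # optional format preference


def build_prompt(doc_kind: str, columns: list,
                 example_row: Optional[dict] = None) -> str:
    """Build a step-by-step extraction prompt (hypothetical wording)."""
    lines = [f"Extract one row per line item from this {doc_kind}.",
             "Columns:"]
    for col in columns:
        detail = f" ({col.note})" if col.note else ""
        lines.append(f"- {col.name}: {col.dtype}{detail}")
    if example_row:
        # A concrete example row helps describe complex requirements.
        lines.append("Example row:")
        lines.append(", ".join(f"{k}={v}" for k, v in example_row.items()))
    return "\n".join(lines)


prompt = build_prompt(
    "invoice",
    [Column("invoice_date", "date", "format as YYYY-MM-DD"),
     Column("total", "decimal", "two decimal places, no currency symbol")],
    example_row={"invoice_date": "2024-03-15", "total": "1234.50"},
)
```

The resulting text can be pasted into the prompt field and saved for reuse with similar documents.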
Data Formatting
- Specify number formats (decimals, thousands separators)
- Define date formats consistently across all documents
- Specify currency symbols and formats
- Indicate how to handle missing or empty fields
- Define text transformations (uppercase, lowercase, trim spaces)
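
The same formatting rules can also be applied as a post-processing step on exported data. This is a minimal sketch that assumes the extracted values arrive as strings; the helper names and the accepted input date formats are hypothetical choices, not Transez behavior.

```python
from datetime import datetime
from decimal import Decimal, InvalidOperation


def parse_amount(raw: str, thousands: str = ",", decimal: str = "."):
    """Convert text such as '$1,234.50' to a Decimal; None if empty or invalid."""
    text = raw.strip()
    if not text:
        return None  # missing field: report it as None rather than 0
    # Keep digits, the minus sign, and the two separator characters only.
    allowed = "-" + thousands + decimal
    cleaned = "".join(ch for ch in text if ch.isdigit() or ch in allowed)
    cleaned = cleaned.replace(thousands, "").replace(decimal, ".")
    try:
        return Decimal(cleaned)
    except InvalidOperation:
        return None


def to_iso_date(raw: str, input_formats=("%Y-%m-%d", "%d/%m/%Y")):
    """Return the date as YYYY-MM-DD, or None if no format matches."""
    text = raw.strip()
    for fmt in input_formats:
        try:
            return datetime.strptime(text, fmt).date().isoformat()
        except ValueError:
            continue
    return None


def clean_text(raw: str, upper: bool = False) -> str:
    """Trim the ends, collapse runs of whitespace, optionally uppercase."""
    text = " ".join(raw.split())
    return text.upper() if upper else text
```

Passing `thousands="."` and `decimal=","` handles European-style amounts such as `1.234,56`.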
Handling Edge Cases
- Specify how to handle merged cells or complex table structures
- Define behavior for duplicate entries or conflicting data
- Indicate how to process multi-page documents
- Specify how to handle special characters and unusual formatting
- Define rules for data validation and error handling
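
A rule for duplicates and conflicting data can be written down precisely. The sketch below shows one possible policy, assuming each extracted row is a dictionary: exact duplicates are dropped, rows that share a key but differ are reported as conflicts, and rows without a key are kept for manual review. The function name and the `invoice_id` key are hypothetical.

```python
def dedupe_rows(rows: list, key: str):
    """Return (unique_rows, conflicts) for rows identified by `key`."""
    first_seen = {}
    unique, conflicts = [], []
    for row in rows:
        value = row.get(key)
        if value in (None, ""):
            # Without a key the row cannot be matched; keep it for review.
            unique.append(row)
            continue
        if value not in first_seen:
            first_seen[value] = row
            unique.append(row)
        elif first_seen[value] != row:
            # Same key, different content: report both versions.
            conflicts.append((first_seen[value], row))
        # Same key and same content: exact duplicate, silently dropped.
    return unique, conflicts


rows = [
    {"invoice_id": "A1", "total": "10.00"},
    {"invoice_id": "A1", "total": "10.00"},   # exact duplicate
    {"invoice_id": "A1", "total": "12.00"},   # conflicting total
    {"invoice_id": "", "total": "5.00"},      # missing key
]
unique, conflicts = dedupe_rows(rows, key="invoice_id")
```

Keeping the first occurrence is a design choice; a different policy, such as preferring the most recent page, would need the rows to carry page or position information.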
Review and Validation
- Always review extracted data before finalizing
- Use the interactive Excel preview to make corrections
- Verify critical fields like amounts, dates, and IDs
- Check for consistency across multiple documents
- Save your extraction templates for future use
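
Checks on critical fields can be partly automated so that manual review focuses on flagged records. The following sketch assumes each record is a dictionary of strings; the field names, the `INV-` ID pattern, and the rule that line amounts must sum to the total are hypothetical examples to adapt to your own documents.

```python
import re
from datetime import datetime
from decimal import Decimal, InvalidOperation

# Hypothetical ID format: "INV-" followed by at least four digits.
ID_PATTERN = re.compile(r"INV-\d{4,}")


def review_flags(record: dict) -> list:
    """Return human-readable warnings for a single extracted record."""
    flags = []

    if not ID_PATTERN.fullmatch(str(record.get("invoice_id") or "")):
        flags.append("invoice_id: unexpected format")

    try:
        datetime.strptime(str(record.get("invoice_date") or ""), "%Y-%m-%d")
    except ValueError:
        flags.append("invoice_date: does not parse as YYYY-MM-DD")

    try:
        line_sum = sum((Decimal(a) for a in record.get("line_amounts", [])),
                       Decimal("0"))
        if line_sum != Decimal(record["total"]):
            flags.append("total: does not equal the sum of line amounts")
    except (KeyError, TypeError, InvalidOperation):
        flags.append("total: missing or not numeric")

    return flags
```

An empty list means the record passed these checks, not that it is correct; amounts, dates, and IDs still deserve a visual comparison against the source document.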
Performance Optimization
- Process similar documents together for consistency
- Use batch processing for multiple files
- Save and reuse successful extraction prompts
- Break large documents into smaller sections if needed
- Monitor your usage to stay within plan limits
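
Grouping similar files and splitting long documents can be planned with a few lines of code before uploading. This sketch only computes groups and page ranges; it does not call Transez, and the function names and the filename-prefix convention in the example are assumptions.

```python
from collections import defaultdict


def group_similar(filenames: list, kind_of) -> dict:
    """Group filenames by the document kind that `kind_of` returns."""
    groups = defaultdict(list)
    for name in filenames:
        groups[kind_of(name)].append(name)
    return dict(groups)


def page_ranges(total_pages: int, max_pages_per_section: int) -> list:
    """Split pages 1..total_pages into inclusive (first, last) ranges."""
    if total_pages < 0 or max_pages_per_section < 1:
        raise ValueError("total_pages must be >= 0 and section size >= 1")
    return [
        (start, min(start + max_pages_per_section - 1, total_pages))
        for start in range(1, total_pages + 1, max_pages_per_section)
    ]


# Hypothetical naming convention: the kind is the text before the first "_".
batches = group_similar(
    ["invoice_01.pdf", "receipt_01.pdf", "invoice_02.pdf"],
    kind_of=lambda name: name.split("_")[0],
)
sections = page_ranges(25, 10)
```

Here a 25-page document with a 10-page section limit is split into pages 1 to 10, 11 to 20, and 21 to 25.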