Data Extraction Prompts


15+ ready-to-use prompts for extracting structured data from images, PDFs, and documents


Receipt & Invoice Prompts

1. Basic Receipt Extraction

Extract expense data from receipt images in ~/Cowork-Workspace/input/receipts/

Create an Excel file with columns:
- Date
- Vendor/Store
- Amount
- Currency
- Category (auto-detect: Food, Transport, Office, etc.)
- Payment Method (if visible)
- Notes

Add a totals row at the bottom.
Save to: ~/Cowork-Workspace/output/expenses.xlsx

2. Detailed Receipt Processing

Process receipts in ~/Cowork-Workspace/input/receipts/ with detailed extraction.

Excel structure:
Sheet 1 - Line Items:
- Receipt ID
- Date
- Vendor
- Item Description
- Quantity
- Unit Price
- Total Price
- Category

Sheet 2 - Summary:
- Total by Category
- Total by Vendor
- Total by Month
- Grand Total

Sheet 3 - Receipt Index:
- Receipt ID
- Filename
- Date
- Vendor
- Total
- Status (complete/partial extraction)

Use [European/US] formula syntax.
Save to: ~/Cowork-Workspace/output/detailed-expenses.xlsx

3. Invoice Data Extraction

Extract data from invoices in ~/Cowork-Workspace/input/invoices/

Create Excel with:
- Invoice Number
- Invoice Date
- Due Date
- Vendor Name
- Vendor Address
- Subtotal
- Tax Amount
- Total Amount
- Payment Status (if detectable)
- Line Items (separate sheet)

Flag any invoices that couldn't be fully parsed.
Save to: ~/Cowork-Workspace/output/invoice-tracker.xlsx

PDF Extraction Prompts

4. PDF Table Extraction

Extract tables from PDF documents in ~/Cowork-Workspace/input/pdfs/

For each PDF:
- Identify all tables
- Extract to separate Excel sheets
- Preserve table structure and headers
- Note source PDF and page number

Output: One Excel file per PDF
Save to: ~/Cowork-Workspace/output/extracted-tables/
Create index file: ~/Cowork-Workspace/output/table-index.txt

5. PDF Form Data Extraction

Extract filled form data from PDFs in ~/Cowork-Workspace/input/forms/

Create a structured output:
- One row per form
- Columns for each form field
- Include filename for reference

Handle variations in form completion (empty fields = blank cell).
Save to: ~/Cowork-Workspace/output/form-data.xlsx

6. PDF Contract Key Terms

Extract key terms from contracts in ~/Cowork-Workspace/input/contracts/

For each contract, identify:
- Parties involved
- Effective date
- Term/Duration
- Renewal terms
- Termination clause summary
- Key obligations
- Payment terms
- Important dates

Create Excel with one row per contract.
Save to: ~/Cowork-Workspace/output/contract-summary.xlsx

💡 Claude Legal Note: If you have access to Claude Legal (a Cowork extension), it can automate this contract term extraction and add risk detection and identification of problematic clauses. See Extensions & Plugins.


Image Data Prompts

7. Business Card Extraction

Extract contact information from business card images in ~/Cowork-Workspace/input/cards/

Create Excel with columns:
- Name
- Title
- Company
- Email
- Phone
- Mobile
- Address
- Website
- LinkedIn (if present)
- Source Filename

Save to: ~/Cowork-Workspace/output/contacts.xlsx

Also create a VCF file for import: ~/Cowork-Workspace/output/contacts.vcf
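
For reference, a VCF file is plain text in vCard format, so the output of this prompt is easy to spot-check. The sketch below shows roughly what one record could look like. It assumes vCard 4.0 (where only FN is mandatory), and the dictionary keys (name, company, title, email, phone) are hypothetical names chosen for illustration, not something Cowork is known to produce.

```python
def escape_text(value: str) -> str:
    """Escape characters that are special in vCard text values."""
    return (value.replace("\\", "\\\\")
                 .replace("\n", "\\n")
                 .replace(",", "\\,")
                 .replace(";", "\\;"))


def to_vcard(contact: dict) -> str:
    """Render one contact as a vCard 4.0 record (field names are assumptions)."""
    lines = ["BEGIN:VCARD", "VERSION:4.0", f"FN:{escape_text(contact['name'])}"]
    # Optional properties are written only when the card actually showed them.
    optional = [("ORG", "company"), ("TITLE", "title"),
                ("EMAIL", "email"), ("TEL", "phone")]
    for prop, key in optional:
        value = contact.get(key)
        if value:
            lines.append(f"{prop}:{escape_text(value)}")
    lines.append("END:VCARD")
    # vCard lines are terminated with CRLF.
    return "\r\n".join(lines) + "\r\n"
```

Some older address books import vCard 3.0 more reliably; that version also requires a structured N property, which this sketch does not produce.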

8. Screenshot Data Extraction

Extract data from screenshots in ~/Cowork-Workspace/input/screenshots/

Screenshots contain: [describe what - e.g., "software settings", "web forms", "charts"]

Create structured output with:
- Screenshot filename
- Type of content
- Extracted text/data
- Key values identified

Save to: ~/Cowork-Workspace/output/screenshot-data.xlsx

9. Handwritten Notes Extraction

Extract text from handwritten note images in ~/Cowork-Workspace/input/notes/

Create a document with:
- Source image filename
- Extracted text (best interpretation)
- Confidence notes (unclear sections marked)
- Date if visible

Format: Word document
Save to: ~/Cowork-Workspace/output/transcribed-notes.docx

Structured Data Prompts

10. CSV Cleanup and Standardization

Clean and standardize CSV files in ~/Cowork-Workspace/input/csv-files/

Operations:
- Standardize date formats to YYYY-MM-DD
- Trim whitespace from all cells
- Standardize phone number format
- Normalize country names
- Remove duplicate rows
- Flag data quality issues

Output cleaned files to: ~/Cowork-Workspace/output/cleaned/
Create data quality report: ~/Cowork-Workspace/output/data-quality-report.md
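
To make the operations above concrete, here is a minimal Python sketch of three of them: trimming whitespace, standardizing dates to YYYY-MM-DD, and removing duplicate rows. The list of accepted input date formats is an assumption (day-first is assumed for slash-separated dates), and the function names are illustrative only.

```python
import csv
import io
from datetime import datetime

# Assumed input formats. A value like 03/04/2024 is read day-first here;
# adjust the list for month-first regions.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%d.%m.%Y")


def normalize_date(value):
    """Return the date as YYYY-MM-DD, or None if no known format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(value, fmt).strftime("%Y-%m-%d")
        except ValueError:
            continue
    return None


def clean_csv_text(text, date_columns):
    """Trim cells, standardize dates, drop exact duplicates, collect issues."""
    cleaned, issues, seen = [], [], set()
    reader = csv.DictReader(io.StringIO(text))
    # Row 1 is the header, so data rows are numbered from 2.
    for row_no, raw in enumerate(reader, start=2):
        row = {key: (value or "").strip() for key, value in raw.items()}
        for column in date_columns:
            if row.get(column):
                iso = normalize_date(row[column])
                if iso is None:
                    # Unparseable dates are kept as-is and flagged.
                    issues.append((row_no, column, row[column]))
                else:
                    row[column] = iso
        fingerprint = tuple(row.items())
        if fingerprint in seen:
            continue  # duplicate after cleaning
        seen.add(fingerprint)
        cleaned.append(row)
    return cleaned, issues
```

Duplicates are detected after cleaning, so two rows that differ only in whitespace or date format count as the same row. The collected issues are the kind of entries a data quality report would list.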

11. Multi-Source Data Consolidation

Consolidate data from multiple files in ~/Cowork-Workspace/input/data-sources/

Files may include: [CSV, Excel, text files]
Common field: [specify key field, e.g., "email address"]

Create a master Excel file with:
- All unique records
- Combined data from all sources
- Source tracking (which file each data point came from)
- Duplicate flagging

Save to: ~/Cowork-Workspace/output/consolidated-data.xlsx
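
The consolidation logic this prompt asks for can be sketched in a few lines of Python. This is an illustration under stated assumptions, not how Cowork works internally: records are already loaded as dictionaries, the key field is matched case-insensitively, and the first non-empty value seen for a field wins. The structure of the returned entries is hypothetical.

```python
def consolidate(sources, key_field):
    """Merge records from several files on a common key.

    sources maps a filename to a list of record dicts.
    Returns a dict keyed by the normalized key value.
    """
    master = {}
    for filename, records in sources.items():
        for record in records:
            key = (record.get(key_field) or "").strip().lower()
            if not key:
                continue  # records without a key cannot be matched
            entry = master.setdefault(
                key, {"values": {}, "origin": {}, "occurrences": 0}
            )
            entry["occurrences"] += 1
            for field, value in record.items():
                # Assumption: the first non-empty value wins; later ones are ignored.
                if value not in (None, "") and field not in entry["values"]:
                    entry["values"][field] = value
                    entry["origin"][field] = filename  # source tracking
    for entry in master.values():
        entry["duplicate"] = entry["occurrences"] > 1
    return master
```

Each entry records which file supplied each field, which corresponds to the source tracking column, and the duplicate flag marks keys that appeared in more than one record.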

12. JSON/XML to Excel

Convert JSON/XML files in ~/Cowork-Workspace/input/data/ to Excel format.

For each file:
- Flatten nested structures appropriately
- Create clear column headers
- Handle arrays as multiple rows
- Preserve data types where possible

Save converted files to: ~/Cowork-Workspace/output/converted/
Use original filename with .xlsx extension.
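
"Flatten nested structures" and "handle arrays as multiple rows" can be interpreted in several ways. The sketch below shows one plausible interpretation: nested objects become dotted column names, and each array element becomes its own row with the parent fields repeated. The function name and the dotted naming scheme are assumptions for illustration.

```python
def flatten(value, prefix=""):
    """Turn one parsed JSON value into a list of flat row dicts."""
    if isinstance(value, dict):
        rows = [{}]
        for key, child in value.items():
            child_rows = flatten(child, f"{prefix}{key}.")
            # Combine every existing row with every row produced by the child.
            # Note: two sibling arrays yield the cross product of their items.
            rows = [{**row, **extra} for row in rows for extra in child_rows]
        return rows
    if isinstance(value, list):
        expanded = [row for item in value for row in flatten(item, prefix)]
        # An empty array adds no columns but must not drop the parent row.
        return expanded or [{}]
    # Scalar: the column name is the dotted path without the trailing dot.
    return [{prefix[:-1]: value}]
```

For example, an order with two line items becomes two rows that share the order-level columns:

```python
order = {"id": 7, "customer": {"name": "Ann"}, "items": [{"sku": "A1"}, {"sku": "B2"}]}
rows = flatten(order)
# rows[0] == {"id": 7, "customer.name": "Ann", "items.sku": "A1"}
# rows[1] == {"id": 7, "customer.name": "Ann", "items.sku": "B2"}
```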

Specialized Extraction Prompts

13. Meeting/Event Details Extraction

Extract event details from documents in ~/Cowork-Workspace/input/events/

For each event, capture:
- Event name
- Date
- Time
- Location/Venue
- Organizer
- Attendees (if listed)
- Agenda items
- Special notes

Create Excel with one row per event.
Also create an ICS calendar file for import.
Save the Excel file to: ~/Cowork-Workspace/output/events.xlsx
Save the ICS file to: ~/Cowork-Workspace/output/events.ics
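
An ICS file is plain text in iCalendar format (RFC 5545), so the result can be inspected in any editor. Below is a minimal sketch of what a valid file needs per event: UID, DTSTAMP, and DTSTART. The event dictionary keys (name, start, location), the PRODID string, and the UID scheme are hypothetical; start times are written as floating local times, and line folding for long lines is omitted.

```python
from datetime import datetime, timezone


def escape_text(value: str) -> str:
    """Escape characters that are special in iCalendar text values."""
    return (value.replace("\\", "\\\\")
                 .replace("\n", "\\n")
                 .replace(",", "\\,")
                 .replace(";", "\\;"))


def to_ics(events) -> str:
    """Render a list of event dicts as one iCalendar document."""
    stamp = datetime.now(timezone.utc).strftime("%Y%m%dT%H%M%SZ")
    lines = [
        "BEGIN:VCALENDAR",
        "VERSION:2.0",
        "PRODID:-//example//event-extraction//EN",  # placeholder identifier
    ]
    for index, event in enumerate(events):
        lines += [
            "BEGIN:VEVENT",
            f"UID:{index}-{stamp}@example.invalid",  # must be unique per event
            f"DTSTAMP:{stamp}",
            # No timezone suffix: interpreted as local time by the importer.
            f"DTSTART:{event['start'].strftime('%Y%m%dT%H%M%S')}",
            f"SUMMARY:{escape_text(event['name'])}",
        ]
        if event.get("location"):
            lines.append(f"LOCATION:{escape_text(event['location'])}")
        lines.append("END:VEVENT")
    lines.append("END:VCALENDAR")
    # iCalendar lines are terminated with CRLF.
    return "\r\n".join(lines) + "\r\n"
```

If the source documents state a time zone, it is safer to ask for it explicitly in the prompt, since floating times shift with the importing calendar's settings.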

14. Product/Inventory Data

Extract product information from documents/images in ~/Cowork-Workspace/input/products/

Capture:
- Product name
- SKU/ID (if visible)
- Description
- Price
- Category
- Specifications
- Source document

Create Excel product catalog.
Save to: ~/Cowork-Workspace/output/product-catalog.xlsx

15. Research Paper Data

Extract structured data from research papers in ~/Cowork-Workspace/input/papers/

For each paper:
- Title
- Authors
- Publication date
- Journal/Conference
- Abstract (first 200 words)
- Keywords
- Methodology (brief)
- Key findings (3-5 points)
- Citation count (if available)

Create Excel bibliography.
Save to: ~/Cowork-Workspace/output/research-catalog.xlsx

16. Quote/Estimate Comparison

Extract and compare quotes from ~/Cowork-Workspace/input/quotes/

For each quote:
- Vendor name
- Quote date
- Valid until
- Line items with prices
- Subtotal
- Tax
- Total
- Terms

Create comparison matrix in Excel.
Highlight: lowest price, best terms, recommended option.
Save to: ~/Cowork-Workspace/output/quote-comparison.xlsx

Quality Control Prompts

17. Data Validation Report

Validate data extracted to ~/Cowork-Workspace/output/[previous-output].xlsx

Check for:
- Missing required fields
- Invalid date formats
- Numeric fields with text
- Duplicate entries
- Outlier values
- Inconsistent formatting

Create validation report with:
- Issue type
- Row/Cell reference
- Current value
- Suggested correction

Save to: ~/Cowork-Workspace/output/validation-report.xlsx
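
If you want to double-check a validation report independently, several of the checks above are simple to reproduce. The sketch below covers four of them (missing required fields, invalid dates, text in numeric fields, duplicate rows) and assumes rows are already loaded as dictionaries with dates expected in YYYY-MM-DD. Outlier and formatting checks are left out, and all names are illustrative.

```python
from datetime import datetime


def is_blank(value) -> bool:
    """True for None and for strings that are empty after trimming."""
    return value is None or str(value).strip() == ""


def validate(rows, required, date_fields, numeric_fields):
    """Return a list of issue dicts: issue type, row, field, current value."""
    issues, first_seen = [], {}

    def report(kind, row_no, field, value):
        issues.append({"issue": kind, "row": row_no, "field": field, "value": value})

    # Row 1 is assumed to be the header, so data rows start at 2.
    for row_no, row in enumerate(rows, start=2):
        for field in required:
            if is_blank(row.get(field)):
                report("missing required field", row_no, field, row.get(field))
        for field in date_fields:
            value = row.get(field)
            if not is_blank(value):
                try:
                    datetime.strptime(str(value), "%Y-%m-%d")
                except ValueError:
                    report("invalid date", row_no, field, value)
        for field in numeric_fields:
            value = row.get(field)
            if not is_blank(value):
                try:
                    float(value)
                except (TypeError, ValueError):
                    report("non-numeric value", row_no, field, value)
        fingerprint = tuple(sorted((key, str(val)) for key, val in row.items()))
        if fingerprint in first_seen:
            report("duplicate row", row_no, "", f"same as row {first_seen[fingerprint]}")
        else:
            first_seen[fingerprint] = row_no
    return issues
```

The returned dictionaries map onto the first three report columns; suggested corrections still need human or model judgment.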

Customization Notes

For OCR accuracy:

For regional settings:

For handling errors:

