Partition Excel Files by Column Value for Ingestion or ETL Pipelines

Data engineers frequently receive Excel files from business stakeholders that need to be partitioned by a key column — region, date, entity, status — before loading into a database or downstream pipeline. Writing a one-off pandas script for each new file format is overhead that Deliteful eliminates.

Business-supplied Excel files are rarely partition-friendly out of the box. A finance team sends a single .xlsx with 50,000 rows spanning 12 regions. An ops team sends a consolidated weekly dump that needs to be split by entity before loading into separate schema targets. The standard approach — a quick Python script with openpyxl or pandas — is fast to write once but becomes a maintenance burden when column names shift or new stakeholders send differently-structured files.
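For reference, the kind of one-off script described above might look like the following minimal pandas sketch. It is not Deliteful's implementation. The names `partition_by_column`, `split_workbook`, and `BLANK_LABEL` are hypothetical, and the sketch assumes the data sits on the first sheet with a single header row.

```python
from __future__ import annotations

from pathlib import Path

import pandas as pd

BLANK_LABEL = "_blank"  # hypothetical label for rows whose key cell is empty


def partition_by_column(df: pd.DataFrame, column: str) -> dict[str, pd.DataFrame]:
    """Split df into one DataFrame per unique value of `column`."""
    if column not in df.columns:
        # Header names drift between senders, so report what is available.
        raise KeyError(f"{column!r} not found; available: {list(df.columns)}")
    # Give empty cells an explicit label so those rows are kept, not dropped.
    keys = df[column].astype("object").where(df[column].notna(), BLANK_LABEL)
    keys = keys.astype(str)
    return {key: part for key, part in df.groupby(keys)}


def split_workbook(path: str, column: str, out_dir: str) -> list[Path]:
    """Read the first sheet of `path` and write one .xlsx per partition.

    Reading and writing .xlsx through pandas requires the openpyxl package.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for value, part in partition_by_column(pd.read_excel(path), column).items():
        # Values are used as file names unchanged; sanitise them first if
        # they may contain path separators or other unsafe characters.
        target = out / f"{column}_{value}.xlsx"
        part.to_excel(target, index=False)
        written.append(target)
    return written
```

A call such as `split_workbook("regions.xlsx", "Region", "out")` (placeholder file and column names) would write one workbook per region. The `KeyError` branch is where this kind of script tends to break when a sender renames a header.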

Deliteful handles the split server-side: upload the Excel file, specify the partition column by name, and download one file per unique value. It's not a replacement for a production pipeline, but it's a reliable, reproducible way to handle the 'split this before I can ingest it' step without spinning up a script. Each output is a clean .xlsx file with the original header row, and a single input file can produce up to 200 partitions.

How it works

  1. Create a free Deliteful account

    Sign up with Google OAuth in about 3 clicks. No credit card required.

  2. Upload the Excel file to partition

    Drop in the .xlsx or .xls file you received from a stakeholder, up to 200 MB per file.

  3. Enter the partition column name

    Type the exact column header, e.g., 'Region', 'Entity', or 'Status'.

  4. Download the partitioned files

    Receive one .xlsx per unique column value, ready for ingestion or further transformation.
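If the downstream pipeline expects CSV rather than .xlsx, the downloaded partitions can be converted in bulk. The sketch below assumes the files have been saved to a local folder and that each has a single sheet. The function name `xlsx_partitions_to_csv` and the folder arguments are hypothetical.

```python
from __future__ import annotations

from pathlib import Path

import pandas as pd


def xlsx_partitions_to_csv(src_dir: str, dst_dir: str) -> list[Path]:
    """Convert every .xlsx in src_dir to a same-named .csv in dst_dir."""
    out = Path(dst_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for xlsx_path in sorted(Path(src_dir).glob("*.xlsx")):
        # Reads the first sheet only; pandas needs openpyxl for .xlsx files.
        frame = pd.read_excel(xlsx_path)
        csv_path = out / f"{xlsx_path.stem}.csv"
        # index=False keeps the CSV columns identical to the sheet's header row.
        frame.to_csv(csv_path, index=False)
        written.append(csv_path)
    return written
```

Keeping the file stem unchanged means the partition value stays visible in the CSV name, which can make it easier to route each file to its target schema.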

Frequently asked questions

Can I use this to partition an Excel file before loading into a database?
Yes. Upload the file, specify the partition column, and download one .xlsx per partition. These can then be ingested individually or converted to CSV for your pipeline.
Does Deliteful preserve data types or just treat values as strings?
Deliteful reads and writes values as-is from the Excel file. Numeric and date values are preserved as values, but formulas are not — output files contain computed values only.
What happens to rows where the partition column is blank?
Rows with empty values in the specified column are collected into a single output file rather than being dropped.
What are the file size limits?
Excel files up to 200 MB per file are supported, with a batch limit of 50 files or 2 GB total per run. The tool creates up to 200 output files per input file.
What is the maximum number of output files?
Each input file produces at most 200 output files. Rows whose partition value falls beyond that limit are not included in any output.
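Because rows beyond the 200-file limit are left out, it can be worth counting the distinct values in the partition column before uploading. The following is a minimal local pre-flight check in pandas. The function name `preflight` is hypothetical, and whether the blank-value file counts toward the 200-file limit is an assumption made here, not something the FAQ states.

```python
import pandas as pd

MAX_PARTITIONS = 200  # per-input-file limit stated in the FAQ


def preflight(df: pd.DataFrame, column: str) -> dict:
    """Summarise how `column` would partition before the file is uploaded."""
    values = df[column]
    distinct = int(values.nunique(dropna=True))
    blank_rows = int(values.isna().sum())
    # Assumption: the single file holding blank-key rows counts toward the limit.
    expected_files = distinct + (1 if blank_rows else 0)
    return {
        "distinct_values": distinct,
        "blank_rows": blank_rows,
        "expected_files": expected_files,
        "within_limit": expected_files <= MAX_PARTITIONS,
    }
```

If `within_limit` comes back `False`, one option is to partition on a coarser column, or to split the workbook into smaller inputs first.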

Sign up free with Google and partition your next stakeholder Excel file by column value in under a minute — no script required.