DataPorter — A Rails engine that turns data imports into a self-service feature

If you’ve ever worked on a client-facing Rails app, you know the drill. At some point, someone sends you a CSV. “Can you import this into the app?” Sure, you write a quick Rake task, parse the file, done.

Then the next file arrives. The columns are in a different order. There’s a semicolon separator instead of commas (thanks, European Excel). Some rows have missing data, others have typos in the email field. Your script crashes halfway through, you fix it, re-run, realize 200 rows were already inserted, now you have duplicates…

And the best part: you’re the one running these imports. Every time. In the console. Because your client can’t exactly run rails runner import_contacts.rb on their own.

I got tired of this loop. So I built DataPorter — a Rails engine that turns data imports into a self-service feature. Your clients upload their own files, see a preview of what’s going to happen, fix their mistakes before importing, and you never touch a console again.

SerylLns / data_porter on GitHub

Mountable Rails engine for CSV, XLSX, JSON & API data imports. Declarative DSL, live preview, dry run, real-time progress via ActionCable.

DataPorter

A mountable Rails engine for data import workflows: Upload, Map, Preview, Import.

Supports CSV, JSON, XLSX, and API sources with a declarative DSL for defining import targets. Business-agnostic by design — all domain logic lives in your host app.

Screenshots: import list with status badges, new import modal with dropzone, interactive column mapping with templates, preview with summary cards and data table.

Features

  • 4 source types — CSV, XLSX, JSON, and API with a unified parsing pipeline
  • Interactive column mapping — Drag-free UI to match file headers to target fields (docs)
  • Mapping templates — Save and reuse column mappings across imports (docs)
  • Real-time progress — JSON polling with animated progress bar, no ActionCable required
  • Dry run mode — Validate against the database without persisting
  • Standalone UI — Self-contained layout with Turbo Drive and Stimulus, no host app dependencies
  • Import params — Declare extra form fields (select, text, number, hidden) per target for scoped imports (docs)
  • Per-target source filtering — Each target declares…

The idea

You write a small Target class that describes your import (what columns you expect, how to save a record), and the engine gives you a full UI: file upload with drag-and-drop, interactive column mapping, data preview, real-time progress bar, error reports.

Here’s what a target looks like in practice:

class ProductTarget < DataPorter::Target
  label "Products"
  model_name "Product"
  sources :csv, :xlsx

  columns do
    column :name,  type: :string, required: true
    column :price, type: :decimal
    column :sku,   type: :string
  end

  def persist(record, context:)
    Product.create!(record.attributes)
  end
end
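
Since persist is plain Ruby, it's also the natural place to make imports idempotent and sidestep the duplicate-rows problem from the intro. A minimal sketch, assuming :sku is the natural key and that record.attributes returns symbol keys (the upsert strategy is your call, the engine doesn't impose one):

def persist(record, context:)
  attrs = record.attributes
  # Hypothetical natural key: re-running the same file updates existing
  # products instead of inserting duplicates.
  Product.find_or_initialize_by(sku: attrs[:sku]).update!(attrs)
end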

Mount the engine, visit /imports, done. Your users (or your client’s team) can upload files and import data without bothering you.
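
The mount itself is a single line in your routes. A quick sketch, assuming the conventional engine constant (check the README for the exact mount point):

# config/routes.rb
Rails.application.routes.draw do
  # Standard Rails engine mount; DataPorter::Engine is the conventional constant name.
  mount DataPorter::Engine => "/imports"
end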

What’s in the box

The stuff I kept rebuilding on every project, now built once:

  • CSV, XLSX, JSON, API — Four source types. CSV auto-detects delimiters and encoding (semicolons from European Excel, Latin-1, BOM… the classics)
  • Column mapping — Users match file headers to your fields with dropdowns. They can save mappings as templates for recurring imports
  • Preview step — See parsed data before committing. Required fields highlighted, validation errors visible per row
  • Dry run — “What if I import this?” Runs everything in a transaction and rolls back (see the sketch after this list). Great for letting non-technical users test safely
  • Progress bar — Real-time, no ActionCable needed (just JSON polling)
  • Reject export — After import, download a CSV of failed rows with error messages. Clients love this one
  • Multi-tenant — One config line to scope imports per user, per hotel, per organization — whatever your model is. Polymorphic, so it works with anything
  • Standalone UI — Ships its own layout with Turbo + Stimulus. No asset pipeline dependency, works with any Rails app
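
About that dry run: the trick is plain Rails, not DataPorter-specific machinery. Wrap the run in a transaction, let validations and database constraints fire for real, then roll everything back. A rough sketch of the mechanism (rows, target and context are stand-in variables, not the engine's actual internals):

ActiveRecord::Base.transaction do
  rows.each do |record|
    target.persist(record, context: context) # real inserts, real constraint checks
  end
  # Nothing is committed: the import behaved exactly as a live run would,
  # but the rollback discards every row.
  raise ActiveRecord::Rollback
end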

Why I built it this way

A few decisions that might be interesting:

The engine is completely business-agnostic. It doesn’t know anything about your models. All the domain logic (validation rules, persistence, transformations) lives in your Target classes. The engine just orchestrates the flow.

It’s also designed to work without authentication. If your app has current_user, great, it’ll capture it. If not (internal tool, admin panel), it still works fine. The scope feature is opt-in.

I went with Phlex components for the UI instead of partials. Faster rendering, easier to test, and I genuinely enjoy writing views in Ruby now.
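
If you haven't used Phlex, a component is just a Ruby class whose template method builds HTML with plain method calls. An illustrative example, not one of DataPorter's actual components (the method is view_template in Phlex 2, template in 1.x):

class StatusBadge < Phlex::HTML
  def initialize(status:)
    @status = status
  end

  def view_template
    # Conditionals, loops and extracted helpers are just Ruby,
    # which is what makes these components easy to unit test.
    span(class: "badge badge--#{@status}") { @status.to_s.humanize }
  end
end

With phlex-rails you can render StatusBadge.new(status: :completed) from a controller, an ERB view, or another component.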

Where it’s at

We’re using it in production on a concierge management app (hotel contacts, booking imports). It handles CSV, XLSX, JSON and API imports with the same Target, which is pretty satisfying.

  • 413 specs, 0 failures, 0 RuboCop offenses
  • Ruby >= 3.2, Rails >= 7.0
  • MIT license

I’m also writing a blog series that traces the entire creation of this gem from scratch — the architecture decisions, the TDD workflow, the bugs, everything. Stay tuned if you’re into the “how it was built” side!

Would love your feedback

This is my first published gem so I’m definitely open to criticism:

  • Does the Target DSL feel right? Too magic? Not enough?
  • Missing a feature that would be a dealbreaker for your use case?
  • Anything that looks off in the repo?

Happy to answer any questions!
