
HL7 v2 vs FHIR: Practical Trade-offs in EHR Integration Pipeline Design

Key Takeaways

  • HL7 v2 and FHIR are not competing standards -- they solve different problems. Most real-world integrations involve both, and the design challenge is building a translation layer between them that does not lose clinical semantics.
  • A hub-and-spoke architecture with a FHIR R4 canonical model reduces the number of interfaces from n(n-1) to 2n, but the canonical model design is the hard part -- it determines what information you can and cannot preserve across systems.
  • Mirth Connect is a solid integration engine for healthcare, but its JavaScript-based transformers become a maintenance burden at scale. We ended up building a YAML-based mapping DSL on top of it for the common transformation patterns.
  • Schema drift across EHR versions is the long-term maintenance challenge. A system that works perfectly today will break when one of the connected EHRs upgrades, and the break is often subtle rather than catastrophic.
  • Dead-letter queues with automated error classification are essential. In a production integration processing thousands of messages daily, errors are not exceptional -- they are continuous, and the system needs to handle them without human intervention for the common cases.

The Interoperability Challenge

Healthcare networks that grow through acquisition end up with multiple EHR systems. This is normal. Forcing a system migration on an acquired practice disrupts clinical workflows and is expensive, so the pragmatic approach is to let each entity keep its EHR and build integration infrastructure around the edges. The question is how to build that infrastructure so it does not become an unmaintainable mess.

The naive approach -- point-to-point integration between every pair of systems -- scales quadratically. With 12 systems, you need n(n-1)/2 = 66 bidirectional connections, each with its own mapping logic, error handling, and monitoring. When any system upgrades its API, every connected interface must be updated. We have seen networks attempt this and end up with a team that does nothing but maintain integration code.

The Practical Landscape

In a typical multi-entity healthcare network, you will encounter a mix of EHR platforms. Some (Epic, Cerner, Athenahealth) have production-ready FHIR APIs. Many more communicate via HL7 v2.x -- versions 2.3 through 2.5.1 are common. A few specialty-specific systems have only proprietary REST APIs or flat-file CDA exchange. Some legacy departmental systems communicate exclusively via fax or PDF. Building an integration layer means handling all of these, not just the well-behaved ones.

  • Multiple EHR platforms with incompatible data models, APIs, and communication protocols
  • Point-to-point integration requires n(n-1) unidirectional interfaces with quadratic maintenance cost
  • Fax remains the primary method for inter-office clinical document exchange in many settings
  • Providers lose significant time per referral gathering records manually from other systems
  • Incomplete record visibility leads to duplicate diagnostic tests and missed medication interactions

Integration Architecture: The Hub-and-Spoke Model

The hub-and-spoke pattern is well-established in healthcare integration. A central integration engine mediates all data exchange. Each EHR connects to the hub through a single adapter pair (inbound and outbound), reducing total interfaces from n(n-1) to 2n. When a new system joins the network, you build two adapters instead of n-1. The hub handles routing, transformation, terminology mapping, and error management.

Canonical Data Model

The hub needs a canonical data model -- an intermediate representation that all inbound messages are transformed into before routing to destinations. We chose FHIR R4 as the canonical model for a few reasons: it is a well-specified standard with good tooling, it covers the clinical resource types we need (Patient, Encounter, Condition, MedicationRequest, Observation, DiagnosticReport, AllergyIntolerance, Procedure, DocumentReference), and it is the direction the industry is moving, which means newer systems can be connected with less transformation effort.

  • FHIR R4 canonical model serves as the universal intermediate representation
  • Inbound adapters transform source-specific formats into the canonical model
  • Outbound adapters transform the canonical model into target-specific formats
  • Adding a new system requires 2 adapters instead of n-1 point-to-point connections
  • Canonical model versioning allows evolution without breaking existing adapters

Publish-Subscribe Routing

The hub uses pub-sub for message routing. Each connected system registers subscriptions based on the resource types and patient populations it cares about. When a lab result arrives from the hospital, the hub evaluates subscriptions and routes it to every system that has an active patient record for that patient and subscribes to Observation resources. This prevents flooding systems with irrelevant data. The subscription model also makes it easy to add new consumers without modifying the publisher side.
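
The routing rule above -- match on resource type, then on active patient population -- can be sketched as follows. The subscription shape and names here are illustrative, not a real Mirth or FHIR Subscription API:

```javascript
// Sketch of hub-side subscription matching (hypothetical shapes). A subscription
// names the resource types a system wants plus a predicate over the patient;
// the hub fans each canonical message out to every matching destination.

function matchSubscribers(message, subscriptions) {
  return subscriptions
    .filter(sub =>
      sub.resourceTypes.includes(message.resourceType) &&
      sub.hasActivePatient(message.patientId))
    .map(sub => sub.systemId);
}

// Example: the PCP system wants lab data for its own patients only.
const subscriptions = [
  {
    systemId: "pcp-emr",
    resourceTypes: ["Observation", "DiagnosticReport"],
    hasActivePatient: id => ["mrn-001", "mrn-002"].includes(id),
  },
  {
    systemId: "billing",
    resourceTypes: ["Encounter"],
    hasActivePatient: () => true,
  },
];

const labResult = { resourceType: "Observation", patientId: "mrn-001" };
// matchSubscribers(labResult, subscriptions) → ["pcp-emr"]
```

Because matching is evaluated per message on the hub, adding a consumer means registering one more subscription, with no change on the publishing side.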

HL7 v2 vs FHIR: Where Each Fits

A common misconception is that FHIR replaces HL7 v2. In practice, they coexist. FHIR is the better integration standard -- it is resource-based with explicit references, uses modern web protocols (REST, JSON), and has a well-defined extension mechanism. But the majority of installed EHR systems still communicate primarily via HL7 v2.x, and many will continue to do so for years. The practical question is not "which standard should we use" but "how do we translate between them without losing information."

FHIR-Native Adapters

For systems with production-ready FHIR APIs, the adapter is relatively straightforward. FHIR Subscriptions provide event notification when resources are created or updated. The adapter validates incoming resources against the R4 schema, checks for required extensions (like the network-wide patient identifier), and publishes to the canonical message bus. These adapters are the simplest to build and the most reliable in production, with error rates typically under 0.1%. The main gotcha is that different EHR vendors implement FHIR with different levels of completeness -- some resource types may be read-only, and search parameter support varies.

HL7 v2 to FHIR Translation

This is where most of the engineering effort goes. HL7 v2 is segment-based with implicit context. A single ORU (Observation Result) message might need to be decomposed into a Patient resource, an Encounter resource, multiple Observation resources, and a DiagnosticReport resource, each with proper cross-references. The mapping is not a simple field-to-field exercise -- it requires understanding the semantic relationships embedded in the v2 message structure.

  • ADT messages map to Patient and Encounter FHIR resources, with v2 event types (A01, A02, A03, etc.) driving Encounter status lifecycle
  • ORU messages decompose into Observation, DiagnosticReport, and Specimen resources with referential integrity between them
  • ORM (general order) messages translate to ServiceRequest resources; pharmacy orders, which arrive as RDE messages, map to MedicationRequest
  • Z-segments (custom extensions) require per-system configuration -- these carry institution-specific data that has no standard mapping
  • Character encoding normalization from various legacy encodings (Windows-1252, ISO-8859-1) to UTF-8
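
The ORU decomposition described above can be sketched in a few lines. This is deliberately simplified -- real v2 parsing has to handle escape sequences, field repetition, and per-site Z-segments, and the segment indices below assume a well-formed message:

```javascript
// Minimal sketch of ORU-to-FHIR decomposition (illustrative only).
// One v2 message becomes several FHIR resources with explicit cross-references.

function parseSegments(message) {
  return message.split("\r").map(line => line.split("|"));
}

function oruToFhir(message) {
  const segments = parseSegments(message);
  const pid = segments.find(s => s[0] === "PID");
  const obxs = segments.filter(s => s[0] === "OBX");

  const patient = { resourceType: "Patient", id: pid[3], name: pid[5] };
  const observations = obxs.map((obx, i) => ({
    resourceType: "Observation",
    id: `obs-${i}`,
    code: obx[3],                      // OBX-3 observation identifier
    value: obx[5],                     // OBX-5 observation value
    subject: { reference: `Patient/${patient.id}` },
  }));
  const report = {
    resourceType: "DiagnosticReport",
    subject: { reference: `Patient/${patient.id}` },
    result: observations.map(o => ({ reference: `Observation/${o.id}` })),
  };
  return { patient, observations, report };
}

const oru = [
  "MSH|^~\\&|LAB|HOSP|||202401150830||ORU^R01|123|P|2.5.1",
  "PID|1||MRN001||SMITH^JOHN",
  "OBX|1|NM|2345-7^Glucose^LN||98|mg/dL",
  "OBX|2|NM|718-7^Hemoglobin^LN||14.1|g/dL",
].join("\r");
```

The point of the sketch is the fan-out: one inbound message, four outbound resources, all sharing referential integrity through the Patient reference.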

We built a YAML-based mapping DSL that allows integration analysts to define field mappings declaratively for the common patterns. This reduced the time to onboard a new HL7 v2 system from about 6 weeks of developer time to 2 weeks of analyst configuration, with developers only involved for complex transformation logic that the DSL cannot express. The DSL handles about 80% of mappings; the remaining 20% require custom JavaScript transformers in Mirth.

Mirth Connect Channel Design

We chose Mirth Connect (NextGen Connect Integration Engine) as the integration platform. The reasons were practical: native support for healthcare messaging standards (HL7 v2, FHIR, CDA, DICOM), a channel-based architecture that maps well to hub-and-spoke, JavaScript transformers for custom logic, and a large installed base in healthcare, which means there is institutional knowledge available. We deployed on a two-node active-active cluster behind a load balancer.

Channel Architecture

Each EHR connection is a pair of Mirth channels -- one inbound, one outbound. Inbound channels handle protocol negotiation (TCP/MLLP for HL7 v2, HTTPS for FHIR and REST, SFTP for file-based exchanges), validation, and transformation to the canonical model. Outbound channels subscribe to the canonical bus, filter by subscription rules, transform to the target format, and deliver. This separation of concerns makes channels independently testable and deployable.

  • One inbound and one outbound channel per connected system -- about 24 channels total for a 12-system network
  • Two-node active-active cluster with automatic failover
  • Channel-level monitoring with alerts on queue depth, error rate, and processing latency
  • Message archival with configurable retention for HIPAA compliance
  • Custom dashboard for real-time message flow and system health visibility

Performance Considerations

Our initial implementation had a performance bottleneck in the transformation layer. Complex HL7 v2 to FHIR translations took 800ms to 1.2 seconds per message, which is fine for low volume but problematic at scale. The main culprits were terminology lookups (SNOMED, LOINC, ICD-10 code resolution hitting the database on every message) and repeated JavaScript compilation. We fixed this by adding a terminology mapping cache and pre-compiling transformation scripts. Post-optimization, median transformation time dropped to about 120ms with p99 under 450ms.
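
The terminology cache is the simpler of the two fixes and can be sketched as a memoizing wrapper. The `lookupFromDb` function here is a hypothetical stand-in for the real database call:

```javascript
// Sketch of the terminology-lookup cache. Cache hits skip the per-message
// database round trip that dominated pre-optimization latency.

function makeCachedLookup(lookupFromDb) {
  const cache = new Map();
  return function lookup(system, code) {
    const key = `${system}|${code}`;
    if (!cache.has(key)) {
      cache.set(key, lookupFromDb(system, code));
    }
    return cache.get(key);
  };
}

// Usage: wrap the expensive lookup once at channel deploy time.
let dbCalls = 0;
const lookup = makeCachedLookup((system, code) => {
  dbCalls += 1;                        // stands in for a real DB query
  return { system, code, display: "stubbed display" };
});
lookup("http://loinc.org", "2345-7");
lookup("http://loinc.org", "2345-7"); // served from cache; dbCalls stays 1
```

A production version also needs invalidation when the mapping tables change, but the shape is the same.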

We also added message prioritization. Clinical decision support messages (allergy alerts, drug interaction warnings) route at highest priority with guaranteed fast delivery. Routine demographic updates process at standard priority. Bulk historical data loads process at low priority during off-peak hours to avoid impacting real-time clinical data flow. This was a simple change (priority queues in Mirth) that solved a real problem during our initial data migration phase.

Message Transformation and Schema Drift

Connecting the transport layer is the easier half. The harder problem is ensuring that data means the same thing across systems. A blood pressure observation might be coded as LOINC 85354-9 (panel) in one system and as two separate observations for systolic (8480-6) and diastolic (8462-4) in another. A medication might be recorded by brand name in one system, generic name in another, and NDC code in a third. Patient identity -- determining that "John Smith, DOB 1965-03-15" in Epic is the same person as "Jonathan A. Smith, DOB 03/15/1965" in eClinicalWorks -- requires probabilistic matching.

Terminology Standardization

We run a terminology service that maps local codes to standard terminologies: SNOMED CT for clinical findings, LOINC for lab observations, RxNorm for medications, ICD-10-CM for diagnoses, CPT for procedures. When a message arrives with a local code, the service attempts to map it. If no mapping exists, it flags the code for manual review and passes the original code through with a "local" system identifier. This pragmatic approach -- pass through what you cannot map rather than dropping it -- ensures data flow is never blocked by terminology gaps.

  • Terminology service maintains tens of thousands of code mappings across SNOMED, LOINC, RxNorm, ICD-10, and CPT
  • Automated mapping suggestions using code description similarity get about 75-80% of new codes right
  • Unmapped codes pass through with local identifiers rather than being dropped
  • Monthly terminology reconciliation identifies mapping gaps and updates
  • Medication normalization resolves brand/generic/NDC discrepancies using the RxNorm API
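
The pass-through rule described above -- map when a mapping exists, otherwise keep the original code under a local system identifier and flag it -- can be sketched as follows. The function and field names are illustrative, not a real terminology-service API:

```javascript
// Sketch of map-or-pass-through normalization. Unmapped codes are flagged for
// analyst review but still flow downstream, so terminology gaps never block data.

function normalizeCode(localCode, mappings, reviewQueue) {
  const mapped = mappings.get(localCode.code);
  if (mapped) {
    return { system: mapped.system, code: mapped.code, original: localCode };
  }
  reviewQueue.push(localCode);        // flag for manual mapping
  return {
    system: `urn:local:${localCode.sourceSystem}`,   // "local" system identifier
    code: localCode.code,
    original: localCode,
  };
}

const mappings = new Map([["GLU", { system: "http://loinc.org", code: "2345-7" }]]);
const review = [];
normalizeCode({ sourceSystem: "lab-a", code: "GLU" }, mappings, review);
normalizeCode({ sourceSystem: "lab-a", code: "XYZ" }, mappings, review);
// review now holds the one unmapped code; nothing was dropped
```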

Handling Schema Drift

Schema drift is the quiet killer of integration pipelines. An EHR vendor releases a version update that changes the structure of an HL7 segment, adds a required field, or moves data to a different segment. These changes are often documented in release notes that nobody reads until something breaks. Our approach is defensive: adapters validate incoming messages against a known schema, and any structural deviation that the adapter cannot handle is routed to a review queue rather than silently producing bad data.

We also maintain an Enterprise Master Patient Index (EMPI) for cross-system patient identification. The matching algorithm evaluates demographic fields -- name variants, date of birth, address, phone, insurance identifiers -- with weighted scoring and fuzzy matching (Jaro-Winkler for names, normalized address comparison, phonetic matching). High-confidence matches link automatically; ambiguous matches go to a review queue. The EMPI is foundational -- without accurate patient matching, every other integration feature produces unreliable results.
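
The weighted-scoring idea can be sketched as below. This is a simplification: exact comparisons on normalized fields stand in for the Jaro-Winkler and phonetic matchers described above, and the weights and thresholds are illustrative, not our production values:

```javascript
// Sketch of weighted demographic scoring with auto-link / review / no-match
// thresholds (all numbers illustrative).

const WEIGHTS = { lastName: 0.25, firstName: 0.15, dob: 0.35, phone: 0.25 };
const AUTO_LINK = 0.9;
const REVIEW = 0.6;

const norm = s => (s || "").toLowerCase().replace(/[^a-z0-9]/g, "");

function matchScore(a, b) {
  let score = 0;
  for (const [field, weight] of Object.entries(WEIGHTS)) {
    if (norm(a[field]) && norm(a[field]) === norm(b[field])) score += weight;
  }
  return score;
}

function classify(a, b) {
  const s = matchScore(a, b);
  if (s >= AUTO_LINK) return "auto-link";
  if (s >= REVIEW) return "review-queue";
  return "no-match";
}

const epic = { firstName: "John", lastName: "Smith", dob: "1965-03-15", phone: "555-0100" };
const ecw  = { firstName: "Jonathan", lastName: "Smith", dob: "03/15/1965".split("/").reverse().join("") && "1965-03-15", phone: "5550100" };
// classify(epic, ecw) → "review-queue": DOB, last name, and phone agree,
// but the first-name variant keeps the score below the auto-link threshold
```

In the real system the per-field comparators return graded similarities rather than 0/1, which is what lets "John" vs "Jonathan" contribute partial credit instead of none.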

Retry and Dead-Letter Strategies

In a production integration environment processing thousands of messages daily across multiple systems, errors are continuous, not exceptional. Systems go down for maintenance, network connections drop, message formats change, and disk space fills up. The question is not whether errors will occur but how the system handles them without requiring constant human intervention.

Error Classification

We classify every failed message into one of five categories, each with a different recovery strategy. Transient errors (network timeouts, service temporarily unavailable) get automatic retry with exponential backoff. Validation errors (schema violations, missing required fields) get auto-correction for known patterns or human review for unknown ones. Transformation errors (unmappable codes) are held pending until the terminology team adds a mapping. Delivery errors (target system rejection, duplicate detection) are analyzed for root cause patterns. Fatal errors (data corruption, HIPAA violations) trigger immediate alerts and quarantine.

  • Transient errors: automatic retry with exponential backoff (1s, 5s, 30s, 2m, 10m, 1h) up to 24 hours
  • Validation errors: auto-correction for known patterns (missing segment terminators, date format mismatches), human review queue for unknown patterns
  • Transformation errors: message held in pending state until terminology mapping is added, then reprocessed automatically
  • Delivery errors: duplicate detection resolved automatically, rejections analyzed for recurring patterns
  • Fatal errors: immediate alert, message quarantined for investigation
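
The transient-error schedule in the list above can be sketched as a delay function. The queue and transport mechanics are omitted; only the backoff logic is shown:

```javascript
// Sketch of the transient-error retry schedule (1s, 5s, 30s, 2m, 10m, 1h,
// capped at a 24-hour window). After the schedule is exhausted, the final
// delay repeats until the window closes, then the message is dead-lettered.

const BACKOFF_MS = [1_000, 5_000, 30_000, 120_000, 600_000, 3_600_000];
const MAX_RETRY_WINDOW_MS = 24 * 3_600_000;

function nextRetryDelay(attempt, elapsedMs) {
  if (elapsedMs >= MAX_RETRY_WINDOW_MS) return null;        // give up -> dead-letter
  const delay = BACKOFF_MS[Math.min(attempt, BACKOFF_MS.length - 1)];
  return Math.min(delay, MAX_RETRY_WINDOW_MS - elapsedMs);  // clamp to window
}
```

Returning `null` rather than throwing keeps the decision explicit at the call site: a `null` delay is the signal to reclassify the message and move it to the dead-letter queue.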

Conflict Resolution for Bi-Directional Sync

Bi-directional sync creates a conflict resolution problem: what happens when two systems update the same patient record simultaneously? We use a data domain authority model. Each data element has a designated authoritative system -- the PCP office is authoritative for medication lists, the hospital for inpatient encounters, the lab system for test results, the registration system for demographics. When a conflict is detected, the authoritative system wins. Non-authoritative updates are preserved as secondary versions, visible to clinicians but flagged for reconciliation.
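
The authority model reduces to a lookup table plus a winner/loser split. The system names and domain keys below are illustrative stand-ins for the real registry:

```javascript
// Sketch of data-domain authority resolution. The authoritative system's
// update wins; the other update is kept as a flagged secondary version.

const AUTHORITY = {
  medications: "pcp-emr",
  inpatientEncounters: "hospital-ehr",
  labResults: "lab-system",
  demographics: "registration",
};

function resolveConflict(domain, updateA, updateB) {
  const authoritative = AUTHORITY[domain];
  const [winner, loser] =
    updateA.sourceSystem === authoritative ? [updateA, updateB] : [updateB, updateA];
  return {
    primary: winner,
    secondary: { ...loser, flaggedForReconciliation: true },  // visible, not discarded
  };
}

const fromPcp  = { sourceSystem: "pcp-emr", medList: ["lisinopril 10mg"] };
const fromHosp = { sourceSystem: "hospital-ehr", medList: ["lisinopril 20mg"] };
// resolveConflict("medications", fromHosp, fromPcp).primary is the PCP version
```

Keeping the losing update as a flagged secondary version, rather than discarding it, is what gives clinicians the signal that a reconciliation conversation is needed.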

The automated error recovery pipeline handles the large majority of integration errors without human intervention. The remaining errors are escalated with full context: the original message, the error classification, recovery attempts made, and suggested resolution steps. This context is what makes the errors actionable -- an alert that says "message failed" is useless; an alert that says "ORU message from System X failed transformation because LOINC code 12345-6 is not in the terminology service, here is the message, here is how to add the mapping" is something an analyst can resolve in minutes.

What We Learned

After running an integration platform like this in production for a year, a few lessons crystallized.

Technical Lessons

  • The canonical data model design is the single most consequential architectural decision. What you include in the model determines what information survives translation. Changing the canonical model after production launch is expensive.
  • HL7 v2 to FHIR translation is the bulk of the engineering work and the primary source of bugs. Investing in a declarative mapping layer pays off quickly.
  • Terminology mapping is a continuous process, not a one-time setup. New codes appear regularly, and maintaining the mapping tables requires ongoing analyst time.
  • The EMPI (patient matching) is foundational infrastructure that should be built first, not bolted on later. Bad patient matching undermines everything else.
  • Schema drift from EHR version upgrades is the primary cause of production incidents. Defensive validation at adapter boundaries catches drift before it produces bad data.
  • Error classification with automated recovery handles the majority of issues, but you need good escalation paths for the rest. Context-rich error reports are the difference between a 5-minute fix and a multi-hour investigation.

Organizational Lessons

The harder problems were organizational, not technical. Getting agreement across independent entities on data ownership, terminology standards, and quality expectations took more time than building the integration platform itself. Each connected system has its own IT team with its own priorities, and coordinating EHR upgrades across the network requires more project management than engineering. The integration platform is a shared resource, but the connected systems are owned independently. Establishing clear governance -- who decides what data flows where, who is responsible for data quality, who pays for adapter maintenance -- was essential and non-obvious.

If you are planning a multi-system EHR integration, invest in the canonical model design and the governance framework before writing code. Those two decisions constrain everything that follows, and they are much harder to change after the fact.

Working on EHR Integration?

We Have Been Through It

If you are connecting EHR systems and navigating the HL7 v2 / FHIR landscape, we are happy to share what worked and what did not.
