Skip to content

Introduction to Healthcare Systems

Summary

This chapter provides a comprehensive overview of the healthcare system landscape, covering cost models (fee-for-service vs value-based care), key stakeholders (patients, providers, payers), and critical medical coding systems (ICD, CPT, HCPCS, Drug Codes). You will understand the complexities of electronic health records, medical terminology, clinical workflows, and healthcare interoperability. This domain knowledge is essential for designing effective graph models that accurately represent healthcare data.

Concepts Covered

This chapter covers the following 20 concepts from the learning graph:

  1. Healthcare System
  2. Healthcare Cost
  3. Per-Person Healthcare Cost
  4. Fee-For-Service Model
  5. Value-Based Care
  6. Healthcare Payer
  7. Healthcare Provider
  8. Healthcare Patient
  9. Electronic Health Record
  10. Medical Coding System
  11. ICD Code
  12. CPT Code
  13. HCPCS Code
  14. Drug Code
  15. Medical Terminology
  16. Clinical Workflow
  17. Patient Demographics
  18. Medical Encounter
  19. Healthcare Interoperability
  20. Healthcare Data Exchange

Prerequisites

This chapter builds on concepts from:


Introduction to the Healthcare Ecosystem

The healthcare system represents one of the most complex information ecosystems in modern society, characterized by intricate relationships among patients, providers, payers, regulatory bodies, pharmaceutical companies, and medical device manufacturers. Unlike other industries where data flows are relatively straightforward, healthcare involves multi-directional information exchange across organizational boundaries while maintaining strict privacy and security requirements. Understanding this ecosystem is essential for designing graph data models that accurately capture the interconnected nature of clinical, financial, and administrative data.

The United States healthcare system faces a distinctive challenge: it delivers world-class medical innovation while simultaneously experiencing the highest per-person healthcare costs globally. This paradox stems from systemic inefficiencies in how healthcare services are organized, delivered, and reimbursed. Graph database technologies offer promising solutions to these challenges by enabling more sophisticated analytics on interconnected healthcare data, facilitating the transition from volume-based to value-based care models, and supporting real-time clinical decision-making.

Before diving into data modeling approaches, we must first understand the healthcare domain itself—the stakeholders, workflows, terminology, and data standards that shape how healthcare information is captured, exchanged, and analyzed.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
<summary>Healthcare Ecosystem Stakeholder Diagram</summary>

Use the hc-graph-generator skill to create the following MicroSim called 'hc-stakeholders'

Purpose: Illustrate the three primary stakeholder perspectives in healthcare and their key relationships

Components to show:
- Central triangle with three points labeled: "Patient", "Provider", and "Payer"
- Each stakeholder as a different colored circle (Patient: blue, Provider: green, Payer: orange)
- Bidirectional arrows connecting all three stakeholders
- Secondary entities around the perimeter: Pharmacy, Lab, Hospital, Clinic, Insurance Company, Employer, Government

Relationships (arrows with labels):
- Patient ↔ Provider: "Care Delivery", "Medical Records"
- Provider ↔ Payer: "Claims", "Reimbursement"
- Payer ↔ Patient: "Coverage", "Premiums"
- Provider → Pharmacy: "Prescriptions"
- Provider → Lab: "Test Orders", "Results"
- Payer → Pharmacy: "Formulary Rules", "Reimbursement"
- Employer → Payer: "Group Coverage"
- Government → Payer: "Regulations", "Medicare/Medicaid"

Style: Network diagram with central triangle and satellite nodes

Visual hierarchy:
- Primary stakeholders (larger circles, bold labels)
- Secondary entities (smaller circles, regular labels)
- Primary relationships (thick arrows)
- Secondary relationships (thin arrows)

Color scheme:
- Patient entities: Blue tones
- Provider entities: Green tones
- Payer entities: Orange tones
- Government/regulatory: Gray

Annotations:
- Label showing "Data flows in all directions"
- Note: "Each relationship generates complex data exchanges"

Implementation: SVG-based diagram or Mermaid diagram embedded in markdown

Understanding Healthcare Economics

The Healthcare Cost Crisis

Healthcare cost refers to the total financial resources consumed by healthcare services, encompassing direct medical expenses (physician fees, hospital charges, medications, procedures) as well as indirect costs (administrative overhead, insurance processing, regulatory compliance). In the United States, total healthcare expenditures exceeded $4.3 trillion in 2021, representing approximately 18.3% of the nation's GDP—far exceeding other developed nations where healthcare typically represents 9-12% of GDP.

The per-person healthcare cost metric provides a normalized view of healthcare spending by dividing total expenditures by population. In 2021, the United States averaged approximately $12,900 per person annually, compared to $6,000-7,000 in other high-income countries like Germany, Canada, and France. This dramatic cost differential exists despite comparable or sometimes superior health outcomes in lower-spending nations, indicating systemic inefficiencies in the U.S. healthcare delivery and payment models.

Several factors contribute to elevated U.S. healthcare costs:

  • Administrative complexity: Fragmented payer systems require extensive billing, coding, and authorization processes
  • Fee-for-service incentives: Payment models that reward volume over value
  • Pharmaceutical pricing: Higher drug prices compared to international markets with price controls
  • Defensive medicine: Excessive testing and procedures driven by liability concerns
  • Chronic disease burden: Growing prevalence of expensive chronic conditions requiring long-term management
  • Technology adoption costs: Expensive medical equipment and electronic health record systems
  • Market consolidation: Hospital and provider mergers reducing competition
 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
<summary>Per-Person Healthcare Cost Comparison Chart</summary>
Use the chart-generator skill to create the following chart

Chart type: Bar chart with dual y-axis

Purpose: Compare per-person healthcare spending across countries and show U.S. cost premium

X-axis: Countries (United States, Switzerland, Germany, Sweden, Netherlands, France, Canada, United Kingdom, Japan)

Y-axis (left): Per-person annual healthcare spending (USD)
Y-axis (right): Healthcare spending as % of GDP

Data series 1 (bars - primary y-axis):
- United States: $12,914
- Switzerland: $7,138
- Germany: $6,731
- Sweden: $6,262
- Netherlands: $6,190
- France: $5,564
- Canada: $5,418
- United Kingdom: $5,087
- Japan: $4,691

Data series 2 (line with markers - secondary y-axis):
- United States: 18.3%
- Switzerland: 11.3%
- Germany: 11.7%
- Sweden: 10.9%
- Netherlands: 10.1%
- France: 11.0%
- Canada: 10.8%
- United Kingdom: 10.2%
- Japan: 10.9%

Color scheme:
- U.S. bar: Red (to highlight)
- Other countries: Blue
- Line graph: Orange with circular markers

Title: "Per-Person Healthcare Spending: United States vs. Comparable Nations (2021)"

Annotations:
- Arrow pointing to U.S. bar: "2x higher than comparable nations"
- Shaded region showing "Typical range for developed nations"
- Data source note: "OECD Health Statistics 2022"

Legend: Position top-right showing bar and line meanings

Implementation: Chart.js library with responsive design

Payment Models: Fee-For-Service vs. Value-Based Care

The fee-for-service model (FFS) represents the traditional healthcare payment approach where providers receive reimbursement for each individual service, procedure, test, or visit performed. Under FFS, a physician conducting a 15-minute office visit, ordering two lab tests, and performing a minor procedure would bill separately for each component. This model creates problematic incentives: providers generate more revenue by delivering more services, regardless of whether those services improve patient outcomes or represent the most efficient care pathway.

Fee-for-service contributes to healthcare cost escalation through several mechanisms:

  • Volume incentives: Providers are rewarded for quantity rather than quality of care
  • Fragmented care: Each specialist focuses on their narrow domain without coordinating across the patient's complete care needs
  • Overutilization: Financial incentives favor performing additional tests and procedures
  • Reactive rather than preventive: Payment occurs when patients are sick, not for keeping them healthy
  • Administrative burden: Each service requires separate coding, billing, and claims processing

Here's a comparison of the fundamental differences between payment models:

Dimension Fee-For-Service Value-Based Care
Payment basis Per service/procedure Per patient or outcome
Risk bearer Payer assumes risk Provider assumes partial/full risk
Primary incentive Maximize service volume Improve outcomes, reduce costs
Care coordination Minimal Essential
Data requirements Service codes, charges Outcomes, quality metrics, costs
Preventive care focus Low High
Technology needs Billing systems Analytics, predictive models
Provider mindset "How many patients can I see?" "How can I keep patients healthy?"

Value-based care (VBC) represents a fundamental restructuring of healthcare economics, where providers receive compensation based on patient health outcomes rather than service volume. Under value-based models, providers might receive a fixed payment per patient (capitation), bonus payments for achieving quality benchmarks, or shared savings when they deliver care more efficiently than baseline costs. This alignment of financial incentives with patient outcomes theoretically encourages providers to emphasize prevention, care coordination, and evidence-based medicine.

Value-based care models include several variants:

  • Pay-for-Performance (P4P): Bonus payments for meeting quality metrics
  • Bundled Payments: Single payment covering all services for an episode of care
  • Accountable Care Organizations (ACOs): Provider groups sharing responsibility for patient populations
  • Capitation: Fixed per-patient-per-month payment regardless of services consumed
  • Shared Savings/Risk: Providers share financial gains from cost reductions while maintaining quality
  1
  2
  3
  4
  5
  6
  7
  8
  9
 10
 11
 12
 13
 14
 15
 16
 17
 18
 19
 20
 21
 22
 23
 24
 25
 26
 27
 28
 29
 30
 31
 32
 33
 34
 35
 36
 37
 38
 39
 40
 41
 42
 43
 44
 45
 46
 47
 48
 49
 50
 51
 52
 53
 54
 55
 56
 57
 58
 59
 60
 61
 62
 63
 64
 65
 66
 67
 68
 69
 70
 71
 72
 73
 74
 75
 76
 77
 78
 79
 80
 81
 82
 83
 84
 85
 86
 87
 88
 89
 90
 91
 92
 93
 94
 95
 96
 97
 98
 99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
<summary>Healthcare Payment Model Comparison MicroSim</summary>
Type: microsim

Learning objective: Enable students to explore how different payment models affect provider behavior, healthcare costs, and patient outcomes through an interactive simulation

Canvas layout (1000x700px):
- Top section (1000x150): Title and model selector
- Left side (650x550): Visualization area showing patient flow and costs
- Right side (350x550): Control panel and metrics dashboard

Visual elements in simulation area:
- 10 patient icons moving through a healthcare journey
- Provider building (hospital icon)
- Service stations: Office Visit, Lab Tests, Imaging, Procedures, Follow-up
- Money indicators ($) showing costs generated
- Health meter for each patient (green=healthy, yellow=moderate, red=poor)
- Timeline showing days elapsed

Interactive controls (right panel):

1. Payment Model Selector (radio buttons):
   - Fee-For-Service
   - Value-Based Care (Bundled Payment)
   - Value-Based Care (Capitation)
   - Value-Based Care (ACO Shared Savings)

2. Patient Population Characteristics (sliders):
   - Chronic disease prevalence: 0-50% (default: 30%)
   - Health literacy level: Low/Medium/High (default: Medium)
   - Complexity of conditions: Simple/Mixed/Complex (default: Mixed)

3. Provider Behavior Parameters (sliders):
   - Care coordination investment: 0-100% (default: 50%)
   - Preventive care emphasis: 0-100% (default: 50%)
   - Technology adoption: 0-100% (default: 50%)

4. Simulation Controls:
   - "Start Simulation" button
   - "Reset" button
   - Speed slider: 1x to 10x (default: 3x)
   - "Pause/Resume" button

Metrics Dashboard (updates in real-time):
- Total Cost per Patient: $X,XXX
- Number of Services per Patient: X.X
- Average Health Outcome Score: XX/100
- Provider Revenue: $XX,XXX
- Provider Margin: XX%
- Preventive Services Ratio: XX%
- Emergency Visit Rate: XX%
- Patient Satisfaction: XX/100

Default parameters for each payment model:

Fee-For-Service:
- Base office visit: $150
- Lab test: $100
- Imaging: $400
- Procedure: $800
- Follow-up: $120
- Provider incentive: Maximize service volume
- Typical services per patient: 8-12
- Total cost per patient: $3,500-5,000

Bundled Payment (episode of care):
- Bundle payment: $3,000 per episode
- Provider keeps savings if costs < $3,000
- Provider loses money if costs > $3,000
- Provider incentive: Efficient care delivery
- Typical services per patient: 5-7
- Total cost per patient: $2,500-3,200

Capitation:
- Per-member-per-month: $250
- Covers all services for time period
- Provider incentive: Prevention and care coordination
- Typical services per patient: 3-5
- Total cost per patient: $1,800-2,800

Behavior modeling:

Under Fee-For-Service:
- Patients undergo more diagnostic tests
- More procedures performed
- More follow-up visits scheduled
- Less investment in prevention
- Higher short-term costs
- Outcomes: moderate improvement

Under Value-Based Care:
- More preventive services
- Better care coordination
- Fewer unnecessary tests
- More patient education
- Lower long-term costs
- Outcomes: greater improvement

Animation behavior:
- Patient icons move through service stations
- $ icons pop up when services delivered
- Health meters change color based on care received
- Cost counter increments
- Comparison metrics update continuously
- At end, show side-by-side comparison of models

Visual feedback:
- Highlight differences in care pathways between models
- Show cost accumulation differently (rapid under FFS, controlled under VBC)
- Display outcome improvements with color changes
- Graph total cost trajectory over time

Educational annotations:
- Hover over service stations to see costs and frequency
- Click patient icons to see individual care journey
- Hover over metrics to see explanations
- Display key insights when simulation completes

Implementation notes:
- Use p5.js for rendering and animation
- Store model parameters in JavaScript objects
- Implement simple probability models for care decisions
- Use random variation within realistic ranges
- Ensure reproducible results with seed option
- Include "Explanation" mode that pauses and highlights key decision points

Implementation: p5.js with HTML/CSS controls, deployed in /docs/sims/healthcare-payment-models/

The transition from fee-for-service to value-based care represents one of the most significant ongoing transformations in U.S. healthcare. This shift creates substantial data challenges: VBC requires comprehensive patient data across time and care settings, sophisticated risk stratification models, real-time quality measurement, and predictive analytics to identify high-risk patients before expensive complications occur. Graph databases excel at these requirements by naturally representing the complex, interconnected relationships among patients, providers, conditions, treatments, and outcomes.

Key Healthcare Stakeholders

The Healthcare Patient

The healthcare patient represents an individual receiving or seeking medical services, but in data modeling terms, patients are far more than simple demographic records. A patient embodies a complex information entity with temporal clinical history, multiple concurrent conditions, medication regimens, procedure histories, care team relationships, insurance coverage, care preferences, family medical history, social determinants of health, and longitudinal outcomes. Modern healthcare informatics increasingly recognizes patients as active participants in their care rather than passive recipients, which implies bidirectional information flows and patient-generated health data.

Patient demographics encompass the core identifying and descriptive attributes of individuals within healthcare systems, including age, gender, race, ethnicity, language preferences, contact information, emergency contacts, and social determinants of health such as education level, housing stability, food security, and transportation access. While traditional healthcare systems limited demographics to administrative identifiers, contemporary population health approaches recognize that demographic and social factors significantly influence health outcomes and care utilization patterns. Graph models naturally accommodate this complexity by representing demographic attributes as node properties while allowing flexible connections to social determinant nodes that may be shared across patient populations.

From a graph modeling perspective, patient nodes serve as central hubs connecting to:

  • Provider relationships (primary care physician, specialists, care team members)
  • Encounter history (office visits, emergency department visits, hospitalizations, telehealth sessions)
  • Condition and diagnosis nodes (chronic diseases, acute conditions, resolved issues)
  • Medication regimens (current prescriptions, historical medications, allergies)
  • Procedure history (surgeries, diagnostic tests, imaging studies, treatments)
  • Insurance coverage (current and historical payer relationships)
  • Clinical documents (lab results, radiology reports, clinical notes)
  • Care plans and treatment protocols
  • Family relationships and medical history
  • Social determinants and community resources

The Healthcare Provider

The healthcare provider encompasses individuals and organizations delivering medical services, including physicians (primary care and specialists), nurses, physician assistants, nurse practitioners, therapists, hospitals, clinics, urgent care centers, skilled nursing facilities, home health agencies, and ancillary service providers such as laboratories and imaging centers. In healthcare data ecosystems, providers function as both data generators (creating clinical documentation, ordering tests, prescribing medications) and data consumers (reviewing patient histories, analyzing test results, coordinating care across teams).

Provider entities in graph models require rich property sets and relationship structures:

Individual provider properties:

  • National Provider Identifier (NPI)
  • Specialties and subspecialties
  • Board certifications
  • License jurisdictions
  • Practice locations
  • Hospital affiliations
  • Accepting new patients status
  • Languages spoken

Organizational provider properties:

  • Facility type (hospital, clinic, urgent care, etc.)
  • Bed capacity (for hospitals)
  • Service lines offered
  • Accreditation status
  • Quality ratings
  • Medicare/Medicaid participation
  • Insurance networks participated

Provider relationships in graph models:

  • Affiliation: Individual provider → Organization provider
  • Referral networks: Provider → Provider (referral patterns)
  • Care team: Multiple providers → Patient (coordinated care)
  • Coverage arrangements: Provider → Provider (call coverage, backup)
  • Supervision: Attending physician → Resident/Fellow
  • Consultation: Requesting provider → Consulting provider

The Healthcare Payer

The healthcare payer represents entities that finance healthcare services, primarily insurance companies (commercial insurers, Blue Cross Blue Shield plans), government programs (Medicare, Medicaid, TRICARE, Veterans Affairs), and self-insured employers. Payers play a critical role in healthcare data ecosystems by adjudicating claims, negotiating provider payment rates, establishing coverage policies, managing formularies, detecting fraud and abuse, and increasingly, driving quality improvement initiatives through value-based payment models.

Payer organizations maintain extensive data on:

  • Member enrollment and eligibility
  • Benefit plan designs and coverage rules
  • Provider networks and contracted rates
  • Claims history (submitted, adjudicated, paid, denied)
  • Prior authorization requirements and approvals
  • Utilization management (case management, disease management)
  • Quality metrics and performance scorecards
  • Fraud, waste, and abuse detection patterns
  • Pharmacy benefits and formulary rules
  • Care management programs

The payer-provider-patient triangle creates complex data exchange requirements:

Data Flow Information Exchanged Purpose
Patient → Payer Enrollment applications, eligibility verification requests Establish/confirm coverage
Payer → Patient Insurance cards, benefit explanations, claim denials, EOBs Communicate coverage details
Provider → Payer Claims, prior authorization requests, medical records Seek reimbursement, approval
Payer → Provider Claim adjudication results, authorization decisions, payment Reimburse services, manage utilization
Payer → Patient → Provider Insurance information, coverage details Enable billing and care decisions

Clinical Operations and Healthcare Data

Medical Encounters

A medical encounter represents any interaction between a patient and healthcare provider for the purpose of assessment, diagnosis, treatment, counseling, or preventive care. Encounters vary widely in type, setting, duration, and complexity, ranging from brief telehealth check-ins to multi-day intensive care hospitalizations. From a data modeling perspective, encounters serve as temporal containers that link patients, providers, locations, diagnoses, procedures, medications, and charges within a specific timeframe.

Encounter types include:

  • Ambulatory/Outpatient: Office visits, clinic appointments, urgent care visits
  • Emergency: Emergency department visits for acute conditions
  • Inpatient: Hospital admissions requiring overnight stays
  • Observation: Short-term hospital monitoring without formal admission
  • Surgical: Operating room procedures (may be inpatient or outpatient)
  • Telehealth: Virtual visits via video, phone, or asynchronous messaging
  • Home Health: Provider visits to patient's residence
  • Skilled Nursing: Care in long-term care facilities
  • Hospice: End-of-life care services

Each encounter generates substantial structured and unstructured data:

Structured encounter data:

  • Encounter ID (unique identifier)
  • Encounter type and class
  • Admission/start date-time
  • Discharge/end date-time
  • Primary and secondary diagnoses (ICD codes)
  • Procedures performed (CPT/HCPCS codes)
  • Chief complaint and reason for visit
  • Attending provider and care team
  • Facility and department location
  • Disposition (discharged home, admitted, transferred, etc.)
  • Length of stay
  • Total charges and expected reimbursement

Unstructured encounter data:

  • Provider clinical notes (history and physical, progress notes, discharge summaries)
  • Nursing documentation
  • Radiology and lab reports
  • Pathology findings
  • Operative reports
  • Consultation notes

View Encounter Workflow Fullscreen

  1
  2
  3
  4
  5
  6
  7
  8
  9
 10
 11
 12
 13
 14
 15
 16
 17
 18
 19
 20
 21
 22
 23
 24
 25
 26
 27
 28
 29
 30
 31
 32
 33
 34
 35
 36
 37
 38
 39
 40
 41
 42
 43
 44
 45
 46
 47
 48
 49
 50
 51
 52
 53
 54
 55
 56
 57
 58
 59
 60
 61
 62
 63
 64
 65
 66
 67
 68
 69
 70
 71
 72
 73
 74
 75
 76
 77
 78
 79
 80
 81
 82
 83
 84
 85
 86
 87
 88
 89
 90
 91
 92
 93
 94
 95
 96
 97
 98
 99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
<summary>Clinical Encounter Workflow Diagram</summary>
Use the mermaid-generator skill to create the following MicroSim

Name: encounter-workflow

Purpose: Illustrate the typical flow of a patient encounter from arrival through billing, showing data capture points and system interactions

Visual style: Swimlane flowchart with four lanes representing different roles/systems

Swimlanes:
1. Patient
2. Clinical Staff (Registration, Nursing, Physician)
3. Clinical Systems (EHR, CPOE, Lab/Imaging)
4. Administrative Systems (Billing, Claims)

Workflow steps:

**Patient Lane:**
1. Patient Arrives
   Hover: "Patient presents to clinic or hospital"

2. Provides Information
   Hover: "Demographics, insurance, chief complaint"

3. Undergoes Care
   Hover: "Examination, tests, procedures, treatments"

4. Receives Instructions
   Hover: "Discharge instructions, prescriptions, follow-up appointments"

5. Receives Bill/EOB
   Hover: "Explanation of Benefits showing charges and coverage"

**Clinical Staff Lane:**
1. Registration
   Hover: "Verify identity, check insurance eligibility, collect copay"
   Data captured: Demographics, insurance, reason for visit

2. Triage/Vitals
   Hover: "Record vital signs, chief complaint, medications, allergies"
   Data captured: Blood pressure, temperature, pulse, height, weight

3. Provider Assessment
   Hover: "History, physical exam, differential diagnosis"
   Data captured: History of present illness, review of systems, exam findings

4. Order Entry
   Hover: "Labs, imaging, medications, procedures"
   Data captured: Orders sent to CPOE

5. Results Review
   Hover: "Interpret test results, revise diagnosis"
   Data captured: Results integrated into EHR

6. Documentation
   Hover: "Clinical notes, diagnosis codes, procedure codes"
   Data captured: Encounter note, ICD codes, CPT codes

7. Discharge/Follow-up
   Hover: "Prescriptions, referrals, next appointment"
   Data captured: Care plan, prescriptions, referral orders

**Clinical Systems Lane:**
1. EHR: Create Encounter
   Hover: "New encounter record created with patient context"

2. CPOE: Receive Orders
   Hover: "Computerized Provider Order Entry processes orders"

3. Lab/Imaging: Perform Tests
   Hover: "Diagnostic services execute ordered tests"

4. Results Interface
   Hover: "Test results flow back to EHR via HL7/FHIR"

5. EHR: Document Finalization
   Hover: "Provider signs note, triggers billing process"

**Administrative Systems Lane:**
1. Eligibility Check
   Hover: "Real-time verification of insurance coverage"
   System: Claims clearinghouse or payer portal

2. Charge Capture
   Hover: "Extract billable services from clinical documentation"
   System: Charge Description Master (CDM)

3. Medical Coding
   Hover: "Assign ICD, CPT, HCPCS codes to encounter"
   System: Coding module (may use AI/NLP assistance)

4. Claim Generation
   Hover: "Create CMS-1500 or UB-04 claim with all required data elements"
   System: Billing system

5. Claim Submission
   Hover: "Electronic submission to payer via EDI 837 transaction"
   System: Claims clearinghouse

6. Adjudication Wait
   Hover: "Payer reviews claim, determines payment"
   System: Payer adjudication engine

7. Payment Posting
   Hover: "Record payment, contractual adjustments, patient responsibility"
   System: Billing system accounts receivable

Decision points:
- After Registration: Is eligibility active? (Yes → Continue, No → Patient pay or reschedule)
- After Provider Assessment: Admit to hospital? (Yes → Inpatient workflow, No → Continue outpatient)
- After Documentation: Coding complete? (Yes → Submit claim, No → Return to coder)
- After Claim Submission: Claim accepted? (Yes → Wait for payment, No → Fix errors and resubmit)

Color coding:
- Patient activities: Blue
- Clinical activities: Green
- System processes: Purple
- Financial processes: Orange
- Decision points: Yellow diamonds

Annotations:
- Typical timeframes for key stages
- Data standards used (HL7, FHIR, X12)
- Common pain points and delays

Implementation: Mermaid diagram or interactive SVG with hover states, embedded in markdown or as separate HTML file

Clinical Workflows

Clinical workflow describes the sequence of tasks, decisions, handoffs, and information exchanges that occur during healthcare delivery. Clinical workflows span multiple timeframes—from seconds (responding to a cardiac arrest) to years (managing chronic disease progression)—and involve coordination across diverse roles, systems, and organizations. Effective graph modeling of healthcare data requires understanding these workflows because they determine how data elements relate temporally and causally.

Common clinical workflows include:

Ambulatory Care Workflow: 1. Appointment scheduling and pre-visit planning 2. Check-in and registration 3. Triage and vital signs collection 4. Provider encounter (history, examination, assessment, plan) 5. Order entry (labs, imaging, medications, referrals) 6. Patient education and discharge instructions 7. Follow-up appointment scheduling 8. Results notification and management

Inpatient Care Workflow: 1. Admission (emergency department, direct admit, transfer) 2. Initial assessment and order set activation 3. Daily rounding and progress notes 4. Order management and care plan updates 5. Multidisciplinary care coordination (nursing, pharmacy, case management, therapy) 6. Transition planning (discharge planning, post-acute care arrangements) 7. Discharge and follow-up

Medication Management Workflow: 1. Provider prescribing (with clinical decision support checks) 2. Pharmacist verification and screening 3. Dispensing and labeling 4. Nursing administration (inpatient) or patient pickup (outpatient) 5. Medication reconciliation at transitions of care 6. Adherence monitoring and refill management 7. Adverse event monitoring and reporting

Electronic Health Records

The electronic health record (EHR) serves as the digital repository for patient clinical data, replacing paper charts with structured and unstructured electronic information. EHR systems capture, store, and present patient data to support clinical decision-making, care coordination, quality measurement, and regulatory reporting. Unlike simple digitized records, modern EHRs include clinical decision support, computerized provider order entry (CPOE), interoperability interfaces, patient portals, population health analytics, and revenue cycle integration.

Major EHR vendors include Epic, Cerner (Oracle Health), Meditech, Allscripts, athenahealth, and eClinicalWorks, each with proprietary data models and varying interoperability capabilities. This fragmentation creates challenges for healthcare data integration, as different EHRs structure information differently despite adherence to common standards like HL7 and FHIR.

Core EHR functionality:

  • Clinical documentation: Notes, templates, voice recognition, natural language processing
  • Medication management: ePrescribing, medication reconciliation, drug interaction checking
  • Order entry: Labs, imaging, procedures, consultations with clinical decision support
  • Results management: Lab, pathology, and radiology result review and acknowledgment
  • Problem lists: Active and historical diagnoses and conditions
  • Care planning: Treatment protocols, goals, interventions, care team coordination
  • Patient portal: Secure messaging, test result access, appointment scheduling, bill payment

EHR data challenges for graph modeling:

  • Data model variability: Each EHR structures data differently (relational tables, objects, documents)
  • Unstructured content: Clinical notes contain rich information not captured in structured fields
  • Temporal complexity: Data elements have effective dates, update histories, and validity periods
  • Relationship inference: Many relationships are implicit and must be inferred from context
  • Data quality issues: Missing data, inconsistent terminology, duplicate records, data entry errors

Medical Terminology and Coding Systems

Medical Terminology

Medical terminology represents the specialized language of healthcare, built from Greek and Latin roots combined systematically to describe anatomical structures, physiological processes, pathological conditions, diagnostic procedures, and therapeutic interventions. Medical terms follow consistent construction rules: roots (word cores), prefixes (modifiers preceding roots), and suffixes (modifiers following roots). For example, "gastroenterology" combines "gastro-" (stomach), "entero-" (intestine), and "-logy" (study of) to indicate the study of digestive system disorders.

Understanding medical terminology is essential for healthcare data modeling because:

  • Precision: Medical terms convey specific clinical meanings that general language cannot capture
  • Standardization: Consistent terminology enables clear communication across providers and systems
  • Semantic relationships: Term structure reveals hierarchical and relational connections (e.g., all "-itis" terms indicate inflammation)
  • Code mapping: Medical terminology forms the foundation for medical coding systems

Common medical terminology patterns:

Component Type Example Meaning
cardi- Root cardiology Heart
-itis Suffix arthritis Inflammation
hyper- Prefix hypertension Above/excessive
-ectomy Suffix appendectomy Surgical removal
nephro- Root nephrology Kidney
-pathy Suffix neuropathy Disease/disorder
brady- Prefix bradycardia Slow
-plasty Suffix rhinoplasty Surgical repair

Medical Coding Systems Overview

A medical coding system translates clinical documentation (diagnoses, procedures, services, supplies, medications) into standardized alphanumeric codes used for claims submission, statistical analysis, quality measurement, and population health management. Medical coding serves as the bridge between clinical care and healthcare finance, enabling payers to determine reimbursement based on documented services. Multiple coding systems coexist in healthcare, each serving distinct purposes and governed by different organizations.

The primary medical coding systems in U.S. healthcare are:

  • ICD (International Classification of Diseases): Diagnosis and procedure codes
  • CPT (Current Procedural Terminology): Physician services and procedures
  • HCPCS (Healthcare Common Procedure Coding System): Services, supplies, and equipment not in CPT
  • NDC (National Drug Codes): Pharmaceutical products and medications
  • LOINC (Logical Observation Identifiers Names and Codes): Lab and clinical observations
  • SNOMED CT (Systematized Nomenclature of Medicine Clinical Terms): Comprehensive clinical terminology
  • RxNorm: Normalized medication names and relationships
  1
  2
  3
  4
  5
  6
  7
  8
  9
 10
 11
 12
 13
 14
 15
 16
 17
 18
 19
 20
 21
 22
 23
 24
 25
 26
 27
 28
 29
 30
 31
 32
 33
 34
 35
 36
 37
 38
 39
 40
 41
 42
 43
 44
 45
 46
 47
 48
 49
 50
 51
 52
 53
 54
 55
 56
 57
 58
 59
 60
 61
 62
 63
 64
 65
 66
 67
 68
 69
 70
 71
 72
 73
 74
 75
 76
 77
 78
 79
 80
 81
 82
 83
 84
 85
 86
 87
 88
 89
 90
 91
 92
 93
 94
 95
 96
 97
 98
 99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
<summary>Medical Coding Systems Interactive Infographic</summary>
Type: infographic

Purpose: Provide a visual overview of the major medical coding systems, their purposes, governing bodies, and relationships

Layout: Central hub-and-spoke design with "Medical Coding Systems" at center

Main coding systems (spokes radiating from center):

1. **ICD (International Classification of Diseases)**
   Position: Top center
   Color: Red
   Icon: Medical diagnosis symbol
   Click to expand details:
   - Full name: International Classification of Diseases, 10th/11th Revision
   - Governing body: World Health Organization (WHO)
   - U.S. version: ICD-10-CM (Clinical Modification) for diagnoses
   - U.S. procedure codes: ICD-10-PCS (Procedure Coding System)
   - Purpose: Classify diseases, injuries, causes of death
   - Structure: Hierarchical categories with 3-7 character codes
   - Example: E11.9 = Type 2 diabetes mellitus without complications
   - Use cases: Claims, epidemiology, quality measures, mortality statistics
   - Update frequency: Annual (October 1)

2. **CPT (Current Procedural Terminology)**
   Position: Upper right
   Color: Blue
   Icon: Stethoscope/procedure symbol
   Click to expand details:
   - Full name: Current Procedural Terminology
   - Governing body: American Medical Association (AMA)
   - Purpose: Describe physician services, procedures, diagnostic tests
   - Structure: 5-digit codes with optional 2-digit modifiers
   - Categories: Category I (common procedures), Category II (quality measures), Category III (emerging procedures)
   - Example: 99213 = Office visit, established patient, level 3
   - Use cases: Professional fee billing, utilization tracking, performance measurement
   - Update frequency: Annual (January 1)

3. **HCPCS (Healthcare Common Procedure Coding System)**
   Position: Right
   Color: Green
   Icon: Medical supply/durable medical equipment
   Click to expand details:
   - Full name: Healthcare Common Procedure Coding System
   - Governing body: Centers for Medicare & Medicaid Services (CMS)
   - Structure: Level I = CPT codes, Level II = National codes (A-V)
   - Purpose: Services, supplies, equipment, ambulance, drugs not in CPT
   - Example: E0163 = Commode chair, mobile or stationary
   - Use cases: Medicare/Medicaid billing, durable medical equipment, supplies
   - Update frequency: Quarterly

4. **NDC/Drug Codes**
   Position: Lower right
   Color: Purple
   Icon: Pill/medication bottle
   Click to expand details:
   - Full name: National Drug Code
   - Governing body: U.S. Food and Drug Administration (FDA)
   - Structure: 10-11 digit code (labeler-product-package)
   - Purpose: Uniquely identify drug products
   - Example: 0071-0155-23 = Lipitor 20mg, 90 tablets
   - Related systems: RxNorm (normalized names), GPI (therapeutic classification)
   - Use cases: Pharmacy billing, formulary management, drug utilization review
   - Update frequency: Continuous as products registered

5. **LOINC (Lab/Clinical Observations)**
   Position: Lower left
   Color: Orange
   Icon: Laboratory test tube
   Click to expand details:
   - Full name: Logical Observation Identifiers Names and Codes
   - Governing body: Regenstrief Institute
   - Purpose: Identify laboratory and clinical observations
   - Structure: Numeric codes with six-part names
   - Example: 2339-0 = Glucose [Mass/volume] in Blood
   - Use cases: Lab results exchange, clinical observations, vital signs
   - Update frequency: Biannually

6. **SNOMED CT (Clinical Terminology)**
   Position: Left
   Color: Teal
   Icon: Medical hierarchy tree
   Click to expand details:
   - Full name: Systematized Nomenclature of Medicine Clinical Terms
   - Governing body: International Health Terminology Standards Development Organisation (IHTSDO)
   - Purpose: Comprehensive clinical terminology and ontology
   - Structure: Concept codes with hierarchical relationships
   - Example: 44054006 = Diabetes mellitus type 2
   - Use cases: EHR documentation, clinical decision support, semantic interoperability
   - Update frequency: Biannual releases

Relationship connectors (lines between spokes):
- ICD ↔ CPT: "Often billed together" (dotted line)
- CPT ↔ HCPCS: "HCPCS Level I = CPT" (solid line)
- ICD ↔ SNOMED CT: "SNOMED more granular than ICD" (dotted line)
- NDC ↔ HCPCS: "Some HCPCS codes map to drug categories" (dotted line)
- All systems → Center: "All contribute to healthcare data ecosystem"

Interactive features:
- Hover over coding system: Show brief description and icon highlight
- Click coding system: Expand detailed panel with full information
- Click relationship line: Show mapping examples between systems
- Toggle button: "Show update timelines" (displays timeline view of update frequencies)
- Search box: Enter a condition/procedure to see which coding systems apply

Visual enhancements:
- Each coding system uses distinct color
- Icons represent primary use case
- Pulsing animation on hover
- Smooth expand/collapse animations
- Clean, modern flat design

Annotations:
- "All U.S. healthcare claims must include ICD diagnosis codes"
- "CPT and HCPCS together describe all billable services"
- "Multiple coding systems often describe the same clinical concept"

Footer information:
- "Understanding these coding systems is essential for healthcare data modeling"
- "Graph databases can represent relationships among codes from different systems"

Implementation: HTML/CSS/JavaScript with SVG or Canvas, responsive design, deployed as standalone HTML file

ICD Codes: Diagnosis and Inpatient Procedures

ICD codes (International Classification of Diseases) represent the global standard for classifying diseases, injuries, causes of death, and inpatient hospital procedures. The World Health Organization (WHO) maintains the international version (currently ICD-11), while the United States uses ICD-10-CM (Clinical Modification) for diagnoses and ICD-10-PCS (Procedure Coding System) for inpatient procedures. ICD-10-CM was adopted in the U.S. in October 2015, replacing the decades-old ICD-9-CM system and expanding from approximately 14,000 diagnosis codes to over 70,000, enabling far greater clinical specificity.

ICD-10-CM code structure:

  • Character 1: Category (letter, except U)
  • Character 2: Etiology, anatomic site, or manifestation
  • Character 3: Additional detail (completes the category)
  • Character 4-7: Even greater specificity (laterality, severity, episode of care, etc.)

Examples demonstrating increasing specificity:

  • E11: Type 2 diabetes mellitus (category)
  • E11.6: Type 2 diabetes mellitus with other specified complications
  • E11.65: Type 2 diabetes mellitus with hyperglycemia
  • E11.641: Type 2 diabetes mellitus with hypoglycemia with coma

ICD codes serve multiple purposes beyond billing:

  • Claims adjudication: Payers use ICD codes to determine medical necessity and appropriate reimbursement
  • Epidemiology: Public health tracking of disease prevalence and incidence
  • Quality measurement: Many quality metrics require specific diagnosis codes
  • Research: Disease registries and clinical research studies
  • Population health: Risk stratification and care management program enrollment

ICD-10-PCS codes describe inpatient hospital procedures with seven-character alphanumeric codes, each character representing a specific attribute:

  1. Section: Type of procedure (Medical/Surgical, Obstetrics, Imaging, etc.)
  2. Body System: Anatomical region operated on
  3. Root Operation: Objective of the procedure (excision, repair, replacement, etc.)
  4. Body Part: Specific anatomical site
  5. Approach: How the body part was reached (open, percutaneous, via natural opening, etc.)
  6. Device: Device left in place, if any
  7. Qualifier: Additional detail

Example: 0DT60ZZ = Resection of stomach, open approach

CPT Codes: Physician Services and Outpatient Procedures

CPT codes (Current Procedural Terminology), maintained by the American Medical Association (AMA), describe physician services, outpatient procedures, diagnostic tests, and therapeutic services. CPT codes form the foundation of professional fee billing—when a physician sees a patient, performs a procedure, or orders a test, CPT codes translate those services into billable line items. CPT contains over 10,000 codes updated annually, with a rigorous process for adding, modifying, or deleting codes based on evolving medical practice.

CPT code categories:

Category I (5-digit numeric codes): Established procedures and services

  • Evaluation and Management (E&M) (99202-99499): Office visits, consultations, hospital rounds, emergency department visits
  • Anesthesia (00100-01999): Anesthesia services by anatomical site
  • Surgery (10021-69990): Organized by body system
  • Radiology (70010-79999): Diagnostic and interventional imaging
  • Pathology and Laboratory (80047-89398): Lab tests and analyses
  • Medicine (90281-99607): Immunizations, dialysis, physical therapy, etc.

Category II (4 digits + letter F): Optional quality measurement codes

  • Used for performance measurement programs
  • Not used for reimbursement
  • Example: 3074F = Most recent systolic blood pressure <130 mmHg

Category III (4 digits + letter T): Temporary codes for emerging procedures

  • Used for new technologies under evaluation
  • May eventually become Category I codes or be retired
  • Example: 0075T = Transcatheter placement of extracranial vertebral artery stent(s)

CPT modifiers (2-digit codes) provide additional information:

  • -25: Significant, separately identifiable E&M service on same day as procedure
  • -50: Bilateral procedure
  • -51: Multiple procedures
  • -59: Distinct procedural service (unbundling modifier)
  • -76: Repeat procedure by same physician
  • -LT/-RT: Left/right side indicators

Common CPT code examples:

Code Description Typical Reimbursement
99213 Office visit, established patient, level 3 $100-150
99214 Office visit, established patient, level 4 $150-200
99285 Emergency department visit, high severity $300-500
29881 Arthroscopy, knee, surgical $1,500-2,500
80053 Comprehensive metabolic panel (lab test) $15-30
71046 Chest X-ray, 2 views $50-100

HCPCS Codes: Medical Supplies and Services

HCPCS codes (Healthcare Common Procedure Coding System, pronounced "hick-picks") is a two-level coding system maintained by the Centers for Medicare & Medicaid Services (CMS). Level I HCPCS codes are identical to CPT codes. Level II HCPCS codes (commonly referred to simply as "HCPCS codes") cover services, supplies, and equipment not included in CPT, particularly items relevant to Medicare and Medicaid billing.

Level II HCPCS code structure:

  • First character: Letter (A-V) indicating code category
  • Next four characters: Numbers providing specificity
  • Optional modifiers: 2-character alphanumeric codes

HCPCS Level II categories:

Code Range Category Examples
A codes Transportation, supplies, administrative A0428 (Ambulance service, basic life support), A4253 (Blood glucose test strips, box of 50)
B codes Enteral and parenteral therapy B4034 (Enteral feeding supply kit)
C codes Temporary hospital outpatient PPS C1713 (Anchor/screw for opposing bone-to-bone)
D codes Dental procedures D0120 (Periodic oral evaluation)
E codes Durable medical equipment (DME) E0143 (Walker, folding, wheeled), E0163 (Commode chair)
G codes Temporary procedures/services G0439 (Annual wellness visit, first occurrence)
J codes Drugs administered other than oral J0129 (Injection, abatacept, 10 mg)
K codes Temporary codes for DME K0001 (Standard wheelchair)
L codes Orthotics and prosthetics L3260 (Surgical boot/shoe)
P codes Pathology and laboratory P9010 (Blood, split unit)
Q codes Temporary codes Q4081 (Injection, epoetin alfa, 100 units)
S codes Temporary national codes (non-Medicare) S0630 (Removal of sutures by someone other than physician)
V codes Vision and hearing services V2020 (Frames, purchases)

HCPCS codes are particularly important for:

  • Durable medical equipment (DME): Wheelchairs, hospital beds, oxygen equipment, walkers, crutches
  • Orthotics and prosthetics: Braces, artificial limbs, orthopedic shoes
  • Injectable drugs: Chemotherapy, biologics, vaccines given in office or hospital settings
  • Ambulance services: Different levels and types of emergency medical transport
  • Medical supplies: Diabetic supplies, ostomy supplies, incontinence products

Drug Codes: Pharmaceutical Products

Drug codes identify pharmaceutical products for prescribing, dispensing, billing, and clinical decision support. The primary drug coding systems in U.S. healthcare are:

National Drug Code (NDC): FDA-assigned 10-11 digit identifier for drug products

  • Segment 1 (Labeler): Manufacturer or distributor (4-5 digits)
  • Segment 2 (Product): Drug formulation and strength (3-4 digits)
  • Segment 3 (Package): Package size and type (1-2 digits)

Example: 0071-0156-23 - 0071 = Pfizer (labeler) - 0156 = Lipitor 10mg tablet (product) - 23 = Bottle of 90 tablets (package)

RxNorm: National Library of Medicine system providing normalized names for clinical drugs

RxNorm links various drug vocabularies (NDC, SNOMED CT, MeSH, FDA) and provides standard naming conventions at multiple levels of granularity:

  • Ingredient: Active pharmaceutical ingredient (e.g., atorvastatin)
  • Clinical Drug: Ingredient + strength (e.g., atorvastatin 10 mg)
  • Branded Drug: Brand name + ingredient + strength (e.g., Lipitor 10 mg)
  • Clinical Drug Form: Ingredient + strength + dose form (e.g., atorvastatin 10 mg oral tablet)
  • Branded Drug Form: Complete product specification (e.g., Lipitor 10 mg oral tablet)

Other drug classification systems:

  • Generic Product Identifier (GPI): Hierarchical classification by therapeutic class
  • American Hospital Formulary Service (AHFS): Pharmacologic-therapeutic classification
  • Anatomical Therapeutic Chemical (ATC): WHO classification system

Drug codes support critical healthcare functions:

  • ePrescribing: Electronic transmission of prescriptions to pharmacies
  • Drug interaction checking: Clinical decision support for contraindications and interactions
  • Formulary management: Determining which drugs are covered by insurance plans and at what tier
  • Medication reconciliation: Comparing medication lists across care transitions
  • Adverse event reporting: Pharmacovigilance and safety monitoring
  • Pharmacy billing: Submission of pharmacy claims with NDC codes

Healthcare Interoperability and Data Exchange

The Interoperability Challenge

Healthcare interoperability refers to the ability of healthcare information systems to exchange, interpret, and use data across organizational boundaries, enabling seamless information flow among providers, payers, patients, and public health agencies. True interoperability requires not just technical data exchange (syntactic interoperability) but also shared understanding of meaning (semantic interoperability) and coordinated workflows (process interoperability). Despite decades of effort and billions of dollars invested in health IT, interoperability remains one of healthcare's most persistent challenges.

Barriers to healthcare interoperability include:

Technical barriers: - Proprietary EHR data models and interfaces - Heterogeneous data formats (HL7 v2, CDA, FHIR, X12, NCPDP, DICOM) - Inconsistent identifier systems across organizations - Legacy systems with limited integration capabilities - Network security restrictions and firewall configurations

Semantic barriers: - Multiple coding systems describing the same clinical concepts - Local terminology variations and custom codes - Incomplete or missing standardized terminology use - Different granularity in documentation practices - Ambiguous or context-dependent clinical terms

Organizational barriers: - Competitive concerns about sharing patient data - Lack of business incentives for interoperability investments - Information blocking practices to maintain patient populations - Complex data sharing agreements and legal concerns - Varying privacy and consent frameworks across states

Regulatory barriers: - HIPAA privacy and security requirements - 42 CFR Part 2 restrictions on substance use disorder records - State-specific privacy laws (e.g., mental health, genetic data, HIV status) - Data ownership ambiguities - Consent requirements for health information exchange

The 21st Century Cures Act (2016) and subsequent regulations require healthcare providers and EHR vendors to implement standardized APIs, prohibit information blocking, and enable patients to access their complete electronic health information. These policies are accelerating the adoption of FHIR (Fast Healthcare Interoperability Resources) as the emerging standard for healthcare data exchange.

Healthcare Data Exchange Standards and Approaches

Healthcare data exchange encompasses the technical mechanisms, standards, and organizational frameworks for sharing health information. Multiple exchange paradigms coexist in modern healthcare, each optimized for different use cases, technical capabilities, and organizational relationships.

Data exchange standards:

HL7 Version 2.x (Health Level Seven): - Message-based standard from the 1980s-90s - Pipe-delimited text format (e.g., PID|1||12345^^^MRN^MR||DOE^JOHN^||19600101|M) - Common message types: ADT (admissions), ORU (results), ORM (orders), SIU (scheduling) - Still widely used for intra-organizational interfaces - Flexible structure leads to implementation variations

HL7 CDA (Clinical Document Architecture): - XML-based standard for clinical documents - Structures documents (discharge summaries, progress notes, imaging reports) - Continuity of Care Document (CCD) and Consolidated CDA (C-CDA) are common implementations - Human-readable and machine-processable - Required for Meaningful Use and ONC certification

HL7 FHIR (Fast Healthcare Interoperability Resources): - Modern RESTful API standard (2014-present) - JSON and XML formats - Resource-based model (Patient, Encounter, Observation, Medication, etc.) - Easier to implement than previous HL7 standards - Supports web-based and mobile applications - Growing adoption for patient access, payer-provider exchange, public health reporting

X12 EDI (Electronic Data Interchange): - ANSI standard for administrative transactions - Common transaction sets: - 270/271: Eligibility inquiry and response - 276/277: Claim status inquiry and response - 278: Prior authorization - 837: Claims submission - 835: Payment/remittance advice - Fixed-length and delimited formats - Required for HIPAA-covered transactions

DICOM (Digital Imaging and Communications in Medicine): - Standard for medical imaging (X-rays, CT, MRI, ultrasound) - Defines image formats and transmission protocols - Includes patient and study metadata - Basis for PACS (Picture Archiving and Communication Systems)

Data exchange approaches:

Approach Description Use Cases Advantages Disadvantages
Direct messaging Secure email-like exchange using Direct Protocol Provider-to-provider referrals, transitions of care Simple, encrypted, "push" model Requires known recipient address, no query capability
Health Information Exchanges (HIEs) Regional or statewide networks aggregating data Emergency department access to patient history Broad coverage, query for missing information Governance complexity, funding challenges, variable data quality
EHR vendor networks Data sharing within same EHR vendor ecosystem Care coordination among Epic or Cerner sites Easier semantic interoperability Limited to single vendor, proprietary
FHIR APIs Standardized RESTful APIs for data access Patient apps, payer integrations, research Standards-based, modern architecture Implementation variations, security complexity
Point-to-point interfaces Custom connections between specific systems Lab results, radiology images, ADT notifications Optimized for specific workflow Maintenance burden, brittle, non-scalable

Graph databases offer unique advantages for healthcare interoperability challenges:

  • Schema flexibility: Easily accommodate data from multiple sources with varying structures
  • Relationship representation: Naturally model connections among patients, encounters, providers, diagnoses, medications
  • Identity resolution: Graph algorithms can link records representing same patient across systems
  • Data lineage: Track provenance of data elements across exchanges
  • Semantic mapping: Represent relationships among coding systems (ICD, SNOMED CT, LOINC, etc.)
  • Master data management: Create unified views of patients, providers, and facilities across sources
  1
  2
  3
  4
  5
  6
  7
  8
  9
 10
 11
 12
 13
 14
 15
 16
 17
 18
 19
 20
 21
 22
 23
 24
 25
 26
 27
 28
 29
 30
 31
 32
 33
 34
 35
 36
 37
 38
 39
 40
 41
 42
 43
 44
 45
 46
 47
 48
 49
 50
 51
 52
 53
 54
 55
 56
 57
 58
 59
 60
 61
 62
 63
 64
 65
 66
 67
 68
 69
 70
 71
 72
 73
 74
 75
 76
 77
 78
 79
 80
 81
 82
 83
 84
 85
 86
 87
 88
 89
 90
 91
 92
 93
 94
 95
 96
 97
 98
 99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
<summary>Healthcare Data Integration Graph Model</summary>
Type: graph-model

Purpose: Demonstrate how graph databases can integrate healthcare data from multiple sources (EHR, claims, lab, pharmacy) around a unified patient view

Node types:

1. **Patient** (blue circles, large)
   Properties: patient_id, MRN, name, DOB, gender, address
   Labels: "Master Patient Record"

2. **Source System** (gray hexagons, medium)
   Properties: system_name, vendor, organization
   Examples: "Epic EHR - Hospital A", "Claims Database - Payer X", "LabCorp Interface", "CVS Pharmacy"

3. **Encounter** (green squares, medium)
   Properties: encounter_id, date, type, facility, chief_complaint
   Examples: "Office Visit 2024-01-15", "ED Visit 2024-02-03", "Hospitalization 2024-03-10"

4. **Diagnosis** (red diamonds, small)
   Properties: ICD_code, description, onset_date, status
   Examples: "E11.9 Type 2 Diabetes", "I10 Essential Hypertension", "J45.909 Asthma"

5. **Medication** (purple pill shapes, small)
   Properties: drug_name, NDC, RxNorm, dose, frequency, start_date
   Examples: "Metformin 500mg BID", "Lisinopril 10mg daily", "Albuterol inhaler PRN"

6. **Lab Result** (orange test tubes, small)
   Properties: LOINC_code, test_name, value, units, date, reference_range
   Examples: "HbA1c 7.2%", "Glucose 145 mg/dL", "Creatinine 0.9 mg/dL"

7. **Provider** (teal stethoscope icons, medium)
   Properties: NPI, name, specialty, organization
   Examples: "Dr. Smith - PCP", "Dr. Jones - Endocrinology", "Dr. Lee - Emergency Medicine"

8. **Coding System** (yellow books, small)
   Properties: system_name, version, code
   Examples: "ICD-10-CM", "RxNorm", "LOINC", "SNOMED CT"

Edge types:

1. **SOURCED_FROM** (Patient → Source System)
   Properties: identifier_type, identifier_value, confidence_score
   Style: Dashed gray arrows
   Purpose: Show which systems contributed patient data

2. **HAD_ENCOUNTER** (Patient → Encounter)
   Properties: admission_date, discharge_date
   Style: Solid green arrows
   Purpose: Link patient to care episodes

3. **DOCUMENTED_BY** (Encounter → Provider)
   Properties: role (attending, consulting, PCP)
   Style: Solid teal arrows
   Purpose: Connect encounters to providers

4. **DIAGNOSED_WITH** (Encounter → Diagnosis)
   Properties: diagnosis_type (primary, secondary), documentation_date
   Style: Solid red arrows
   Purpose: Link diagnoses to encounters

5. **PRESCRIBED** (Encounter → Medication)
   Properties: prescription_date, prescribing_provider, status (active, discontinued)
   Style: Solid purple arrows
   Purpose: Connect medications to encounters where prescribed

6. **ORDERED** (Encounter → Lab Result)
   Properties: order_date, ordering_provider
   Style: Solid orange arrows
   Purpose: Link lab results to encounters

7. **TAKES** (Patient → Medication)
   Properties: start_date, end_date, adherence_score
   Style: Dashed purple arrows
   Purpose: Show current medication regimen

8. **HAS_DIAGNOSIS** (Patient → Diagnosis)
   Properties: onset_date, status (active, resolved, historical)
   Style: Dashed red arrows
   Purpose: Link patient to problem list

9. **CODED_AS** (Diagnosis → Coding System)
   Properties: code_value, version
   Style: Dotted yellow arrows
   Purpose: Show code mappings across terminologies

10. **MAPS_TO** (Coding System → Coding System)
    Properties: mapping_confidence, mapping_type (equivalent, narrower, broader)
    Style: Bidirectional dotted yellow arrows
    Purpose: Represent semantic relationships among coding systems

Sample data structure:

Central patient node: "John Doe, DOB: 1960-01-01"
├─ SOURCED_FROM → "Epic EHR - Hospital A" (MRN: 123456)
├─ SOURCED_FROM → "Claims Database - Payer X" (Member ID: 987654)
├─ SOURCED_FROM → "LabCorp Interface" (Account: LC-456)
├─ HAD_ENCOUNTER → "Office Visit 2024-01-15"
│  ├─ DOCUMENTED_BY → "Dr. Smith - PCP"
│  ├─ DIAGNOSED_WITH → "E11.9 Type 2 Diabetes"
│  ├─ PRESCRIBED → "Metformin 500mg BID"
│  └─ ORDERED → "HbA1c 7.2%"
├─ HAD_ENCOUNTER → "Office Visit 2024-04-10"
│  ├─ DOCUMENTED_BY → "Dr. Smith - PCP"
│  └─ ORDERED → "HbA1c 6.8%"
├─ HAS_DIAGNOSIS → "E11.9 Type 2 Diabetes"
│  └─ CODED_AS → "ICD-10-CM: E11.9"
│     └─ MAPS_TO → "SNOMED CT: 44054006 (Diabetes mellitus type 2)"
├─ HAS_DIAGNOSIS → "I10 Essential Hypertension"
└─ TAKES → "Metformin 500mg BID"

Layout: Hierarchical radial layout with patient at center

Interactive features:
- **Zoom**: Mouse wheel to zoom in/out
- **Pan**: Click and drag background
- **Node hover**: Show full properties in tooltip
- **Node click**: Highlight all connected nodes and edges (1-hop neighborhood)
- **Edge hover**: Show relationship properties
- **Double-click node**: Expand to show additional connections (if collapsed)
- **Right-click node**: Context menu with options:
  - "Find similar patients" (show patients with similar diagnoses)
  - "View timeline" (temporal view of connected encounters)
  - "Show data lineage" (highlight source systems)
- **Filter panel**: Toggle node types on/off to simplify view
- **Search box**: Find specific patients, diagnoses, medications by name or code

Visual styling:
- **Node size**: Based on number of connections (degree centrality)
- **Edge thickness**: Thicker for more recent/active relationships
- **Color coding**: Consistent colors for node types
- **Transparency**: Lower opacity for historical/inactive elements
- **Animation**: Gentle pulsing for nodes with recent activity
- **Clustering**: Visually group related nodes (encounters + associated diagnoses/meds)

Legend (top-right panel):
- Node shape and color key
- Edge style meanings
- Interaction instructions
- "Show sample queries" button

Sample queries panel (expandable):
1. "Find all encounters where diabetes was diagnosed"
2. "Show medication history across all sources"
3. "Identify gaps in lab monitoring"
4. "Trace data lineage for HbA1c result"
5. "Find coding system mappings for diabetes"

Educational annotations:
- Callout: "Patient data integrated from 4 source systems"
- Callout: "Graph model handles multiple identifiers per patient"
- Callout: "Coding system relationships enable semantic queries"
- Callout: "Temporal relationships show care progression"

Canvas size: 1000x800px

Implementation:
- vis-network JavaScript library for graph visualization
- D3.js for enhanced layout and animations
- Responsive design for various screen sizes
- Export options: PNG image, JSON data, Cypher queries
- Deployed in /docs/sims/healthcare-data-integration-graph/

Data generation:
- Use realistic synthetic patient data
- Include 1 patient (center), 4 source systems, 5-7 encounters, 3-5 diagnoses, 4-6 medications, 6-10 lab results, 3-4 providers
- Ensure temporal consistency (later encounters reference earlier diagnoses)
- Include code mappings among ICD, SNOMED, RxNorm, LOINC

Summary and Key Takeaways

This chapter provided a comprehensive overview of the healthcare system domain knowledge essential for effective graph data modeling. Understanding healthcare economics, stakeholder perspectives, clinical workflows, and data standards forms the foundation for designing graph models that accurately represent the complexity and interconnectedness of healthcare information.

Key concepts covered:

Healthcare Economics: - U.S. healthcare costs are twice those of comparable nations ($12,900 per person annually) - Fee-for-service models incentivize volume over value, contributing to cost escalation - Value-based care aligns payment with outcomes, requiring sophisticated data analytics - Graph databases support the transition to value-based care through advanced relationship analytics

Healthcare Stakeholders: - Patients: Central information hub connecting clinical, administrative, and social determinant data - Providers: Individual clinicians and organizations generating and consuming clinical documentation - Payers: Insurance companies and government programs financing care and driving quality initiatives - All three stakeholders generate complex, interconnected data requiring graph representation

Clinical Operations: - Medical encounters: Temporal containers linking patients, providers, diagnoses, procedures, and charges - Clinical workflows: Multi-step processes spanning seconds to years, involving coordination across roles and systems - Electronic health records: Digital repositories with structured and unstructured patient data, varying by vendor - Patient demographics: Core identifying attributes plus social determinants influencing health outcomes

Medical Coding and Terminology: - Medical terminology: Specialized language built from Greek/Latin roots enabling precise clinical communication - ICD codes: Classify diagnoses and inpatient procedures (70,000+ codes) - CPT codes: Describe physician services and outpatient procedures (10,000+ codes) - HCPCS codes: Cover supplies, equipment, and services not in CPT - Drug codes: Identify pharmaceutical products (NDC, RxNorm, GPI) - Multiple coding systems require semantic mapping and relationship management

Interoperability and Data Exchange: - Healthcare interoperability: Ability to exchange and meaningfully use health information across organizations - Data exchange standards: HL7 v2, CDA, FHIR, X12, DICOM serve different exchange needs - Exchange approaches: Direct messaging, HIEs, vendor networks, FHIR APIs, point-to-point interfaces - Graph advantages: Schema flexibility, relationship representation, identity resolution, semantic mapping

With this healthcare domain foundation, you are now prepared to design graph data models that accurately capture the intricate relationships among patients, providers, payers, clinical concepts, and healthcare transactions. The next chapters will build on this knowledge to develop specific graph modeling patterns for patient-centric, provider-centric, and payer-centric perspectives.

References

  1. FHIR Overview - 2024 - HL7 International - Official Fast Healthcare Interoperability Resources (FHIR) specification describing modern healthcare data exchange standards including RESTful APIs and resource-based data models essential for understanding healthcare interoperability.

  2. ICD-10-CM Official Guidelines - 2024 - Centers for Disease Control and Prevention - Comprehensive resource for the International Classification of Diseases, 10th Revision, Clinical Modification coding system used throughout U.S. healthcare for diagnosis classification and billing.

  3. Healthcare Payment Systems Overview - 2024 - Centers for Medicare & Medicaid Services - Official CMS documentation explaining Medicare fee-for-service payment systems, DRGs, and reimbursement methodologies that form the foundation of U.S. healthcare economics.