NLP Pipeline Architecture

flowchart TD
    Raw["Raw Text:
Hey, can you show me last quarter sales?"]:::io subgraph L1["Character Level - Layer 1: Text Preprocessing"] direction LR Norm["Text Normalization"]:::l1 Tok["Tokenization"]:::l1 Norm --> Tok end subgraph L2["Word Level - Layer 2: Morphological Analysis"] direction LR Stem["Stemming"]:::l2 Lemma["Lemmatization"]:::l2 Stem --> Lemma end subgraph L3["Word Level - Layer 3: Syntactic Analysis"] direction LR POS["Part-of-Speech Tagging"]:::l3 Dep["Dependency Parsing"]:::l3 POS --> Dep end subgraph L4["Sentence Level - Layer 4: Semantic Analysis"] direction LR NER["Named Entity Recognition"]:::l4 Coref["Coreference Resolution"]:::l4 NER --> Coref end Out["Structured Output:
Ready for intent recognition and query execution"]:::io Raw -->|normalized tokens| L1 L1 -->|root forms| L2 L2 -->|grammatical tags| L3 L3 -->|entity relationships| L4 L4 --> Out classDef io fill:#90a4ae,stroke:#37474f,stroke-width:2px,color:#fff,font-size:14px classDef l1 fill:#bbdefb,stroke:#1565c0,stroke-width:2px,color:#0d2c54,font-size:15px classDef l2 fill:#90caf9,stroke:#1565c0,stroke-width:2px,color:#0d2c54,font-size:15px classDef l3 fill:#42a5f5,stroke:#0d47a1,stroke-width:2px,color:#fff,font-size:15px classDef l4 fill:#1565c0,stroke:#0d47a1,stroke-width:2px,color:#fff,font-size:15px style L1 fill:#e3f2fd,stroke:#64b5f6,stroke-width:1px style L2 fill:#e3f2fd,stroke:#42a5f5,stroke-width:1px style L3 fill:#e3f2fd,stroke:#1e88e5,stroke-width:1px style L4 fill:#e3f2fd,stroke:#1565c0,stroke-width:1px linkStyle default stroke:#ff8f00,stroke-width:2.5px,color:#e65100,font-size:13px
Processing Layers (light to dark = more sophisticated)
Layer 1: Preprocessing
Layer 2: Morphology
Layer 3: Syntax
Layer 4: Semantics
Orange arrows: data transforms

Stage Details

Hover a stage to learn what that layer does.