flowchart TD subgraph Sources["Data Sources"] GitHub[("GitHub Repos
dmccreary/*")]:::sourceNode end subgraph Crawl["Step 1: Crawl Metadata"] Crawler["crawl-microsims.py
Fetch metadata.json"]:::processNode end subgraph Combine["Step 2: Combine Data"] DataFile[("microsims-data.json
~500 KB")]:::dataNode end subgraph Embed["Step 3: Generate Embeddings"] EmbedScript["generate-embeddings.py
384-dim vectors"]:::processNode EmbedFile[("microsims-embeddings.json
7 MB")]:::warningNode end subgraph Precompute["Step 4: Precompute Similarities"] SimilarScript["generate-similar-microsims.py
Cosine similarity matrix"]:::processNode SimilarFile[("similar-microsims.json
870 KB")]:::dataNode end subgraph PCA["Step 5: Generate PCA Projection"] PCAScript["generate-pca.py
Reduce to 2D"]:::processNode PCAFile[("pca-projection.json
~50 KB")]:::dataNode end subgraph WebApps["Step 6: Web Applications"] Search["Faceted Search
demo.html"]:::appNode Similar["Similar MicroSims
list-similar-microsim"]:::appNode PCAMap["2D PCA Map
pca-map"]:::appNode end GitHub --> Crawler Crawler --> DataFile DataFile --> EmbedScript EmbedScript --> EmbedFile EmbedFile --> SimilarScript EmbedFile --> PCAScript SimilarScript --> SimilarFile PCAScript --> PCAFile DataFile --> Search SimilarFile --> Similar PCAFile --> PCAMap DataFile --> Similar DataFile --> PCAMap Search -->|Links To| Similar classDef sourceNode fill:#9b59b6,stroke:#333,stroke-width:2px,color:#fff,font-size:14px classDef processNode fill:#3498db,stroke:#333,stroke-width:2px,color:#fff,font-size:14px classDef dataNode fill:#27ae60,stroke:#333,stroke-width:2px,color:#fff,font-size:14px classDef warningNode fill:#e74c3c,stroke:#333,stroke-width:2px,color:#fff,font-size:14px classDef appNode fill:#f39c12,stroke:#333,stroke-width:2px,color:#fff,font-size:14px linkStyle default stroke:#666,stroke-width:2px
Workflow Details
Hover over a node to see details about that step in the workflow.