Similarity Viewer
This 2D chart shows a series of points. Each point corresponds to a single paper. The distance between any two points represents how similar the papers are. Points that are close together indicated that the papers are similar. Points that are far apart indicate that the papers are different. Note that the X and Y dimensions don't actually mean anything concrete like the score. They are just ways to group similar papers.
Embeddings
The way we create this chart is to use large language models (LLMs) to create a data structure for each paper called an embedding. An embedding is a vector of numbers that place each paper in a multi-dimensional space based on the words and concepts described in each paper.
Sample Run
1 | |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 | |