Visuals

UMAP Embedding Projections

Each plot shows 64-dimensional embeddings reduced to 2D via UMAP. Points are colored by ground-truth class labels. Tight, well-separated clusters indicate that the embedding captures the graph's community structure.

Cora 2,708 nodes · 7 classes · Citation Network

Cleora UMAP on Cora
Cleora 86.1%
NetMF UMAP on Cora
NetMF 83.9%
DeepWalk UMAP on Cora
DeepWalk 83.5%
HOPE UMAP on Cora
HOPE 82.1%
ProNE UMAP on Cora
ProNE 17.9%
RandNE UMAP on Cora
RandNE 24.7%
Cleora shows the tightest, most distinct clusters — 7 classes cleanly separated. NetMF and DeepWalk also show structure but with more overlap. ProNE and RandNE produce noisy, unstructured projections.

CiteSeer 3,312 nodes · 6 classes · Citation Network

Cleora UMAP on CiteSeer
Cleora 82.4%
NetMF UMAP on CiteSeer
NetMF 81.0%
HOPE UMAP on CiteSeer
HOPE 72.4%
ProNE UMAP on CiteSeer
ProNE 21.1%
RandNE UMAP on CiteSeer
RandNE 20.2%
Cleora separates 6 subject areas into clear, compact groups. NetMF is close behind but with more inter-class mixing. HOPE captures some structure. ProNE and RandNE show little meaningful separation.

Facebook 4,039 nodes · Louvain Communities · Social Network

Cleora UMAP on Facebook
Cleora 99.0%
NetMF UMAP on Facebook
NetMF 95.7%
ProNE UMAP on Facebook
ProNE 68.5%
RandNE UMAP on Facebook
RandNE 63.2%
Facebook ego-network communities detected via Louvain. Cleora and NetMF both capture the community structure well, with Cleora achieving the tightest separation. ProNE and RandNE show less organized projections.

PubMed 19,717 nodes · 3 classes · Citation Network

Only 3 algorithms complete on PubMed — HOPE, NetMF, GraRep, DeepWalk, and Node2Vec fail (timeout or OOM).

Cleora UMAP on PubMed
Cleora 87.9%
ProNE UMAP on PubMed
ProNE 33.9%
RandNE UMAP on PubMed
RandNE 35.1%
At 19.7K nodes, most algorithms crash. Cleora cleanly separates all 3 diabetes paper categories with distinct, well-formed clusters. ProNE and RandNE produce uniform noise — the difference is dramatic.

PPI 3,890 nodes · 50 classes · Protein Interaction

Only 3 algorithms complete on PPI — HOPE, NetMF, GraRep, DeepWalk, and Node2Vec fail (timeout or OOM).

Cleora UMAP on PPI
Cleora 100%
ProNE UMAP on PPI
ProNE 2.3%
RandNE UMAP on PPI
RandNE 7.3%
PPI has 50 protein function classes. Cleora achieves perfect classification accuracy and its UMAP shows rich, multi-cluster structure. ProNE and RandNE produce near-random noise with no discernible class separation.

Methodology

  • All embeddings use 64 dimensions, reduced to 2D via UMAP (n_neighbors=15, min_dist=0.1, random_state=42)
  • Cleora uses num_iterations=40, propagation='left', normalization='l2', whiten=True
  • Points are colored by ground-truth labels (Cora/CiteSeer/PubMed/PPI) or Louvain communities (Facebook)
  • Accuracy numbers from the full benchmark suite (separate 128-dim runs, 3-run average)