# STRING Enrichment Report

This report summarizes functional enrichment of gene clusters using the STRING API.

Important interpretation notes:
- Lower k values favor enrichment due to larger cluster size.
- Increasing k may *increase the number of enriched clusters* as distinct biology separates.
- Loss of enrichment at high k reflects statistical underpowering, not absence of biology.

## Per-k enrichment overview

|   k |   n_clusters |   median_cluster_size |   n_clusters_enriched |   fraction_enriched |   total_enriched_terms |   median_best_fdr |
|----:|-------------:|----------------------:|----------------------:|--------------------:|-----------------------:|------------------:|
|   5 |            5 |                    11 |                     5 |            1        |                     13 |           0.0025  |
|   7 |            7 |                     6 |                     4 |            0.571429 |                     11 |           0.00415 |
|  15 |           11 |                     4 |                     6 |            0.545455 |                      8 |           0.01705 |

## Enrichment source breakdown

The table below shows how enrichment is distributed across annotation sources.
This helps distinguish pathway-driven, functional, structural, or literature-driven signal.

|   k | category_label        |   n_terms |   n_clusters |   best_fdr |
|----:|:----------------------|----------:|-------------:|-----------:|
|   5 | HPO                   |         5 |            3 |   4.61e-08 |
|   5 | NetworkNeighborAL     |         2 |            1 |   0.0381   |
|   5 | PMID                  |         2 |            2 |   0.0075   |
|   5 | SMART domains         |         2 |            1 |   0.01     |
|   5 | TISSUES               |         2 |            1 |   0.0272   |
|   7 | HPO                   |         4 |            2 |   4.61e-08 |
|   7 | PMID                  |         2 |            2 |   0.0075   |
|   7 | SMART domains         |         2 |            1 |   0.01     |
|   7 | TISSUES               |         2 |            1 |   0.0272   |
|   7 | NetworkNeighborAL     |         1 |            1 |   0.0064   |
|  15 | HPO                   |         5 |            3 |   0.0023   |
|  15 | GO Biological Process |         1 |            1 |   0.0277   |
|  15 | NetworkNeighborAL     |         1 |            1 |   0.0064   |
|  15 | TISSUES               |         1 |            1 |   0.0449   |

## How to use this report

- Look for k values where enrichment spreads across *multiple clusters*.
- Prefer k where GO / pathway categories dominate over domain-only or PubMed-only signal.
- Use heatmaps alongside this report to confirm tissue coherence.
