Standard text-to-image models overwhelmingly produce majority attribute combinations. Simply suppressing common outputs does not amplify rare ones: the probability mass is redistributed among other majority outputs. RAIGen instead locates minority attributes that are already encoded inside the model but systematically suppressed during generation.
RAIGen identifies minority features directly from diffusion model internals — no predefined categories, no external VLMs required.
Hierarchical decomposition of diffusion representations into interpretable sparse features at multiple levels of granularity, from broad concepts to fine-grained details.
A novel signal combining activation rarity and semantic distinctiveness to automatically surface suppressed attributes across any architecture.
For each MSAE neuron zᵢ we compute two complementary signals:
νᵢ (activation frequency): the fraction of samples on which neuron zᵢ fires. A low νᵢ means a rarer feature.
dᵢ (semantic distinctiveness): the cosine distance between the neuron's activation-weighted CLIP centroid and the global dataset centroid. A high dᵢ means the feature is semantically far from the majority.
Neurons with a high minority score are both infrequent and semantically separated from dominant patterns, the hallmarks of genuine minority attributes.
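
As a rough illustration of the two signals described above, here is a minimal NumPy sketch. The function names, the firing threshold, and especially the way the two signals are combined, `(1 - ν) * d`, are assumptions made for this example; the text above does not state the exact scoring formula, so treat this as one plausible instantiation rather than the method itself.

```python
import numpy as np


def activation_frequency(acts, threshold=0.0):
    """Fraction of samples on which each neuron fires.

    acts: array of shape [n_samples, n_neurons] with sparse-autoencoder activations.
    threshold: assumed firing threshold (hypothetical default of 0).
    """
    return (acts > threshold).mean(axis=0)


def semantic_distinctiveness(acts, clip_emb):
    """Cosine distance between each neuron's activation-weighted CLIP centroid
    and the global centroid of all embeddings.

    clip_emb: array of shape [n_samples, dim], one CLIP embedding per sample.
    """
    weights = np.clip(acts, 0.0, None)                 # non-negative weights
    totals = weights.sum(axis=0)                       # [n_neurons]
    safe_totals = np.where(totals > 0, totals, 1.0)    # avoid division by zero
    centroids = (weights.T @ clip_emb) / safe_totals[:, None]
    global_centroid = clip_emb.mean(axis=0)

    num = centroids @ global_centroid
    den = np.linalg.norm(centroids, axis=1) * np.linalg.norm(global_centroid)
    # Neurons that never fire have a zero centroid; give them cosine 1 (distance 0).
    cos = np.divide(num, den, out=np.ones_like(num), where=den > 0)
    return 1.0 - cos


def minority_score(acts, clip_emb, threshold=0.0):
    """ASSUMED combination: rarity (1 - nu) times distinctiveness d."""
    nu = activation_frequency(acts, threshold)
    d = semantic_distinctiveness(acts, clip_emb)
    return (1.0 - nu) * d
```

Under this assumed combination, a neuron that fires on every sample scores zero regardless of its distinctiveness, and a neuron that never fires also scores zero, so only features that are both rare and semantically distant rank highly.
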
RAIGen reveals contextual, stylistic, interaction, and compositional rare attributes across prompts. Each image pair shows the generated image (top) and its MSAE activation heatmap (bottom), highlighting the spatial regions driving the minority neuron.
Attribute Presence measures how often a discovered attribute appears in generated images; lower means rarer. RAIGen attributes appear in roughly 20% of images (0.194 to 0.220 in the table below), compared with more than 93% for OpenBias, confirming that our method surfaces genuinely underrepresented features.
| Model | Approach | WinoBias | COCO |
|---|---|---|---|
| SD v1.4 | OpenBias | 0.941 | 0.933 |
| SD v1.4 | RAIGen | 0.205 | 0.220 |
| SDXL | OpenBias | 0.941 | 0.933 |
| SDXL | RAIGen | 0.194 | 0.199 |

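
For readers who want to reproduce a number like those in the table above, Attribute Presence reduces to a simple fraction once per-image detections are available. The sketch below assumes a list of boolean flags, one per generated image, produced by some attribute detector that is not specified here; the function name is hypothetical.

```python
def attribute_presence(detections):
    """Fraction of generated images in which the attribute was detected.

    detections: iterable of truthy/falsy flags, one per generated image
    (assumed to come from an external attribute detector).
    """
    flags = [bool(d) for d in detections]
    if not flags:
        raise ValueError("need at least one generated image")
    return sum(flags) / len(flags)
```

For example, an attribute detected in 1 of 5 images has a presence of 0.2; values near 0.9 indicate an attribute that the model produces almost by default.
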
| Profession | Mean ↓ | 95% CI |
|---|---|---|
| Analyst | 1.35 | [1.03, 1.67] |
| CEO | 0.70 | [0.44, 0.96] |
| Doctor | 1.18 | [0.97, 1.39] |
| Salesperson | 1.45 | [0.99, 1.91] |
| Sheriff | 2.64 | [2.21, 3.07] |
If you find our work useful, please cite: