SimT2I

non-profit
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

binaryXwizard  updated a Space 1 day ago
SimT2I/README
binaryXwizard  published a Space 1 day ago
SimT2I/README
Lyy0725  submitted a paper 22 days ago
ELF: Embedded Language Flows
View all activity

Organization Card

SimT2I Demo

This Space is an interactive research demo for Simple Text-to-Image (SimT2I). It currently serves the released PyTorch/Diffusers checkpoints:

  • SimT2I-B/16
  • SimT2I-L/16

The demo is intended for quick qualitative inspection of the released models, rather than as a benchmark or a replacement for the full research codebase.

Usage

Enter a prompt, choose either SimT2I-B/16 or SimT2I-L/16, and adjust the CFG scale if you want stronger or weaker prompt guidance. The advanced controls let you sample more candidates per prompt when you want a broader qualitative look.

The negative-prompt field is an experimental steering control. When it is set, the CFG reference branch uses prompt + negative prompt instead of an empty text condition.

Checkpoints

The app loads model weights from the private SimT2I/SimT2I model repository at runtime. The Space must therefore be configured with an HF_TOKEN secret that can read the SimT2I organization repositories.

Notes

For each prompt, the demo can generate several candidates and return the one with the highest score under a CLIP+MLP aesthetic predictor. The selector follows the public christophschuhmann/improved-aesthetic-predictor implementation, using the sac+logos+ava1-l14-linearMSE.pth predictor on top of CLIP ViT-L/14 features.

Output quality can vary with prompt phrasing, CFG scale, random seed, and the number of generated candidates. The selected image is the highest-scoring sample under the aesthetic selector, not a guarantee of factual or compositional correctness.

models 0

None public yet

datasets 0

None public yet