Open-source text-to-image and image LLMs
Large models related to image processing
- OmniGen: Unified Image Generation.
- FLUX.1: Minimal inference code to run image generation
- Llama OCR: Document-to-Markdown OCR library with Llama 3.2 Vision
- deepface: A lightweight face recognition and facial attribute analysis (age, gender, emotion, and race) library for Python
- PIXART-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
- LTX-Video: DiT-based video generation model
- Hugging Face playground