Open source text to image and image LLM

  • OmniGen: Unified Image Generation.
  • FLUX.1 minimal inference code to run image generation
  • Llama OCRAbout Document to Markdown OCR library with Llama 3.2 vision
  • deepfaceAbout A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
  • PIXART-αFast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
  • LTX-VideoDiT-based video generation model