DeepSeek has been in the media a lot lately. They have just introduced another model, this time one that can compete with DALL-E 3 and Stable Diffusion. As they explain:
Janus-Pro is a unified understanding and generation MLLM, which decouples visual encoding for multimodal understanding and generation. Janus-Pro is constructed based on the DeepSeek-LLM-1.5b-base/DeepSeek-LLM-7b-base.
I tested the image generation feature, and it was quite impressive. You can also ask the model to describe memes or extract information from a image.
You can test it here.