Meet LLMScore: A New LLM-based Instruction-Following Matching Pipeline to Evaluate the Alignment Between Text Prompts and Synthesized Images in Text-to-Image Synthesis

GPT-4: Researchers have introduced LLMScore, a framework that uses large language models (LLMs) to evaluate text-image alignment in text-to-image synthesis. LLMScore mimics human review by assessing compositionality at multiple granularities and returning alignment scores with justifications. It can follow different evaluation standards, such as overall alignment or error counting, and has shown strong correlation with human judgments across multiple datasets.
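
To make "correlation with human judgments" concrete, here is a minimal sketch of how agreement between an automatic metric and human ratings is often measured with a rank correlation. The summary does not say which statistic the researchers used; Kendall's tau is assumed here only as a common choice, and the function name and all numbers are made up for illustration, not taken from LLMScore or its datasets.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """Kendall's tau-a: (concordant - discordant) / total number of pairs.

    Tied pairs count as neither concordant nor discordant.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1  # both rankings order this pair the same way
        elif sign < 0:
            discordant += 1  # the rankings disagree on this pair
    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical data: one automatic score and one human rating per image.
metric_scores = [82, 45, 67, 90, 30]  # illustrative metric outputs
human_ratings = [4, 2, 4, 5, 1]       # illustrative human ratings (1 to 5)

tau = kendall_tau(metric_scores, human_ratings)
print(f"Kendall tau-a: {tau:.2f}")
```

A value near 1 means the metric ranks images in nearly the same order as human raters, a value near 0 means little agreement, and a value near -1 means the rankings are reversed.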
