Current location: Home> Ai News

Li Feifei's team launches FlowMo image processing technology to improve reconstruction quality

Author: LoRA Time: 22 Mar 2025 191

In the field of computer vision, the team of Professor Li Feifei and Professor Wu Jiajun of Stanford University recently released an innovative research result - "FlowMo" image tokenizer. This approach significantly improves the quality of image reconstruction without relying on convolutional neural networks (CNNs) and generative adversarial networks (GANs).

AI Painting Two-dimensional Office Professional Women (1) Animation

FlowMo adopts a two-stage training strategy. The first stage is to learn by capturing multiple possible image reconstruction results to ensure that the diversity and quality of the generated images coexist; the second stage is to focus on optimizing the reconstruction results to make them closer to the original image. This process not only improves the accuracy of reconstruction, but also enhances the visual perception quality of the generated images.

Experimental results show that FlowMo performs better than traditional image tokenizer on multiple standard datasets. For example, on the ImageNet-1K dataset, FlowMo's reconstruction performance achieved optimal results at multiple bit rate settings, especially at low bit rate cases, with the reconstruction FID value of 0.95, far exceeding the best model at present.

This study marks an important breakthrough in image processing technology, provides new ideas for future image generation models, and lays the foundation for the optimization of various visual application scenarios. With the continuous advancement of technology, image generation and processing will become more efficient and intelligent.