Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis

ByteDance
*  Project Lead

Visual comparison between Hyper-SD and other methods. From the first column to the fourth, the prompts for these images are: (1) A dog wearing a white t-shirt, with the word “hyper” written on it ... (2) Abstract beauty, approaching perfection, pure form, golden ratio, minimalistic, unfinished, ... (3) A crystal heart laying on moss in a serene zen garden ... (4) Anthropomorphic art of a scientist stag, victorian inspired clothing by krenz cushart ...

Real-Time Generation Demo of Hyper-SD.

Abstract


Recently, a series of diffusion-aware distillation algorithms have emerged to alleviate the computational overhead of the multi-step inference process of Diffusion Models (DMs). Current distillation techniques fall into two categories: i) ODE Trajectory Preservation; and ii) ODE Trajectory Reformulation. However, these approaches suffer from severe performance degradation or domain shifts. To address these limitations, we propose Hyper-SD, a novel framework that combines the advantages of ODE Trajectory Preservation and Reformulation while maintaining near-lossless performance during step compression. First, we introduce Trajectory Segmented Consistency Distillation, which progressively performs consistency distillation within pre-defined time-step segments and thereby helps preserve the original ODE trajectory from a higher-order perspective. Second, we incorporate human feedback learning to boost the performance of the model in the low-step regime and mitigate the performance loss incurred by the distillation process. Third, we integrate score distillation to further improve the low-step generation capability of the model, and we offer the first attempt to use a unified LoRA to support inference at all step counts. Extensive experiments and user studies demonstrate that Hyper-SD achieves SOTA performance from 1 to 8 inference steps for both SDXL and SD1.5. For example, Hyper-SDXL surpasses SDXL-Lightning by +0.68 in CLIP Score and +0.51 in Aes Score in 1-step inference.

Pipeline


Hyper-SD adopts a two-stage Progressive Consistency Distillation. The first stage performs consistency distillation in two separate time segments, [0, T/2] and [T/2, T], to obtain a two-segment consistency ODE. In the second stage, this ODE trajectory is used to train a global consistency model.
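The segmentation described above can be illustrated with a minimal sketch. It only shows how a time step might be mapped to the lower edge of its segment, which is the point a segment-wise consistency model would be trained to predict. The helper names `segment_boundaries` and `segment_target`, the integer time grid with T = 1000, and the half-open segment convention (s_m, s_{m+1}] are assumptions made for illustration and are not taken from the Hyper-SD code.

```python
from bisect import bisect_left


def segment_boundaries(T: int, k: int) -> list[int]:
    """Split [0, T] into k equal segments and return the k + 1 edges.

    Hypothetical helper: assumes an integer time grid where T is divisible by k.
    """
    if k < 1 or T % k != 0:
        raise ValueError("k must be >= 1 and divide T evenly")
    step = T // k
    return [i * step for i in range(k + 1)]


def segment_target(t: int, boundaries: list[int]) -> int:
    """Return the lower edge of the segment that contains time step t.

    Assumption: segments are half-open, (s_m, s_{m+1}], so a step lying
    exactly on an interior edge belongs to the segment below it.
    """
    if not boundaries[0] < t <= boundaries[-1]:
        raise ValueError("t must satisfy 0 < t <= T")
    # bisect_left finds the first edge >= t; the edge before it is the target.
    return boundaries[bisect_left(boundaries, t) - 1]


# Stage 1: two segments, [0, T/2] and [T/2, T].
stage1 = segment_boundaries(1000, 2)
# Stage 2: a single global segment, [0, T].
stage2 = segment_boundaries(1000, 1)

for t in (250, 500, 750, 1000):
    print(t, segment_target(t, stage1), segment_target(t, stage2))
```

With two segments, steps in the upper half map to T/2 rather than to 0, so each segment is distilled independently. With one segment, every step maps to 0, which corresponds to the global consistency model of the second stage.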

Experiment

Qualitative comparisons between Hyper-SD and other LoRA-based acceleration approaches on SDXL architecture.

Qualitative comparisons between Hyper-SD and other LoRA-based acceleration approaches on SD1.5 architecture.

Hyper-SD clearly outperforms existing acceleration methods and is preferred by more users on both the SD1.5 and SDXL architectures.

Hyper-SD LoRAs trained for different step counts can be applied to different base models and consistently generate high-quality images.

The unified LoRAs of Hyper-SD are compatible with ControlNet. The examples are conditioned on either scribble or canny images.

BibTeX

@misc{ren2024hypersd,
      title={Hyper-SD: Trajectory Segmented Consistency Model for Efficient Image Synthesis}, 
      author={Yuxi Ren and Xin Xia and Yanzuo Lu and Jiacheng Zhang and Jie Wu and Pan Xie and Xing Wang and Xuefeng Xiao},
      year={2024},
      eprint={2404.13686},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}