.footer { } Logo Logo
deutsch
/// News
Sigma teases new product launch for 21 February: High-light Fisheye 15mm F1.4 and more?

ConsiStory in Stable Diffusion - Finally consistent AI characters without fine-tuning?

[09:12 Wed,14.February 2024   by Rudi Schmidts]    

Although the project website still says "anonymous authors", the arxiv.org/pdf/2402.03286.pdf (linked PDF paper) makes it clear that "ConsiStory" originates from Nvidia&s research facilities. This project addresses the problem that it is often difficult to use one or more characters consistently over several image generations. For example, an "old man with a hat" usually looks significantly different with each generation attempt, depending on the other prompt tokens. This is known as the current consistency problem of generative AI.

Until now, this problem has been tackled with so-called fine-tuning - in other words, an already trained AI model is "personalized" with additional images of one or more specific people. However, this is computationally complex and also requires a certain amount of expertise.



With ConsiStory, on the other hand, it should now be possible to generate consistent motifs across a series of images within Stable Diffusion XL (SDXL) without additional training. The researchers at Nvidia use a new feature for this, which they call "correspondence-based feature injection". ConsiStory should even be able to be extended to multi-subject scenarios and enable training-free personalization for common objects.

ConsiStory
ConsiStory allows the use of consistent characters without fine-tuning



The lack of training means that such images can be created on a single Nvidia H100 in just ten seconds - which, according to the paper, is around twenty times faster than previous state-of-the-art methods.

Judging by the quality of the results published so far, Nvidia is likely to have achieved a small milestone in generative AI research here - because consistency in characters is one of the major problems currently "hanging" over many practical application scenarios for generative AI. And, of course, some rather unwanted AI projects, such as fully automated, virtual AI influencers.

Self-usable code to try out ConsiStory will be made available "shortly" on the Github project page as a link for interested parties.


Link more infos at bei consistory-paper.github.io

deutsche Version dieser Seite: ConsiStory in Stable Diffusion - Endlich konsistente KI-Charaktere ohne Finetuning?

  



[nach oben]












Archiv Newsmeldungen

2025

August - July - June - May - April - March - February - January

2024
December - November - October - September - August - July - June - May - April - March - February - January

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

2000






































deutsche Version dieser Seite: ConsiStory in Stable Diffusion - Endlich konsistente KI-Charaktere ohne Finetuning?



last update : 8.August 2025 - 08:02 - slashCAM is a project by channelunit GmbH- mail : slashcam@--antispam:7465--slashcam.de - deutsche Version