Training with aspect ratio bucketing can greatly improve the output quality of image generation models (and we personally don’t want another base model trained with center crops), so we have decided to release the NovelAI bucketing code under a permissive MIT license.
https://github.com/NovelAI/novelai-aspect-ratio-bucketing
The repository provides an implementation of aspect ratio bucketing for training generative image models, as described in our previous blog post:
Aspect Ratio Bucketing
One common issue with existing image generation models is that they are prone to producing images with unnatural crops. This is because these models are trained to produce square images, while most photos and artworks are not square. A model can only operate on images of the same size at the same time, and during training it is common practice to process multiple training samples at once to make efficient use of the GPUs. As a compromise, square images are chosen, and during training only the center of each image is cropped out and shown to the image generation model as a training example.
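For concreteness, here is a minimal sketch of that conventional square center-crop preprocessing. This is not the NovelAI code; the function name and parameters are our own illustration, using Pillow:

```python
from PIL import Image

def center_crop_square(image: Image.Image, size: int = 512) -> Image.Image:
    """Crop the largest centered square from an image and resize it."""
    width, height = image.size
    side = min(width, height)
    left = (width - side) // 2
    top = (height - side) // 2
    square = image.crop((left, top, left + side, top + side))
    return square.resize((size, size), Image.LANCZOS)

# A 512x1024 portrait loses its top and bottom quarters here, which is
# exactly how feet, heads, and crowns disappear from the training data.
```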

Because the model only ever sees these cropped images, it learns to reproduce similar crops in its own outputs. For example, humans are often generated without feet or heads, and swords consist of only a blade, with the hilt and point lying outside the frame.
As we are creating an image generation model to accompany our storytelling experience, it is important that our model is able to produce proper, uncropped characters, and generated knights should not be holding a metallic-looking straight line extending to infinity.

Another issue with training on cropped images is that it can lead to a mismatch between the text and the image.
For example, an image with a `crown` tag will often no longer contain a crown after a center crop is applied and the monarch has thereby been decapitated.
We found that using random crops instead of center crops only slightly improves these issues.
Using Stable Diffusion with variable image sizes is possible, although we noticed that going too far beyond the native resolution of 512x512 tends to introduce repeated image elements, while very low resolutions produce indiscernible images.
Still, this indicated to us that training the model on variable-sized images should be possible. Training on single, variable-sized samples would be trivial, but it would also be extremely slow and more prone to training instability due to the lack of regularization provided by mini-batches.
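Bucketing resolves this tension: images are grouped into a set of shared resolutions with roughly the same pixel count as 512x512, and each mini-batch is then drawn from a single bucket, so every sample in a batch shares a resolution without being square-cropped. The following is a rough sketch of that idea, not the released implementation; the dimension range, step size, and assignment rule here are simplified assumptions:

```python
import math

MAX_PIXELS = 512 * 512  # keep every bucket near the native pixel budget

def make_buckets(step: int = 64, min_dim: int = 256, max_dim: int = 1024):
    """Enumerate (width, height) buckets whose area stays within MAX_PIXELS."""
    buckets = set()
    width = min_dim
    while width <= max_dim:
        # Largest multiple of `step` that keeps the area within budget.
        height = min(max_dim, (MAX_PIXELS // width) // step * step)
        if height >= min_dim:
            buckets.add((width, height))
            buckets.add((height, width))  # portrait counterpart
        width += step
    return sorted(buckets)

def assign_bucket(width: int, height: int, buckets):
    """Pick the bucket whose aspect ratio is closest in log space."""
    log_ratio = math.log(width / height)
    return min(buckets, key=lambda b: abs(math.log(b[0] / b[1]) - log_ratio))

buckets = make_buckets()
print(assign_bucket(1200, 800, buckets))  # -> (640, 384) with these parameters
```

Comparing aspect ratios in log space treats a bucket that is twice as wide and one that is twice as tall as equally distant, which keeps the assignment symmetric between landscape and portrait images.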
We hope to see many non-cropped images in the future!