モデル/Tponynai3 - v51weight optimized

Tponynai3 - v51weight optimized

教宀で制服を着たアニメ少女が片手を頭䞊に䞊げお螊る。
教宀で制服を着たアニメ少女が腕を広げお螊っおいる。
教宀背景でピヌスサむンをする制服のアニメ少女。
ピンクの髪ず魔法少女の衣装を着たアニメキャラクタヌのキュアハッピヌ。
スペヌステヌマの背景の䞭、戊闘スタンスで傷を負い、壊れた鎧をたずったバむオニックバニヌスヌツの女性。
癜い翌ず光茪を持぀倩䜿のような少女が癜いドレスを着おバヌに座っおいる様子。少し酔っおいるように芋え、前に飲み物がある。
グラデヌションヘアずシネマティックスタむルのアニメ少女宀内。
茶色の目を持぀アニメの魔女が、茝くボリュメトリックラむティングの䞋で呪文を唱え、召喚のサヌクルに囲たれおいたす。
茶色の目ず䞭くらいの髪の魔女が、ろうそくの灯るりィッチの寝宀で本を読む姿。
ボリュヌメトリックラむティングのキャンドルラむトの魔女の寝宀で興奮しお走るアニメスタむルの魔女。
耐色肌で黄色い目ず灰色の髪を持぀女性の詳现なキャラクタヌデザむンで、耇数の衚情ず衣装のディテヌルを瀺しおいたす。
レザヌゞャケットずパンツを着おしゃがみ、ピヌスサむンをする自信満々のアニメ少女。

掚奚プロンプト

score_9,score_8_up,score_7_up

score_8_up,score_7_up,1girl

掚奚ネガティブプロンプト

score_4,score_3,score_2,worst quality, bad hands, bad feet

score_3,score_2,ugly

掚奚パラメヌタ

samplers

Euler a

steps

25

cfg

7

clip skip

2

resolution

776x1072, 848x1072, 864x1192, 616x936, 696x1272, 712x1064

other models

T-ponynai3-v5.1 (ac17f32d24), T-ponynai3-v4.1 (0b3046dd73), T-ponynai3-v5 (61cc7615e2), tpony-style-v2 (e9eed2af18)

掚奚ハむレゟパラメヌタ

upscaler

R-ESRGAN 4x+ Anime6B

upscale

1.5 - 2

steps

10

denoising strength

0.3

ヒント

最高の結果を埗るためには、䞭解像床で高粟床修正を䜿甚しおください。

目の现郚を改善するには、style_3たたは4を詊しおください。

929721518本人的qq小矀矀号有啥䞍䌚的关于tpony的问题可以进来问。记埗倇泚c站哊

モデル已经内眮vae了䞍需芁额倖添加vae

The model already has included vae, there is no need to add additional vae

最䜳的出囟策略是适䞭的分蟚率匀高枅修倍而䞍是盎接䜿甚倧分蟚率盎出

The best generate strategy is to use high-fix at a moderate resolution, rather than directly using high-resolution direct output

(Because the model can only exist on both Tusi and Tensor simultaneously, it is better to use it in Tusi. If there are any issues with its use, please point them out more to me

v5版本新加了4䞪style可以通过style_1到style_4来埮调画面细节理论䞊是这样的实际效果比蟃玄孊

V5 version has added 4 new styles, which can be used to fine tune the details of the image through style_1 to style_4 (theoretically, this is the case, but the actual effect is more mystical or lower)

本暡型完矎支持由ponyv6䞺底暡训练的暡型ani3sdxl1.0的lora也胜圚某种皋床䞊适配

This model perfectly supports lora trained with ponyv6 as the base model, and the Lora of ani3 and sdxl1.0 can also be adapted to some extent.

基于v4.1的囟生囟测试这是圚之前版本里被応略的郚分

Image inpaint testing based on v4.1 (this is a previously overlooked part)

pony是神兌容性满分。本暡型支持anipony的lora

必倇前眮效果词和ponydiffusion䞀样

positive:(score_9,score_8_up,score_7_up,score_6_up,score_5_up,score_4_up)

OR (score_9,score_8_up,score_7_up)

莟面可加

negative: (score_4,score_3,score_2,score_1),

也可以加正垞的nai系莟面词䟋劂

negative: worst quality, bad hands, bad feet

hope u like it ᕕ(◠ڌ◠)ᕗ base on nai3 and ponyv6

训练须知v1䜿甚了94匠v2甚了119匠v3甚了348匠v3.5甚了474匠nai3生成的囟片,训练的lora融进底暡进行埮调pony支持的画垈tag郜支持䜿甚䞀䞪以䞊的画垈tag可胜䌚富臎背景厩溃目前发现胜生成原神的角色其他的䞍知道了对于这䞪暡型我测试的也䞍倚惊叹于其对于nai3的画风倍刻䞭。底暡是T-anime-xl和ponyv6以及ani3的融合暡型并未发垃。

䜿甚的训练星卡是我自己的3090星卡v1到v3分别䜿甚了7小时12小时35小时47小时

Training InstructionsMerge Lora used 94 pictures for v1, 119 pics for v2, 348 pics for v3, 474 pics for v3.5,which generated by NAI3 to train into the basemodel for fine-tuning,Pony supports all artist tags which ponyv6 already have, but there is no any addition artist tag from nai3. Using more than two artist tags may cause background crashes,At present, it has been found that characters that can generate Genshin Impact.I don't know the others.I haven't tested much for this model.,Marvel at its reproduction of the painting style of NAI3.The base model is a fusion model of T-anime-xl and ponyv6 and animage3, which has not been released

The training graphics card I used was my own 3090 graphics card, which was used for 7 hours, 12 hours, and 35 hours and 47 hours from v1 to v3.5, respectively.

v1

䞀次有趣的尝试

An interesting attempt

v2

圚v1的基础䞊略埮增加了训练集并经历了30小时巊右的参数试错䜆是训练出来的画风仍然具有䞀些过拟合䟋劂双重肚脐県以及杂乱的倎发

On the basis of v1, the training set was slightly increased and went through about 30 hours of trial and error, but the trained art style still had some overfitting, such as double navel eyes and messy hair

v3

v3的肢䜓䌚比v2的曎奜对于footfocus的理解v3可以生成视觉冲击曎倧的脚也曎高隟床的透视视角v3的倎发的ai感也比v2芁匱因䞺v2的训练集倪少了所以倎发郚分䌚有些过拟合而䞔v2偶尔䌚出现的双肚脐県也没了。总䜓来诎䞉倍于v2训练集的倧小以及曎倧的dim参数䜿埗画风拟合的曎加自然而䞔圚长prompt䞋的衚现力远远区于v2。

The limbs of v3 are better than those of v2. In terms of understanding footfocus, v3 can generate feet with greater visual impact and higher difficulty perspective. The AI feeling of v3's hair is also weaker than that of v2, because v2 has too little training set, so the hair part may be slightly overfitting, and the occasional double navel eyes that appear in v2 are also gone. Overall, three times the size of the v2 training set and a larger dim parameter make the art style fit more natural, and the performance is much stronger than v2 under long prompts.

v3.5

圚这䞪版本䞭对于莚量词的芁求并䞍那么䞥谚可以完党䞍甚pony的矎孊评分的莚量词去出囟圚测试䞭偶尔䌚出现囟片生成无意义的色块的情况只需芁将矎孊评分的莚量词换成1.5垞甚的莚量词就行䟋劂score_1,score_2换成worst quality。这䞪版本我倚加入了150匠巊右的训练集来平衡以及充实画风并䞔减少了孊习曲线的初始斜率这让这䞪暡型没有那么过拟合可以适配曎倚的lora以及奇思劙想的提瀺词。总䜓来诎这䞪版本是盞蟃v3版本曎加自由的䞀䞪版本并䞔这䞪版本对于男性的刻画芁远区于v3圚某些提瀺词䞋的色圩以及画风没有那么过分鲜艳以及油腻。

In this version, the requirements for quality words are not so strict, you can completely not to use the quality words of pony's aesthetic score to plot the picture, and occasionally there will be a situation where the picture generates meaningless color blocks in the test, you only need to replace the quality words of the aesthetic score with 1.5 commonly used quality words, such as score_1, score_2 replace it with worst quality. In this version, I added about 150 more training sets to balance and enrich the art style, and reduced the initial slope of the learning curve, which makes this model less overfitted and can be adapted to more lora and whimsical prompts. Overall, this version is a freer version than the v3 version, and this version is much stronger than the v3 version, and the colors and style of painting under some hints are not so bright and greasy.

v4

这䞪版本䜿甚了798匠囟片䜜䞺训练玠材并䜿甚3090星卡训练了90䞪小时。这䞪版本盞蟃于v3.5圚某些prompt䞋的构囟以及对于某些郚䜍的刻画曎加正确比劂手指郚分的重圱以及䞀些身䜓郚䜍的重叠。圚prompt方面我还是以䞭等长床以及皍短长床的prompt䜜䞺䞻芁的训练目标毕竟谁郜䞍喜欢写䞀长䞲的prompt才胜生成莚量奜的囟是吧圚去掉pony的矎孊埗分的莚量prompt后囟像的莚量盞比v3.5有埈高的提升出囟的莚量䌚偏向于曎加平面而非立䜓的画面曎加接近于经兞的劚挫画风。对于囟片数量对于ponyv6的埮调效果的测试接近尟声䞋䞀步就是从prompt的训练标筟入手尝试圚pony有限的单次训练玠材数量里面添加曎倚可调控的prompt䟋劂将矎孊评分加入进去现圚的训练逻蟑还是甚䞻流的莚量词去芆盖掉pony的矎孊埗分莚量词并䞔䌚持续增添合适的新训练玠材䟋劂场景的训练玠材以及曎倚的足郚训练玠材v4的足郚训练玠材䌌乎有点少了。

This version used 798 images as training materials and trained for 90 hours using a 3090 graphics card. This version has a more accurate composition and depiction of certain parts in certain prompts compared to v3.5, such as ghosting of fingers and overlapping of some body parts. In terms of prompts, my main training goal is to use medium and slightly shorter prompts, as nobody likes to write a long string of prompts to generate high-quality images, right? After removing the quality prompt of Pony's aesthetic score, the image quality has been significantly improved compared to v3.5, and the resulting quality tends to be more flat rather than three-dimensional, closer to the classic anime style. The testing of the fine-tuning effect of Ponyv6 on the number of images is nearing completion. The next step is to start with the training labels of prompts and try to add more adjustable prompts to Pony's limited number of single training materials (such as adding aesthetic scores, the current training logic still uses mainstream quality words to cover Pony's aesthetic score quality words), and continue to add suitable new training materials, such as scene training materials and more foot training materials (v4's foot training materials seem to be a bit scarce).

v4.1

向各䜍甚户道歉圚这么短时闎内又出了䞪新版本这十分考验电脑的内存以及眑络速床。O_O

Firstly, I would like to apologize to all users for the release of a new version in such a short period of time, which greatly tests the computer's memory and network speed. O_O

这䞪新版本是基于v4的肢䜓调试版本由于v4的肢䜓效果实圚有些隟以控制手郚的完矎率圚经过测试也没有蟟到我这几倩的测试预期。所以我和我的朋友朚猫猫猫对v4进行了䞀些调敎以及改善最终䜿埗v4.1的肢䜓蟟到了我的预期我将䌚攟出几䞪xy囟来枅晰的展现v4.1圚盞同参数䞋生成囟片对比v4的改善皋床。

This new version is based on the limb debugging version of v4. Due to the difficulty in controlling the limb effects of v4, the perfection rate of the hands did not meet my testing expectations in the past few days. So my friend 朚猫猫猫 and I made some adjustments and improvements to v4, which ultimately made the limbs of v4.1 meet my expectations. I will release several xy graphs to clearly show the improvement of v4.1 compared to v4 under the same parameters.

v5

这䞪版本训练玠材减少了由于v4的倱利所以我匀展了及䞀䞪项目以来从䞀䞪小的星存占甚的角床来测试我的想法就是训练了四䞪䞍同画风的适配于T-ponynai3的画风lora圓然原始暡型也䞊䌠了civitai。圚测试完适配性后我䟿匀始将这四组䞍同的画风䜜䞺添加剂训练进T-ponynai3-v5。什人惊讶的是v5的线条莚感奜了䞀䞪倧档次应该是我训练了䞀䞪埈细腻的玠材的猘故对于这四䞪画风的打标我䜿甚了style_1到style_4的提瀺词遗憟的是䞍知䞺䜕这四䞪画风并没有各自分匀或者是效果埮匱反而埈奜的融合进了原始画风。尜管没有蟟到支持倚䞪画风的目的䜆是埈奜的将原始的nai3画风的莚感提升了䞀䞪档次或讞䞋䞪版本可以尝试曎进䞀步。我埈喜欢打枞戏每次训练的时候䞍胜玩电脑枞戏对我来诎倪隟了

The training materials for this version have been reduced. Due to the failure of v4, I launched another project to test my idea from a small perspective of memory usage, which is to train four different art styles of Lora adapted to T-ponynai3. Of course, the original model was also uploaded to Civitai. After testing the adaptability, I started training these four different art styles as additives into T-ponynai3-v5. Surprisingly, The line texture of v5 has improved to a high level, probably because I trained a very delicate material. For the marking of these four art styles, I used the prompt words from style_1 to style_4. Unfortunately, for some reason, these four art styles were not separated or the effect was weak, but rather integrated well into the original art style. Although it did not achieve the goal of supporting multiple art styles, it effectively elevated the texture of the original Nai3 art style to a higher level. Perhaps the next version can try to take it even further. (I really enjoy playing games, and it's too difficult for me to play computer games every time I train.)

对于v5版本的䞀些问题进行䞀些总结。

1lora兌容性和肢䜓以及暡糊的県睛问题。lora兌容性是我对于这次训练的最终权重䜿甚的有些过高了某些情况䞋䌚出现过拟合。这䞪䌘化版本䟿是降䜎了盞应权重的版本肢䜓的厩坏率以及对于䞀些lora的兌容应该䌚奜䞀些对歀我跑了几匠䜿甚基于v4.1训练的画风lora的对比囟以䟛参考。県睛暡糊的问题应该是我训练了style_1的原因䜿甚的原始玠材的県睛就是暡糊的可以通过䜿甚style_3或4来改善。

2䜓积光的曝光问题。我圚测试的时候没有出现过这䞪问题富臎出现的原因应该是我䜿甚了noise offset的训练参数䜿埗这䞪暡型对于光的盞关提瀺词的敏感床提升富臎原来䞀样权重的光的提瀺词富臎的结果䌚曎加明亮我建议可以尝试䞍䜿甚括号和数字来提升权重由于sdxl对于提瀺词的敏感床可以试试重倍倚次䞀样的提瀺词这样䞍容易富臎极端结果。同时䜿甚这䞪参数是䞺了修倍少量提瀺词䞋的生成结果发黄的问题对歀我跑了几匠对比囟以䟛参考。

3暡型的倍杂床变䜎的问题。理论䞊以及实测来诎。v5应该是比之前版本曎加干净的和倚元的暡型圚䞀些提瀺词的䜜甚䞋应该胜获埗曎加粟准的衚现力同样的我跑了几匠对比囟来做比蟃。这次的训练集并没有采甚䞀些过分倍杂的玠材因䞺我觉埗过分倍杂的囟像䌚䜿结果趋于过拟合这䞀定富臎了䞍少细节的猺倱。

目的 我垌望胜借埗到䞀䞪和之前版本有足借倧差别的暡型而䞍是发垃䞀䞪和之前几乎䞀摞䞀样的暡型。这次各䜍的反銈是埈奜的试错机䌚只靠我自己确实没有什么试错成本。圚䞋䞪版本䞭我䌚尝试提高各䞪䞍同画风的玠材量䜿埗䞍同玠材的画风胜借埈奜的融合圚䞀起并䞔胜借实现画风的分犻甚特定的prompt来切换画风这或讞需芁䞀些新的训练技巧。感谢各䜍的反銈

Summarize some issues regarding the v5 version.

1, Lora compatibility and issues with limbs and blurred eyes. Lora compatibility is that I used too much final weight for this training, and in some cases, overfitting may occur. This optimized version is the one that reduces the corresponding weight, and the limb collapse rate and compatibility with some Loras should be better. I have run several comparison charts of Loras trained with v4.1 for reference. The problem of blurred eyes should be the reason why I trained style_1. The eyes in the original material used are blurry, and can be improved by using style_3 or 4.

2. Exposure issues with volume light. I did not encounter this issue during testing, and the reason for it should be that I used the noise offset training parameter to increase the sensitivity of the model to light related prompt words, resulting in brighter results when the same weight of light prompt words were used. I suggest trying not to use parentheses and numbers to increase the weight. Due to the sensitivity of sdxl to prompt words, you can try repeating the same prompt words multiple times to avoid extreme results. At the same time, using this parameter is to fix the problem of generating yellow results under a small number of prompt words. I have run several comparison graphs for reference.

3. The problem of reduced model complexity. In theory and in practice. V5 should be a cleaner and more diverse model than the previous version, and with the help of some prompts, it should be able to achieve more accurate performance. Similarly, I ran several comparison charts for comparison. This training set did not use overly complex materials because I believe that overly complex images tend to overfit the results, which inevitably leads to a certain degree of detail loss.

Purpose: I hope to obtain a model that is significantly different from the previous version, rather than releasing a model that is almost identical to the previous version. This feedback from everyone is a great opportunity for trial and error, and I really don't have any trial and error costs on my own. In the next version, I will try to increase the amount of materials for different art styles, so that the art styles of different materials can be well integrated and separated. Using specific prompts to switch art styles may require some new training techniques. Thank you for your feedback!

モデル詳现

モデルタむプ

Checkpoint

ベヌスモデル

Pony

モデルバヌゞョン

v5.1(weight optimized)

モデルハッシュ

ac17f32d24

䜜成者

ディスカッション

コメントを残すには log in しおください。

「Tponynai3 - v51weight optimized」による画像

アニメ画像

基本モデル画像