entry-slick
entry-slick
entry-slick
entry-slick
entry-slick
entry-slick
About DeepFloyd IF

DeepFloyd IF is a modular neural network based on the cascaded approach.

img

- IF is built with multiple neural modules (independent neural networks that tackle specific tasks), joining forces within a single architecture to produce a synergistic effect.

- IF generates high-resolution images in a cascading manner: the action kicks off with a base model that produces low-resolution samples, which are then boosted by a series of upscale models to create stunning high-resolution images.

- IF’s base and super-resolution models adopt diffusion models, making use of Markov chain steps to introduce random noise into the data, before reversing the process to generate new data samples from the noise.

- IF operates within the pixel space, as opposed to latent diffusion (e.g. Stable Diffusion) that depends on latent image representations

img

PROMPT DESCRIPTION

A fuzzy cute owlA spiky fierce porcupineA scaly mischievous dragon

is drinking very dark beer in the baris playing volleyball on the beachis driving the car

in a photorealistic stylein a street art stylein a Chinese watercolour style

picture

What makes DeepFloyd IF better?

img

- The IF-4.3B base model is the largest diffusion model in terms of the number of effective parameters of the U-Net

- The IF-4.3B model achieves a state-of-the-art zero-shot FID score of 6.66, outperforming both Imagen and the diffusion model with expert denoisers eDiff-I

- A deep text understanding is achieved by employing a large language model T5-XXL as a text encoder, using optimal attention pooling, and utilizing the additional attention layers in super-resolution modules to extract information from the text.

PROMPT DESCRIPTION

A cuddly adorable koalaA slimy agile frogA playful furry fox

playing the drums in a rock bandparticipating in a hot dog eating contestworking as a pilot

in a photorealistic stylein a mosaic stylein a pop art style

picture

CAPABILITIES

Different texts, styles, textures, spatial relations, concepts fusion — IF can unravel it all.

Avatar

Avatar

Avatar

Avatar

Avatar

Avatar

Avatar

Avatar

STYLE VARIATIONS

From the dark side to the bright side: image-to-image translation can be achieved by resizing the original image to 64 pixels, adding some level of noise via forward diffusion, and denoising the image with a new prompt during the backward diffusion process.

This approach opens up vast possibilities to tweak the style, patterns, and details in the output while preserving the essence of the source image. The best part is that no fine-tuning required.

img

CREATIVE USE CASES

Words fill the air: IF has a special affection for the text — and can embroider it on fabric, insert it into a stained-glass window, include it in a collage, light it up on a neon sign. Most text-to-image models you can try have struggled with these use cases up until now.

Visit Official Website

https://deepfloyd.ai/deepfloyd-if

DeepFloyd IF
Soo.. Finally! Meet IF – a state-of-the-art text-to-image model that can also generate 'I ❤️ DeepFloyd' on your mug

👀
🐱
🔻

AVShonenkov _bra_ket _gugutse_ susiaiv vauimpuls StabilityAI
#deepfloydif IF - a Hugging ...GitHub - deep-f...DeepFloyd IF — ...
image
Share
DeepFloyd IF
Soo.. Finally! Meet IF – a state-of-the-art text-to-image model that can also generate 'I ❤️ DeepFloyd' on your mug

👀 IF - a Hugging ...
🐱 GitHub - deep-f...
🔻 DeepFloyd IF — ...

@AVShonenkov @_bra_ket @_gugutse_ @susiaiv @vauimpuls @StabilityAI
#deepfloydif
image
Share
Community Posts
DeepFloyd IF
Soo.. Finally! Meet IF – a state-of-the-art text-to-image model that can also generate 'I ❤️ DeepFloyd' on your mug

👀
🐱
🔻

AVShonenkov _bra_ket _gugutse_ susiaiv vauimpuls StabilityAI
#deepfloydif IF - a Hugging ...GitHub - deep-f...DeepFloyd IF — ...
image
Share
DeepFloyd IF
Soo.. Finally! Meet IF – a state-of-the-art text-to-image model that can also generate 'I ❤️ DeepFloyd' on your mug

👀 IF - a Hugging ...
🐱 GitHub - deep-f...
🔻 DeepFloyd IF — ...

@AVShonenkov @_bra_ket @_gugutse_ @susiaiv @vauimpuls @StabilityAI
#deepfloydif
image
Share
DeepFloyd IF
RT @EMostaque: Code for IF by @deepfloydai is up, amazing work by the team.

Weights in a couple of days along with blog posts, inference…
Share
DeepFloyd IF
Our Astronomy Domine _gugutse_ and Interstellar Overdrive _bra_ket shed light on the DeepFloyd IF's architecture and performance at the Weights & Biases MLOps virtual conference, Fully Connected 2023. Building The Ne...
link
Building The Next Large Model: DeepFloyd LLM + Text-to-Image = IF (Stability AI)
*From Fully Connected 2023*Daria Bakshandeava and Misha Konstantinov of DeepFloyd discuss large language modeling for text-image models, with a focus on thei...
1
Share
DeepFloyd IF
Building The Ne...

Our Astronomy Domine @_gugutse_ and Interstellar Overdrive @_bra_ket shed light on the DeepFloyd IF's architecture and performance at the Weights & Biases MLOps virtual conference, Fully Connected 2023.
link
Building The Next Large Model: DeepFloyd LLM + Text-to-Image = IF (Stability AI)
*From Fully Connected 2023*Daria Bakshandeava and Misha Konstantinov of DeepFloyd discuss large language modeling for text-image models, with a focus on thei...
Share
DeepFloyd IF
letters made of clouds that says 'really soon' above beautiful ocean by Ksenia
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
letters made of clouds that says 'really soon' above beautiful ocean by Ksenia
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
a photo of a small tropical frog sitting on a branch in an overgrown tropical forest, amongst many branches, volumetric mist, rays of light, national photographic, canon 4k, nature photography, 4k by nin_artificial
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
a photo of a small tropical frog sitting on a branch in an overgrown tropical forest, amongst many branches, volumetric mist, rays of light, national photographic, canon 4k, nature photography, 4k by @nin_artificial
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
To die by your side
Is such a heavenly way to die

Dobrokotov: The Smiths lyric video, but the text is AI Generated (via #DeepFloyd)
video
01:03
Share
DeepFloyd IF
DeepFloyd ❤️ LAION

LAION: We release a new ViT-G/14 CLIP model with OpenCLIP which achieves 80.1% zero-shot accuracy on ImageNet and 74.9% zero-shot image retrieval (Recall5) on MS COCO. As of January 2023, this is the best open source CLIP model.

Reaching 80% ze...laion/CLIP-ViT-...
image
Share
DeepFloyd IF
To die by your side
Is such a heavenly way to die
-------------
From @Dobrokotov:The Smiths lyric video, but the text is AI Generated (via #DeepFloyd)
Share
DeepFloyd IF
DeepFloyd ❤️ LAION
-------------
From @LAION:We release a new ViT-G/14 CLIP model with OpenCLIP which achieves 80.1% zero-shot accuracy on ImageNet and 74.9% zero-shot image retrieval (Recall@5) on MS COCO. As of January 2023, this is the best open source CLIP model.
Share
DeepFloyd IF
'happy birthday mr president', color close-up photo of a ideal blond marilyn monroe in a white dress holding in her perfect hands large paper with text "happy birthday mr president". the flowers ... by @_bra_ket
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
RT @NathanThinks: "Death will come, and she will have your eyes"

Took me a while 🙈 @_bra_ket

#DeepFloyd #DeepFloydIF t.co/oSgM6mA
Share
DeepFloyd IF
“don't be a coward”. color photo of a real leopard in a german bar with germany flags holds a sign in his hands with the text "don't be a coward", photo realism, portrait photography, photo reali ... by _bra_ket
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
“don't be a coward”. color photo of a real leopard in a german bar with germany flags holds a sign in his hands with the text "don't be a coward", photo realism, portrait photography, photo reali ... by @_bra_ket
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
vsco preset, 35mm photo, inside a bar, film grain, above a bar, a cursive neon sign saying "opensource me" by _bra_ket
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
vsco preset, 35mm photo, inside a bar, film grain, above a bar, a cursive neon sign saying "opensource me" by @_bra_ket
#deepfloydif #deepfloyd
image
Share
DeepFloyd IF
4k dslr photo of "cute yellow canary bird head with tennis ball body". wow thats detailed, hyper realistic, ultra fine details. side view by AVShonenkov
#deepfloydif #deepfloyd
image
Share