
Article originally appeared on Replicate.

Editor’s note

The news in open source this week was all FLUX.1. People have been amazed by the open image models, running nearly 5 million predictions on FLUX.1 [schnell] in the first week!

Fine-tuning scripts are starting to come out. Expect to see some interesting new downstream models next week. For now we have image to image generation, and a ton of cool images people are creating. Check out our X feed and blog posts for some great examples.

deepfates


Image to image generation with FLUX.1

FLUX.1 [dev] now supports image-to-image transformations on Replicate. Bring a starter image, write a prompt, and play with the prompt strength input to balance influence between the starting image and the prompt.

This works by starting generation from your starter image instead of from pure random noise. It does well for style transfer and composition control, but has its weaknesses. For example, it’s hard to get black-and-white line art from a color image. Imagine how far the pixels would have to wiggle to get there.
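If you want a feel for what the prompt strength knob is doing, here is a toy sketch in plain Python. It is not the FLUX.1 or Replicate implementation. It assumes a simple linear blend between the starter pixels and Gaussian noise, and it assumes that only a matching fraction of the denoising steps gets run. The names `noised_start`, `steps_to_run`, and `prompt_strength` are hypothetical and only mirror the "prompt strength" input described above.

```python
import random


def noised_start(pixels, prompt_strength, seed=0):
    """Blend starter pixels with Gaussian noise (assumed linear mix).

    prompt_strength=0.0 returns the starter image unchanged;
    prompt_strength=1.0 returns pure noise, so the image is ignored.
    """
    if not 0.0 <= prompt_strength <= 1.0:
        raise ValueError("prompt_strength must be between 0 and 1")
    rng = random.Random(seed)
    return [
        (1.0 - prompt_strength) * p + prompt_strength * rng.gauss(0.0, 1.0)
        for p in pixels
    ]


def steps_to_run(total_steps, prompt_strength):
    """Assumption: higher strength means more denoising steps are run."""
    return min(total_steps, int(total_steps * prompt_strength))


if __name__ == "__main__":
    starter = [0.2, 0.5, 0.9]  # stand-in for real pixel values
    for strength in (0.0, 0.5, 1.0):
        print(strength, noised_start(starter, strength), steps_to_run(28, strength))
```

In this sketch, a low strength leaves most of the starter image in place, so the model only has a few steps to nudge it. That is one way to picture why a big jump, like color photo to line art, is hard.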

Experiment and let us know what you find!

try it on Replicate


Cool tools

A video interview featuring Zeke

Streamlit has launched a new video series, and in the inaugural episode, our very own Zeke joins to demonstrate how to build AI-powered apps using Replicate. This tutorial covers everything from getting started with Streamlit to integrating various AI models hosted on Replicate.

“The revelation, for me, was that these language models have become so sophisticated, that there are so many different amazing apps that people can build on top of them where all of the hard work is being done by the language model. And all you have to do is build a compelling and simple user interface on top that delivers value to users.” — Zeke

video | code


Research radar

Odyssey: Empowering Agents with Open-World Skills

Odyssey is a new framework that empowers language model agents with open-world skills to explore the vast Minecraft world. It includes an interactive agent with a skill library, a fine-tuned LLaMA-3 model, and a new open-world benchmark.

The framework demonstrates effective planning and exploration capabilities, making it a significant advancement in autonomous agent solutions. More importantly, they have cool videos on their GitHub page. Go watch them.

code | paper


Bye for now

Thanks for reading! If you have any thoughts or feedback, hit reply and let me know. Forward this to a friend who might find it interesting! Smash that subscribe button. Confirm and submit! Consume and obey. Eat at Joe’s. Run AI with an API on Replicate. I love you.

— deepfates
