
Article originally appeared on Replicate.

Editor’s note

The big open source AI news this week is the release of Stable Diffusion 3 Medium. People are already doing cool things with it, but public reaction has been mixed.

On a personal note, I got banned from X Dot Com. Apparently it is against the rules to change your profile picture to the old Twitter logo and announce WE ARE SO BACK.

Anyway, here are some things that caught my eye this week. Find me on Bluesky, I guess.

deepfates


Stable Diffusion 3 Medium

The long-awaited image generation model is released in the 2B size (no word yet about the larger 8B version).

Users say the model is much better at creating legible text, but that it has problems with anatomy and composition.

Model weights are available under a non-commercial license.

try on replicate


Cool tools

Find concepts in GPT models

OpenAI does dictionary learning on their own models to extract and interpret patterns that may correspond to specific concepts. It's a similar technique to the one Anthropic used to create Golden Gate Claude.

They released a research paper and feature explorer, but also code that will steer the (practically retro at this point) GPT-2-small model.

post | paper | github | visualizer

Real-time speech to text in the browser

The Transformers.js project has implemented OpenAI’s Whisper model in JavaScript. This means you can open a browser tab, talk to it, and get an accurate transcript of your words in real time. No coding required.

demo


Research radar

A new way to tokenize images

Researchers at ByteDance found a way to encode images into a single short vector instead of a 2D grid of patches. The new vectors can be as short as 32 elements, instead of 256 or even 1024 for existing methods.

This could make multimodal models and image generators much more compute efficient.

post | paper
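
For a rough sense of why shorter image encodings matter, here is a back-of-the-envelope sketch. It assumes each element of the encoding is treated as one token by a downstream transformer, and it only counts self-attention pairs, which grow with the square of the sequence length. The function name `token_savings` is made up for illustration and is not from the paper.

```python
def token_savings(baseline_tokens: int, compact_tokens: int = 32) -> tuple[float, float]:
    """Return (sequence-length reduction, attention-pair reduction).

    Assumption: one token per element of the image encoding.
    """
    length_ratio = baseline_tokens / compact_tokens
    # Self-attention compares every token with every other token,
    # so the number of pairs scales with the square of the length.
    attention_ratio = length_ratio ** 2
    return length_ratio, attention_ratio


for baseline in (256, 1024):
    length_ratio, attention_ratio = token_savings(baseline)
    print(
        f"{baseline} -> 32 tokens: {length_ratio:.0f}x shorter, "
        f"about {attention_ratio:.0f}x fewer attention pairs"
    )
```

This ignores everything else in a real model (feed-forward layers, the encoder itself), so treat it as an upper-bound intuition, not a benchmark.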


Changelog

H100s are coming

We’ll soon be adding support for NVIDIA’s powerful H100 GPUs.

If you’re interested in getting early access to H100s, email

changelog


Bye for now

How am I doing so far? You going to keep opening these letters? Let me know, so I can fix everything to be exactly perfect. Thanks in advance.

— deepfates
