TAEM1 is a Tiny AutoEncoder for the Mochi 1 (Preview) video generation model. TAEM1 should be able to encode/decode Mochi's latents more cheaply than the full-size Mochi VAE (at the cost of slightly lower quality). This means TAEM1 should be useful for previewing outputs from Mochi 1.
| Sample Video | Reconstruction with TAEM1 |
| --- | --- |
TAEM1 consists of an MSE-distilled encoder and an MSE+adversarial-distilled decoder, both trained to mimic the behavior of the full-size Mochi 1 VAE.
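As a rough illustration of those objectives (a sketch, not the actual TAEM1 training code), the encoder is assumed to be distilled against the teacher VAE's latents and the decoder against its reconstructions; the discriminator `D`, the teacher interface, and the loss weight are all hypothetical:

```python
# Hypothetical distillation sketch -- NOT the actual TAEM1 training code.
# Assumes a teacher VAE exposing .encode()/.decode() and a discriminator D.
import torch
import torch.nn.functional as F

def encoder_distill_loss(student_enc, teacher_vae, video):
    # MSE between student latents and teacher latents (assumed target).
    with torch.no_grad():
        target = teacher_vae.encode(video)
    return F.mse_loss(student_enc(video), target)

def decoder_distill_loss(student_dec, teacher_vae, latents, D, adv_weight=0.1):
    # MSE between student and teacher reconstructions, plus a hinge-style
    # generator term for sharper detail (adv_weight is a guess).
    fake = student_dec(latents)
    with torch.no_grad():
        real = teacher_vae.decode(latents)
    return F.mse_loss(fake, real) - adv_weight * D(fake).mean()
```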
TAEM1 has the `vae_latents_to_dit_latents` and `dit_latents_to_vae_latents` transforms baked in, and consumes/produces [0, 1]-scaled images, so TAEM1 shouldn't require much additional code to use.
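Usage should look roughly like the sketch below (the `TAEM1` class name, the `encoder`/`decoder` attributes, the constructor, and the tensor shapes are assumptions based on this README, not a confirmed API):

```python
# Minimal roundtrip sketch (hypothetical API; see taem1.py for the real one).
import torch
from taem1 import TAEM1  # assumed module/class name

taem1 = TAEM1().to("cuda").eval()  # weight loading details omitted/assumed

# A [0, 1]-scaled video: (batch, channels, frames, height, width); the
# shape here is illustrative, not Mochi's native resolution.
video = torch.rand(1, 3, 25, 256, 256, device="cuda")

with torch.no_grad():
    # vae_latents_to_dit_latents is baked in, so these latents should be
    # directly usable as Mochi DiT latents.
    latents = taem1.encoder(video)
    # dit_latents_to_vae_latents is likewise baked in; output is [0, 1].
    recon = taem1.decoder(latents).clamp(0, 1)
```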
TAEM1 is causal (like the Mochi 1 VAE), so you can run it either timestep-parallel (faster, higher memory usage) or timestep-sequential (slower, lower memory usage).
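An illustrative comparison of the two modes, assuming a decoder callable `dec` over `(batch, channels, frames, height, width)` latents; note the sequential path only matches the parallel path if the decoder caches its causal state between calls, which is assumed here (the real `taem1.py` entry points may differ):

```python
import torch

@torch.no_grad()
def decode_parallel(dec, latents):
    # Timestep-parallel: one call over all latent frames.
    # Fastest, but peak memory grows with the number of frames.
    return dec(latents)

@torch.no_grad()
def decode_sequential(dec, latents):
    # Timestep-sequential: one latent frame per call, lower peak memory.
    # Matches decode_parallel only if dec carries its causal state across
    # calls (assumed here, as a streaming implementation would).
    frames = [dec(latents[:, :, t:t+1]) for t in range(latents.shape[2])]
    return torch.cat(frames, dim=2)
```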
You can try running `python3 taem1.py test_video.mp4` to test reconstruction. Mochi T2V demo notebook TBD.