Mihir Prabhudesai mihirp1998

researcher at CMU MLD

52 followers · 0 following

Pittsburgh

Pinned

AlignProp Public

AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…

Python 298 11
VADER Public

Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

Python 299 16
alexanderswerdlow/unidisc Public

UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

Python 120 5
wmn-231314/diffusion-data-constraint Public

Official PyTorch implementation and models for the paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion models are significantly more data-efficient than standard left…

Python 90 1
Diffusion-TTA Public

Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.

Python 76 5
huggingface/trl Public

Train transformer language models with reinforcement learning.

Python 15.5k 2.2k