Skip to content
View mihirp1998's full-sized avatar
  • Pittsburgh

Block or report mihirp1998

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. AlignProp AlignProp Public

    AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…

    Python 298 11

  2. VADER VADER Public

    Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

    Python 299 16

  3. alexanderswerdlow/unidisc alexanderswerdlow/unidisc Public

    UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

    Python 120 5

  4. wmn-231314/diffusion-data-constraint wmn-231314/diffusion-data-constraint Public

    Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion models are significantly more data-efficient than standard left…

    Python 90 1

  5. Diffusion-TTA Diffusion-TTA Public

    Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.

    Python 76 5

  6. huggingface/trl huggingface/trl Public

    Train transformer language models with reinforcement learning.

    Python 15.5k 2.2k