From tf becomes from h5 #3

Narsil · 2022-03-16T13:43:36Z

Fixes #{issue number}

* Add first draft * Make model importable * Make SwinForMaskedImageModeling importable * Fix imports * Add missing inits * Add support for Swin * Fix bug * Fix bug * Fix another bug * Fix Swin MIM implementation * Fix default encoder stride * Fix Swin * Add print statements for debugging * Add image_size data argument * Fix Swin * Fix image_size * Add print statements for debugging * Fix print statement * Remove print statements * Improve reshaping of bool_masked_pos * Add support for DeiT, fix tests * Improve docstrings * Apply new black version * Improve script * Fix bug * Improve README * Apply suggestions from code review * Remove DS_Store and add to gitignore * Apply suggestions from code review + fix BEiT Flax * Revert BEiT changes * Improve README * Fix code quality * Improve README Co-authored-by: Niels Rogge <nielsrogge@Nielss-MBP.localdomain> Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

* doc for adding a model to the hub * run make style * resolved conversation * removed a line * removed ) * Update docs/source/add_new_model.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/add_new_model.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * make style Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

huggingface#15067) Very big changes concerning the tokenizer fast of CLIP which did not correspond to the tokenizer slow of CLIP Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

…ggingface#15684)

* add undo padding * fix * fix tuple issue * make style and quality * move unpad logic to LongformerEncoder + unpad attentions + update tests * move unpad logic to TFLongformerEncoder Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* Init PLBART * Add missing configuration file * Add conversion script and configurationf ile * Fix style * Update modeling and conversion scripts * Fix scale embedding in config * Add comment * Fix conversion script * Add classification option to conversion script * Fix vocab size in config doc * Add tokenizer files from MBart50 * Allow no lang code in regular tokenizer * Add PLBart Tokenizer Converters * Remove mask from multi tokenizer * Remove mask from multi tokenizer * Change from MBart-50 to MBart tokenizer * Fix names and modify src/tgt behavior * Fix imports for tokenizer * Remove <mask> from multi tokenizer * Fix style * Change tokenizer_class to processor_class * Add attribute map to config class * Update modeling file to modified MBart code * Update configuration file to MBart style configuration * Fix tokenizer * Separate tokenizers * Fix error in tokenization auto * Copy MBart tests * Replace with MBart tokenization tests * Fix style * Fix language code in multi tokenizer * Fix configuration docs * Add entry for plbart_multi in transformers init * Add dummy objects and fix imports * Fix modeling tests * Add TODO in config * Fix copyright year * Fix modeling docs and test * Fix some tokenization tests and style * Add changes from review * Fix copies * Fix docs * Fix docs * Fix style * Fix year * Add changes from review * Remove extra changes * Fix base tokenizer and doc * Fix style * Fix modeling and slow tokenizer tests * Remove Multi-tokenizer Converter and Tests * Delete QA model and Multi Tokenizer dummy objects * Fix repo consistency and code quality issues * Fix example documentation * Fix style * Remove PLBartTokenizer from type checking in init * Fix consistency issue * Add changes from review * Fix style * Remove PLBartTokenizerFast * Remove FastTokenizer converter * Fix AutoTokenzier mapping * Add plbart to toctree and fix consistency issues * Add language codes tokenizer test * Fix styling and doc issues * Add fixes for failing tests * Fix copies * Fix failing modeling test * Change assert to assertTrue in modeling tests

`HfDeepSpeedConfig` accepts a dictionary or path to `.json` file containing DS configurations, not `TrainingArguments`.

* fix bug in PT speech-encoder-decoder * add pt test for `inputs is not None` * fix test * new pt test * Update tests/test_modeling_speech_encoder_decoder.py * make fixup Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Add missing PLBart entry in index * Fix README * Fix README * Fix style * Change to master model doc

Remove input and target reset after preprocessing

) * begin script * update script * fix features and data args * main * add requirements * add column name args * fix captions * don't jit transforms * fix caption * fix labels, handle attention mask * convert pixel values to numpy * labels => input_ids * transform images on the fly * use AutoModel class, create the hybird model outside of the script * fix version message * add readme * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * adderss review comments * add more comments * allow freezing vision and text models Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Add layer_idx to CrossAttention * Add layer_idx to crossattention of ImageGPT model

* Working example with to_tf_dataset * updated text_classification * more comments

…uggingface#15717)

* TF train_step docstring

@sgugger

* Add GeLU10 (clipped version of GeLU) to transformers to improve quantization performances. * Add unittests. * Import tensorflow after `is_tf_available` check. * Fix tensorflow wrong function `tf.tensor` to `tf.constant` * style. * use `tf.math.max` * Fix tf tests. * style. * style style style style style style * style style style style style style * Address @sgugger comments. * Fix wrong operator for raising ValueError for ClippedGELUActivation.

* [Wav2Vec2 Time Stamps] * Add first version * add word time stamps * Fix * save intermediate space * improve * [Finish CTC Tokenizer] * remove @ * remove @ * push * continue with phonemes * up * finish PR * up * add example * rename * finish * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * correct split * finalize Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Co-authored-by: Boumadane Abdelmoumene <moumene.boumadane@gmail.com>

cna -> can

* Fix `HfArgumentParser` when passing a generator * Add missing import * Always convert `dataclass_types` into a list

* [Proposal] Adding ZeroShotImageClassificationPipeline - Based on CLIP * WIP, Resurection in progress. * Resurrection... achieved. * Reword handling different `padding_value` for `feature_extractor` and `tokenizer`. * Thanks doc-builder ! * Adding docs + global namespace `ZeroShotImageClassificationPipeline`. * Fixing templates. * Make the test pass and be robust to floating error. * Adressing suraj's comments on docs mostly. * Tf support start. * TF support. * Update src/transformers/pipelines/zero_shot_image_classification.py Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>

…e#15751)

* first commit * ResNet model correctly implemented. basic modeling + weights conversion is done removed unused doc mdx file doc and conversion script added feature_extractor to auto test minor changes + style + quality doc test Delete process.yml A left over from my attempt of running circleci locally * minor changes * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * new test format * minor changes from conversations * minor changes from conversations * make style + quality * readded the tests * test + README * minor changes from conversations * error in README * make fix-copies * removed regression for classification head * make quality * fixed loss control flow * fixed loss control flow * resolved conversations * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * READMEs * index.mdx * minor changes * updated tests and models * unused import * outputs * Update docs/source/model_doc/resnet.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * added embeddings_size * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * conversation * added push to hub * test * embedding_size * make fix-copies * resolved conversations * CI * changed organization * minor changes * CI * minor changes * conversations * conversation * doc * tests * removed unused docstring * conversation * removed unused outputs * CI Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add type hints for SqueezeBert PyTorch * Add type hints for GPTNeo PyTorch * style fixes * chenged List with Tuple

* encoder works * addded files * norm in stage * convertion script * tests * fix copies * make fix-copies * fixed __init__ * make fix-copies * fix * shapiro test needed * make fix-copie * minor changes * make style + quality * minor refactor conversion script * rebase + tests * removed unused variables * updated doc * toctree * CI * doc * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * resolved conversations * make fixup * config passed to modules * config passed to modules * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * conversations * conversations * copyrights * normal test * tests Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Add Swin2Bart test * Fix swin tests Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

* Configurable Relative Position Max. Distance * fix missing config Co-authored-by: ahmed-elnaggar <ahmed.elnaggar@allianz.com>

* Added spanish translation of quicktour.mdx * Suggestions applied in the revision of the translation Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

* Use tempaltes for all doc building jobs * Add this branch to the doc build * Switch to main branch

…uggingface#16087) * Fix inconsistent example variable naming - Example code for a sequence classification in Tensorflow had spelling mistakes and incorrect and inconsistent naming - Changed variable naming to be consistent with the two other TF examples * Fix incorrect incorrect training examples

…16076) * fix 2 pytorch vilt docstring examples * add vilt to doctest list file * remove device Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* First attempt at TF XLA generation * Fix comments * Update XLA greedy generate with direct XLA calls * Support attention mask, prepare_inputs_for_generation no longer hardcoded for greedy * Handle position_ids correctly * make xla generate work for non xla case * force using xla generate * refactor * more fixes * finish cleaning * finish * finish * clean gpt2 tests * add gpt2 tests * correct more cases * up * finish * finish * more fixes * flake 8 stuff * final rag fix * Update src/transformers/models/rag/modeling_tf_rag.py * finish t5 as well * finish * Update src/transformers/generation_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

This reverts commit 140cf18.

HuggingFaceDocBuilderDev · 2022-03-16T13:57:36Z

The documentation is not available anymore as the PR was closed or merged.

@ydshieh

* chore: initial commit Copied the torch implementation of regnets and porting the code to tf step by step. Also introduced an output layer which was needed for regnets. * chore: porting the rest of the modules to tensorflow did not change the documentation yet, yet to try the playground on the model * Fix initilizations (#1) * fix: code structure in few cases. * fix: code structure to align tf models. * fix: layer naming, bn layer still remains. * chore: change default epsilon and momentum in bn. * chore: styling nits. * fix: cross-loading bn params. * fix: regnet tf model, integration passing. * add: tests for TF regnet. * fix: code quality related issues. * chore: added rest of the files. * minor additions.. * fix: repo consistency. * fix: regnet tf tests. * chore: reorganize dummy_tf_objects for regnet. * chore: remove checkpoint var. * chore: remov unnecessary files. * chore: run make style. * Update docs/source/en/model_doc/regnet.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * chore: PR feedback I. * fix: pt test. thanks to @ydshieh. * New adaptive pooler (#3) * feat: new adaptive pooler Co-authored-by: @Rocketknight1 * chore: remove image_size argument. Co-authored-by: matt <rocketknight1@gmail.com> Co-authored-by: matt <rocketknight1@gmail.com> * Empty-Commit * chore: remove image_size comment. * chore: remove playground_tf.py * chore: minor changes related to spacing. * chore: make style. * Update src/transformers/models/regnet/modeling_tf_regnet.py Co-authored-by: amyeroberts <aeroberts4444@gmail.com> * Update src/transformers/models/regnet/modeling_tf_regnet.py Co-authored-by: amyeroberts <aeroberts4444@gmail.com> * chore: refactored __init__. * chore: copied from -> taken from./g * adaptive pool -> global avg pool, channel check. * chore: move channel check to stem. * pr comments - minor refactor and add regnets to doc tests. * Update src/transformers/models/regnet/modeling_tf_regnet.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * minor fix in the xlayer. * Empty-Commit * chore: removed from_pt=True. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: matt <rocketknight1@gmail.com> Co-authored-by: amyeroberts <aeroberts4444@gmail.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Typos/fixes to link syntax * Trying section headers * Add header formatting for Rule #3

gchhablani and others added 30 commits February 17, 2022 08:42

Fix shapes in model docstrings (huggingface#15696)

426b962

fix CLIP fast tokenizer and change some properties of the slow version (

e93763d

huggingface#15067) Very big changes concerning the tokenizer fast of CLIP which did not correspond to the tokenizer slow of CLIP Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

Fix SiluActivation (huggingface#15718)

416dff7

TF: add initializer_std with a small value in TFFunnelModelTester (hu…

f8ff3fa

…ggingface#15684)

Fix DETR model deprecation warnings for int div (huggingface#15702)

68dec6b

style_doc handles decorators in examples (huggingface#15719)

d5083c3

Fix auto (huggingface#15706)

83f45cd

fix: hfdeepspeed config argument (huggingface#15711)

3de1290

`HfDeepSpeedConfig` accepts a dictionary or path to `.json` file containing DS configurations, not `TrainingArguments`.

Add missing PLBart entry in README (huggingface#15721)

2c2a31f

* Add missing PLBart entry in index * Fix README * Fix README * Fix style * Change to master model doc

Remove input and target reset after preprocessing (huggingface#15741)

a63bd36

Remove input and target reset after preprocessing

Fix minor comment typos (huggingface#15740)

5444687

Add layer_idx to CrossAttention of GPT2 model (huggingface#15730)

142b69f

* Add layer_idx to CrossAttention * Add layer_idx to crossattention of ImageGPT model

TF text classification examples (huggingface#15704)

3956b13

* Working example with to_tf_dataset * updated text_classification * more comments

revert temporary addition to test next version of CLIPTokenizerFast (h…

0187c6f

…uggingface#15717)

added link to our writing-doc document (huggingface#15756)

38bed91

TF train_step docstring (huggingface#15755)

2c3fcc6

* TF train_step docstring

fixed pipeline code (huggingface#15607)

2cdb6db

Co-authored-by: Boumadane Abdelmoumene <moumene.boumadane@gmail.com>

Fix typo on examples/pytorch/question-answering (huggingface#15644)

3db2e8f

cna -> can

Cleanup transformers-cli (huggingface#15767)

db57bb2

Fix HfArgumentParser when passing a generator (huggingface#15758)

05a12a0

* Fix `HfArgumentParser` when passing a generator * Add missing import * Always convert `dataclass_types` into a list

[M2M100, XGLM] fix create_position_ids_from_inputs_embeds (huggingfac…

24588c6

…e#15751)

sgugger and others added 26 commits March 14, 2022 13:26

Use HF_ENDPOINT for custom endpoints (huggingface#16139)

e109edf

update albert with tf decorator (huggingface#16147)

3779325

TF Electra - clearer model variable naming (huggingface#16143)

6458236

Add type hints for GPTNeo PyTorch (huggingface#16127)

8f3ea7a

* Add type hints for SqueezeBert PyTorch * Add type hints for GPTNeo PyTorch * style fixes * chenged List with Tuple

Improve Swin for VisionEncoderDecoder (huggingface#16070)

a7aca42

* Add Swin2Bart test * Fix swin tests Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>

Make transformers.utils.fx. _SUPPORTED_MODELS unique (huggingface#16015)

5a386fb

Shift responsibilities a bit (huggingface#16154)

5664d27

typo "conaining" -> "containing" (huggingface#16132)

cd1ffb4

Configurable Relative Position Max. Distance (huggingface#16155)

5771344

* Configurable Relative Position Max. Distance * fix missing config Co-authored-by: ahmed-elnaggar <ahmed.elnaggar@allianz.com>

Added spanish translation of quicktour.mdx (huggingface#16158)

daa4944

* Added spanish translation of quicktour.mdx * Suggestions applied in the revision of the translation Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>

Use templates (huggingface#16142)

8bfd2fb

* Use tempaltes for all doc building jobs * Add this branch to the doc build * Switch to main branch

[Fix doc example] Fix 2 PyTorch Vilt docstring examples (huggingface#…

e5bc438

…16076) * fix 2 pytorch vilt docstring examples * add vilt to doctest list file * remove device Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

Attempt to load from TF more efficiently in PT without copies.

6060ee2

Update src/transformers/generation_logits_process.py

271e7b4

Prefix shenanigan.

3310870

Adding unexpected and missing support.

4f4c996

Handling shared tensors in PT.

3e88be3

Bailing ?

fde37e9

Cleanup.

7e99cef

Quality.

551fb7d

Can' t find test that requires magic reshape.

140cf18

Revert "Can' t find test that requires magic reshape."

1ed3972

This reverts commit 140cf18.

Narsil closed this Mar 16, 2022

Narsil pushed a commit that referenced this pull request Feb 8, 2023

Typos/fixes to link syntax (huggingface#21450)

28ec07d

* Typos/fixes to link syntax * Trying section headers * Add header formatting for Rule #3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

From tf becomes from h5 #3

From tf becomes from h5 #3

Uh oh!

Narsil commented Mar 16, 2022

Uh oh!

HuggingFaceDocBuilderDev commented Mar 16, 2022 •

edited

Loading

Uh oh!

Uh oh!

From tf becomes from h5 #3

From tf becomes from h5 #3

Uh oh!

Conversation

Narsil commented Mar 16, 2022

Uh oh!

HuggingFaceDocBuilderDev commented Mar 16, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Mar 16, 2022 •

edited

Loading