Add model card for MobileViT #40033

Shivamjan · 2025-08-08T13:49:15Z

What does this PR do?

This PR adds a detailed and beginner-friendly model card for MobileViT to the Hugging Face Transformers documentation. The previous model card was minimal and lacked clear explanations about the model architecture. This model retains several elements from the earlier version, as they remain applicable and effective for users.

The new version includes:

A clear explanation of the MobileViT architecture.
Notes on preprocessing and image format.
Clarifies how to use the model for classification and segmentation.
Highlights TensorFlow Lite compatibility for mobile use.
Primary references to the original paper and related resources.

Fixes # (issue)

Before submitting

[ x] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[ x] Did you read the contributor guideline,
Pull Request section?
[ x] Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
[ x] Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

Shivamjan · 2025-08-08T13:51:46Z

@stevhliu Please take a look at your convenience and do let me know if there is any further changes required.

stevhliu

Good start! Please check the model card format again as its missing Pipeline and AutoModel examples!

docs/source/en/model_doc/mobilevit.md

stevhliu · 2025-08-08T20:23:37Z

docs/source/en/model_doc/mobilevit.md

+Most of the models that uses Transformers for vision would first divide the images into several patches which are further flattened and converted into vectors. This causes in the loss of structural properties of an image, which isn't the case for CNNs. Now, this causes the Transformer models to go bigger and deeper to learn visual representations. 
+But MobileViT uses both convolutions and transformers in a way that the resultant block has convolution-like properties while simultaneously allowing for global interactions. This allows us to design a more shallow and narrow models, which are light-weight.
+
+![enter image description here](https://user-images.githubusercontent.com/67839539/136470152-2573529e-1a24-4494-821d-70eb4647a51d.png)


Can you upload the image here and then ping me to merge please :)

The image should be formatted like:

<div class="flex justify-center"> <img> </div>

Hey @stevhliu , I have uploaded the image and pinged you there, can you please check and merge it. I have also made the changes you suggested in the model card, let me know if everything looks good. I will add the image to the model card once you merge it. Thanks

docs/source/en/model_doc/mobilevit.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

Shivamjan added 2 commits August 8, 2025 19:01

Add model card for MobileViT

ede1745

Merge branch 'main' into add-mobilevit-model-card

3482fdc

stevhliu mentioned this pull request Aug 8, 2025

[Community contributions] Model cards #36979

Open

stevhliu reviewed Aug 8, 2025

View reviewed changes

Shivamjan and others added 8 commits August 9, 2025 09:19

Update docs/source/en/model_doc/mobilevit.md

0d60ed1

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

Update docs/source/en/model_doc/mobilevit.md

352ce2d

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

Update docs/source/en/model_doc/mobilevit.md

3c5f262

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

Update docs/source/en/model_doc/mobilevit.md

a25bc33

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

Update docs/source/en/model_doc/mobilevit.md

694f26d

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

Update mobilevit.md

f7012b7

Update mobilevit.md

5f15b28

Update mobilevit.md

d1a4df1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add model card for MobileViT #40033

Add model card for MobileViT #40033

Shivamjan commented Aug 8, 2025

Uh oh!

Shivamjan commented Aug 8, 2025 •

edited

Loading

Uh oh!

stevhliu left a comment

Uh oh!

Uh oh!

Uh oh!

stevhliu Aug 8, 2025

Uh oh!

Shivamjan Aug 9, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add model card for MobileViT #40033

Are you sure you want to change the base?

Add model card for MobileViT #40033

Conversation

Shivamjan commented Aug 8, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

Shivamjan commented Aug 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stevhliu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

stevhliu Aug 8, 2025

Choose a reason for hiding this comment

Uh oh!

Shivamjan Aug 9, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Shivamjan commented Aug 8, 2025 •

edited

Loading